
Introduction

DataStori is a SaaS application that automates the ingestion and storage of data from cloud-based business applications. It builds data pipelines from the user's source applications to their preferred data stores, and runs them on a schedule or on demand. DataStori is hosted in AWS US. Key features of DataStori:

  • Data Security: DataStori runs data pipelines from source applications and stores the output data in the customer's cloud. Customer data never leaves the customer's cloud, whether during processing or in storage.

  • Add New Applications on the Fly: DataStori generates data pipelines dynamically from API documentation, emails, SFTP folders and SQL database connections.

  • Data Testing: DataStori runs automated tests on your data to ensure data quality.

  • Data Snapshots: DataStori versions the data and provides the ability to roll back to previous versions.

  • Data Schema Detection and Evolution: DataStori automatically defines the schema of stored data based on the API responses, CSV files or database tables from source applications. When the source schema changes, the stored schema evolves with it and the changes are tracked automatically. Both data type changes and column additions or deletions are tracked.
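To make the schema detection and evolution feature more concrete, here is a minimal Python sketch of the general idea: infer a column-to-type mapping from a batch of records, then compare two such mappings to find added columns, removed columns and changed data types. This is an illustration only and not DataStori's implementation; the function names `infer_schema` and `diff_schemas`, the fallback-to-string rule and the sample records are all assumptions made for this example.

```python
from typing import Any


def infer_schema(records: list[dict[str, Any]]) -> dict[str, str]:
    """Map each column name to the name of the Python type seen for it."""
    schema: dict[str, str] = {}
    for record in records:
        for column, value in record.items():
            if value is None:
                # A null carries no type information; keep any known type.
                schema.setdefault(column, "unknown")
                continue
            type_name = type(value).__name__
            known = schema.get(column)
            if known in (None, "unknown"):
                schema[column] = type_name
            elif known != type_name:
                # Conflicting types within one batch: fall back to string
                # (an assumption of this sketch, not a documented rule).
                schema[column] = "str"
    return schema


def diff_schemas(old: dict[str, str], new: dict[str, str]) -> dict[str, Any]:
    """Report added columns, removed columns, and changed data types."""
    return {
        "added": sorted(new.keys() - old.keys()),
        "removed": sorted(old.keys() - new.keys()),
        "type_changed": {
            column: (old[column], new[column])
            for column in sorted(old.keys() & new.keys())
            if old[column] != new[column]
        },
    }


# Hypothetical sample data: two runs of the same pipeline.
yesterday = infer_schema([{"id": 1, "amount": 10.5, "region": "EU"}])
today = infer_schema([{"id": 2, "amount": "10.50", "currency": "USD"}])
changes = diff_schemas(yesterday, today)
print(changes)
```

In this sample, the comparison reports `currency` as an added column, `region` as a removed column, and `amount` as a type change from `float` to `str`. A real pipeline would also need to persist each schema version so that changes can be tracked over time.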

Read the next few sections to understand how DataStori works.