What are Amorphic Datasets, Domains and Views?

What are Datasets❓

  • What if you could add files to a folder that automatically pushes the data to Redshift, AuroraMySQL, or S3Athena?
  • What if new files added to a folder could either append to or reload the data in Redshift, AuroraMySQL, or S3Athena?
  • What if you could query the files added to a folder using a query engine or an automatically generated JDBC connection string?
  • What if the data were profiled automatically as you add it to a folder?
  • What if data quality checks ran automatically on files as they are added to a folder?
  • What if malware were detected and quarantined automatically as you add files to a folder?
  • What if files were deleted automatically once they pass an expiry date set on the folder?
  • What if you could apply AI/ML (ML Model, Entity Recognizer) to files (structured or unstructured) in a folder and view the results?
  • What if you could enable deep search on the files in a folder using Amazon Kendra?
  • What if file formats were converted automatically (for example, xlsx to csv) as you drop files into a folder?
  • What if access to the files in a folder were controlled automatically for a specific user or group?

Too good to be true? That magical folder exists, and in Amorphic we call it a Dataset. 🤩

What are Domains?

  • Each Dataset is registered to a Domain.
  • A Domain is the equivalent of a database or schema in the database world.
  • A Domain can represent a business area, an industry, a purpose, or any other grouping.
  • Domains help users organize and separate different Datasets.
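
The database/schema analogy above can be sketched with Python's built-in sqlite3 module. This is only an illustration of the namespace idea, not Amorphic's API: the domain names `sales` and `hr`, the table names, and the helper `datasets_in` are all hypothetical, and attached SQLite databases stand in for Domains.

```python
import sqlite3

# Illustration only: attached SQLite databases stand in for Domains,
# and tables stand in for Datasets. All names here are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("ATTACH DATABASE ':memory:' AS sales")
conn.execute("ATTACH DATABASE ':memory:' AS hr")

# Each "Dataset" is registered to exactly one "Domain" (schema).
conn.execute("CREATE TABLE sales.orders (id INTEGER, amount REAL)")
conn.execute("CREATE TABLE hr.employees (id INTEGER, name TEXT)")
conn.execute("INSERT INTO sales.orders VALUES (1, 19.5)")


def datasets_in(domain: str) -> list[str]:
    """List the tables ("Datasets") registered under one schema ("Domain")."""
    rows = conn.execute(
        f"SELECT name FROM {domain}.sqlite_master WHERE type = 'table'"
    )
    return [row[0] for row in rows]


sales_datasets = datasets_in("sales")
hr_datasets = datasets_in("hr")
order_count = conn.execute("SELECT COUNT(*) FROM sales.orders").fetchone()[0]
```

Referring to a table as `sales.orders` rather than `orders` is what keeps Datasets in different Domains from colliding, which is the separation the list above describes.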

What are Views?

  • Amorphic Views are created to restrict data access to a specific group of people or to build reports.
  • Views are created on top of Datasets.
  • Views can be materialized or standard. Materialized views can be refreshed manually or automatically.
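
The difference between standard and materialized views can be sketched with Python's built-in sqlite3 module. This is a generic illustration under stated assumptions, not how Amorphic implements Views: the `customers` table and all other names are hypothetical, and because SQLite has no native materialized views, a snapshot table rebuilt by a hypothetical `refresh_country_counts` helper stands in for one.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER, name TEXT, email TEXT, country TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?, ?, ?)",
    [
        (1, "Ana", "ana@example.com", "PT"),
        (2, "Ben", "ben@example.com", "US"),
        (3, "Caro", "caro@example.com", "US"),
    ],
)

# Standard view: a stored query that is re-evaluated on every read.
# It restricts access by exposing every column except email.
conn.execute(
    "CREATE VIEW customers_public AS SELECT id, name, country FROM customers"
)


def refresh_country_counts(conn: sqlite3.Connection) -> None:
    """Emulate a materialized view by rebuilding a snapshot table."""
    conn.execute("DROP TABLE IF EXISTS customers_by_country")
    conn.execute(
        "CREATE TABLE customers_by_country AS "
        "SELECT country, COUNT(*) AS n FROM customers GROUP BY country"
    )


refresh_country_counts(conn)
conn.execute("INSERT INTO customers VALUES (4, 'Dee', 'dee@example.com', 'PT')")

# The standard view sees the new row immediately.
public_columns = [
    d[0] for d in conn.execute("SELECT * FROM customers_public").description
]
public_count = conn.execute("SELECT COUNT(*) FROM customers_public").fetchone()[0]

# The snapshot stays stale until it is refreshed.
stale = dict(conn.execute("SELECT country, n FROM customers_by_country"))
refresh_country_counts(conn)
fresh = dict(conn.execute("SELECT country, n FROM customers_by_country"))
```

In this sketch the standard view always reflects the underlying table, while the snapshot only changes when `refresh_country_counts` runs, which corresponds to a manual refresh; an automatic refresh would call the same function on a schedule.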

Go to the next topic to create a Domain and Dataset.