AWS Launches Glue Data Quality Service to Improve Data Quality
AWS has made the Glue Data Quality service available to help companies optimize their data quality across data lakes and pipelines. According to AWS, when creating data lakes, many companies do not pay close attention to the quality of the data contained therein, resulting in ‘data swamps’.
Improving Data Quality
Improving data quality is often a difficult and lengthy process for engineers, due to the manual and meticulous sifting through the data, formulating the data quality requirements and coding for alerts for deteriorating data quality.
AWS Glue Data Quality Service
To make data quality monitoring more efficient and prevent potential negative problems for business use, AWS is launching its AWS Glue Data Quality service. The service automatically calculates statistics, provides examples for quality rules, monitors data and sends alerts if it detects that quality is deteriorating.
The new service is a serverless feature of the AWS Glue service and also takes care of infrastructure management and maintenance. Users can access the service through various platforms, such as the AWS Glue Data Catalog, Glue Studio, and Glue Studio notebooks, as well as from their preferred code editors.