Cloud Dataprep

Intelligent Data Preparation

Google Cloud Dataprep is an intelligent data service for visually exploring, cleaning, and preparing structured and unstructured data for analysis. Cloud Dataprep is serverless and works at any scale. There is no infrastructure to deploy or manage. Easy data preparation with clicks and no code.

Visual Interactivity, Ease of Use

Understand data instantly with visual data distributions. With each gesture in the UI Dataprep suggests and predicts your next ideal data transformation so you don’t have to write code.

Fast Data Preparation

Cloud Dataprep automatically detects schemas, datatypes, possible joins and anomalies such as missing values, outliers, and duplicates so you get to skip the time consuming work of profiling your data and go right to the data analysis.

Fully Managed and Powerful

Cloud Dataprep is an integrated partner service that is operated by another company, Trifacta. We work closely with Trifacta to provide a seamless user experience that removes the need for upfront software installation, separate licensing costs, or ongoing operational overhead. The service scales on demand to meet your growing data preparation needs so that you can stay focused on analysis.


Instant Data Exploration

Visually explore and interact with data in seconds. Instantly understand data distribution and patterns. You don't need to write code. You can prepare data with a few clicks.

Intelligent Data Cleansing

Cloud Dataprep automatically identifies data anomalies and helps you to take corrective actions fast. Get data transformation suggestions based on your usage pattern. Standardize, structure, and join datasets easily with a guided approach.


Cloud Dataprep is a serverless service, so you do not need to create or manage infrastructure. This helps you to keep your focus on the data preparation and analysis.

Seriously Powerful

Cloud Dataprep is built on top of the powerful Google Cloud Dataflow service. Cloud Dataprep is auto-scalable and can easily handle processing massive data sets.

Supports Common Data Sources of Any Size

Process diverse datasets - structured and unstructured. Transform data stored in CSV, JSON, or relational table formats. Prepare datasets of any size, megabytes to terabytes, with equal ease.

Integrated with Google Cloud Platform

Easily process data stored in Google Cloud Storage, Google BigQuery or from your desktop. Export clean data directly into BigQuery for further analysis. Seamlessly manage user access and data security with Google Cloud Identity and Access Management.