Estimated time: 30 minutes Build a data pipeline using Prophecy Agent. This tutorial walks you through exploring data, creating visualizations, and building transformations using natural language prompts. Follow the steps below to build a patient analytics pipeline.Documentation Index
Fetch the complete documentation index at: https://docs.prophecy.ai/llms.txt
Use this file to discover all available pages before exploring further.
Prerequisites
To complete this quickstart, you need a Prophecy fabric that uses Prophecy In Memory or Databricks as the compute engine.Set up a new project
First, you need to create the project where you will build your pipeline. You’ll also need to add data to the project for this quickstart.Create a project
- Click on the Create Entity button in the left navigation bar.
- Hover over the Project tile and click Create.
-
Give your project a Name, such as
Prophecy_Quickstart. - Under Team, select your personal team. (It will match your user email.)
- Under Select Template, choose Prophecy for Analysts.
- Click Complete.
Connect to the execution environment
- Select the default fabric or your own fabric.
- Click Save.
Add a new pipeline
- Minimize Agent chat to show the project landing page.
- On the project landing page, click Create Pipeline.
- For the Pipeline Name, enter
patient_analytics. - Leave the default Directory Path of
pipelines. Prophecy saves your compiled pipeline code in this folder of the project repository. - Click Create.
Add data to your project
Next, you’ll add some data to the project so the Agent can find and transform it.Create the seed data file
- Open the Source/Target gem category.
- Click Table. This adds a new Table gem to the canvas.
- Hover over the gem and click Open.
- Select + New Table.
- For the Type and Format, choose Seed.
-
Name the seed
patients_raw_data. - For the Seed path, choose seeds. Prophecy saves your seed file in this folder of the project repository.
- Click Next.
-
In the Properties tab, paste the following data.
patients_raw_data.csv
- Click Next.
- Click Load Data to preview the data in tabular format.
- Click Save.
Run the seed
Reindex your connection
- Open the Environment tab in the left sidebar.
- Below your connections, you’ll see a Missing Tables? callout.
- Click Refresh to trigger the knowledge graph indexer.
Explore the data
Now that you have data in your project, you can explore it using the Agent.Access the Agent
Explore your data
- A short list of relevant datasets with descriptions.
- A full list of matching datasets.
- The option to add datasets directly to your pipeline on hover.
-
Click on
patients_raw_data(the data you uploaded) in the chat to open a detailed preview dialog where you can:- View the table location.
- Examine the schema and column structure.
- Preview sample data.
- Review data profiles.
- Open an Explore session for dataset-specific queries.
- Close the preview dialog to return to the chat.
Get data samples
@patients_raw_data with your actual table path:- A table with the requested data.
- An option to preview the table for detailed examination.
- An option to add the table to your pipeline.
- SQL execution logs showing the query used.
Create visualizations
- An embedded chart in the chat showing patient counts by city.
- An option to preview the table for detailed examination.
- An option to add the table to your pipeline.
- SQL execution logs showing the query used.
- Visualization tab: View larger charts and download charts as images.
- Data tab: Examine underlying data and download data as JSON/Excel/CSV.
Build your pipeline
Now, you’ll build your pipeline transformations using the Agent. Keep the Visual view open (rather than the Code view) to see updates in real-time as the Agent adds or modifies gems on the canvas.Build your first transformation
@patient_records with the actual gem label from your canvas if it differs.The Agent will:- Add the appropriate gem(s) to your pipeline canvas. This prompt should produce an Aggregate gem.
- Execute the pipeline, generating data samples that you can review.
- Provide a description of the changes made.
- Show options to inspect, preview, or restore changes.
- Display SQL execution logs.
Inspect pipeline changes
- Click Inspect on the transformation response.
- Review the configuration panel starting with the first modified gem (highlighted in yellow).
- Use the Previous and Next buttons to navigate through modified gems.
- Examine input and output data to verify the transformation produces expected results.
- Understand the Agent’s approach.
- Verify that the transformation logic matches your expectations.
Add another transformation
1,544, Middlesex around 2,410, Worcester around 1,800, and Hampden 3,800.50.Save your results
Explore further
Try these additional tasks to extend your pipeline.Join with additional data
county_info with the following content:Connect your own data
This quickstart uses a Seed file as the source data. When you start building your own pipelines, you’ll likely want to use your own data from files or external systems. You can do this by:- Uploading files directly from your local filesystem using the upload file feature.
- Ingesting data from external systems using connections.
Sample prompts reference
Use these prompts as templates for your own pipelines.| Task | Prompt Example |
|---|---|
| Find data | Find datasets containing customer information |
| Sample data | Show me 5 random records from @sales_data |
| Filter data | Filter to only include orders from 2024 |
| Transform data | Calculate total revenue as quantity * price |
| Join data | Join the orders and customers tables |
| Parse data | Extract the fields from json_data as columns — This works best if you provide a sample JSON object. |
| Clean data | Remove rows where email is null |
| Aggregate data | Group by region and calculate average sales |
| Visualize data | Create a bar chart of monthly sales |
| Save results | Save the final output as a table |
Tips
Be specific with your requests
Be specific with your requests
Clean the data → Try Remove duplicate customer records and fill null values in the email columnWhy it helps: Reduces ambiguity so the Agent applies the correct operations.Use step-by-step instructions
Use step-by-step instructions
Aggregate orders by customer → Join with customers → Filter for orders made in 2024Why it helps: Improves reliability and makes it easier to debug or adjust each step.Review changes before proceeding
Review changes before proceeding
Leverage both the Agent and visual interface
Leverage both the Agent and visual interface
Troubleshooting
Agent can't find tables
Agent can't find tables
Conversation feels stuck or off track
Conversation feels stuck or off track
Results don't match expected values
Results don't match expected values

