The Knowledge Graph indexer retrieves metadata from your SQL warehouse and external data storage systems to build and maintain the metadata index that powers AI features in Prophecy. Prophecy automatically indexes your data environment when you create a fabric using your default credentials. After the initial run, you can configure indexing behavior to control when and how the indexer runs. This page covers:Documentation Index
Fetch the complete documentation index at: https://docs.prophecy.ai/llms.txt
Use this file to discover all available pages before exploring further.
- Scheduling automatic indexing
- Triggering manual runs
- Configuring separate authentication credentials for the indexer
How indexing works
The Knowledge Graph indexer processes each connection in your fabric separately. For each indexing run, the indexer:- Uses the credentials stored in the connection to authenticate with the external system.
- Retrieves metadata only for databases, schemas, tables, and storage locations that the configured identity is authorized to access.
- Indexes metadata such as table names, schemas, column names, data types, and available object descriptions.
- Updates the Knowledge Graph with the latest metadata.
The Knowledge Graph stores metadata only. It does not store actual warehouse data values, sampled records, or summaries of table contents.
Knowledge graph dependency checklist
Before configuring or troubleshooting the indexer, verify that the following requirements are satisfied.| # | Item | Why it matters | Required? |
|---|---|---|---|
| 1 | KG indexer has run successfully | AI features rely on indexed metadata for dataset discovery and schema-aware assistance | Required |
| 2 | Knowledge Graph services are running and reachable | AI features require access to Knowledge Graph retrieval services | Required |
| 3 | KNOWLEDGE_GRAPH_BASE_URL configured in sql-sandbox-config-map | Enables AI services to access Knowledge Graph retrieval APIs | Required |
| 4 | CLAUDE_MODEL configured in sql-sandbox-config-map | Specifies the Anthropic model used for Knowledge Graph-powered AI assistance | Required |
| 5 | AI_DATA_ACCESS_CLUSTER_ENABLED flag state is known | Determines whether the AI agent can perform live data operations or operate in metadata-only mode | Required |
| 6 | Sufficient worker resources are available | Indexing operations require adequate compute and memory resources | Required for indexing |
| 7 | KG indexing completed with SUCCESS status | Schema and dataset lookups depend on up-to-date metadata | Required |
| 8 | Incremental indexing schedule confirmed | Keeps metadata current as warehouse schemas evolve | Recommended |
Items 3 and 4 require updating the
sql-sandbox-config-map. See Knowledge graph configuration
for setup instructions.Configure automatic indexing
You can configure scheduled indexing to keep your Knowledge Graph up to date without manual intervention.- In Prophecy, open Metadata > Fabrics.
- Select the fabric where you wish to enable indexing.
- Open the Connections tab.
- Open the connection to be indexed.
- In the connection dialog, scroll to the Knowledge Graph Indexer tile and enable Knowledge Graph Periodic Indexing.
- Configure the schedule to run hourly, daily, or weekly.
Scheduling parameters
| Schedule type | Parameter | Description | Default |
|---|---|---|---|
| Hourly | Repeat every … from | The interval in hours between indexing runs, starting at a specific time. Example: Repeat every 2 hours from 12:00 AM. | Every 1 hour starting at 2:00 AM |
| Daily | Repeat at | The time of day when indexing runs. Example: Repeat at 9:00 AM. | 2:00 AM |
| Weekly | Repeat on | The day(s) of the week when indexing runs. Example: Repeat on Monday, Wednesday, Friday. | Sunday |
| Weekly | Repeat at | The time of day when indexing runs. Example: Repeat at 9:00 AM. | 2:00 AM |
Manually trigger indexing
You may need to manually trigger indexing if newly created tables or schemas are not yet available to AI features. To manually trigger indexing:- In Prophecy, open Metadata > Fabrics.
- Select the fabric you want to index.
- Open the Connections tab.
- Open the connection to be indexed.
- Scroll to the Knowledge Graph Indexing Status tile in the connection dialog.
- Click Start to begin indexing and monitor progress.
- Open a project in the project editor.
- Attach the fabric you want to index.
- Open the Environment tab in the left sidebar.
- Locate the Missing Tables? callout below your connections.
- Click Refresh.
Add separate authentication for the indexer
In some environments, administrators may want more granular control over which metadata is indexed into the Knowledge Graph. For Databricks connections, Prophecy supports configuring separate authentication credentials specifically for the Knowledge Graph indexer. There are two types of credentials stored in a connection:- Pipeline Development and Scheduled Execution credentials control how pipelines authenticate when they run.
- Knowledge Graph Indexer credentials control how the indexer authenticates when retrieving metadata on a schedule.
The Knowledge Graph indexer always uses the same identity as the pipeline development identity if the pipeline development authentication strategy is Personal Access Token (rather than OAuth). This section does not apply when using PAT authentication.
Prerequisites
Before configuring dedicated credentials for the Knowledge Graph indexer, you must:- Upgrade to Prophecy 4.2.2 or later.
- Configure your SQL warehouse connection with a Databricks connection. Other SQL warehouses are not currently supported for separate indexer authentication.
- Be a Prophecy administrator.
- Be a Databricks administrator with permission to assign appropriate access to the indexing identity.
Procedure
To configure separate authentication for the Knowledge Graph indexer:- In Prophecy, navigate to Metadata > Fabrics.
- Select the target fabric.
- Open the Connections tab.
- Edit the SQL Warehouse Connection.
- Scroll to the Knowledge Graph Indexer section.
-
Configure authentication based on your pipeline development authentication method:
- If you use User OAuth for **pipeline development, choose either OAuth (User) or OAuth (Service Principal) for the Knowledge Graph indexer.
- If you use Service Principal OAuth for **pipeline development, you can only use Service Principal OAuth for the Knowledge Graph indexer.
Service Principal OAuth (recommended)
Recommended for production and scheduled indexing because credentials do not expire.- Configuration: Reuse pipeline development credentials or provide a separate Service Principal Client ID and Client Secret.
- Indexed metadata: Metadata for all warehouse objects the service principal is authorized to access.
If pipeline development uses User OAuth, Prophecy continues to enforce user-level permissions even when the Knowledge Graph indexer uses service principal credentials.
User OAuth
Recommended primarily for development environments.- Configuration: Uses the same app registration as pipeline development.
- Indexed metadata: Metadata for warehouse objects the authenticated user is authorized to access.
- Limitations: Scheduled indexing can fail when user credentials expire or require reauthentication.

