# DataFramer ## Docs - [API & MCP](https://docs.dataframer.ai/api-and-mcp.md): Two ways to use DataFramer outside the UI: the Python SDK and MCP (Model Context Protocol). - [Create an Anonymization Run](https://docs.dataframer.ai/api-reference/anonymization-runs/create.md): Start a new anonymization run to detect and mask sensitive data on your data - [Delete Anonymization Run](https://docs.dataframer.ai/api-reference/anonymization-runs/delete.md): Soft-delete an anonymization run so it no longer appears in list views - [Download All Anonymized Files](https://docs.dataframer.ai/api-reference/anonymization-runs/download-all.md): Get a presigned URL to download all anonymized files as a ZIP archive - [Download Anonymized File](https://docs.dataframer.ai/api-reference/anonymization-runs/download-file.md): Get a presigned URL to download a single anonymized file from a completed run - [Get Anonymization Run](https://docs.dataframer.ai/api-reference/anonymization-runs/get.md): Retrieve an anonymization run by ID. Also use this endpoint to poll for completion. - [List Anonymization Runs](https://docs.dataframer.ai/api-reference/anonymization-runs/list.md): List all anonymization runs for your company, newest first - [Rotate API key](https://docs.dataframer.ai/api-reference/api-keys/rotate-api-key.md): Programmatically rotate your API key. Authenticates using your current API key as a Bearer token (not JWT). The current key is immediately revoked and a new key is returned. - [Create seed dataset](https://docs.dataframer.ai/api-reference/datasets/create.md): Create a new seed dataset with uploaded files - [Create seed dataset from ZIP](https://docs.dataframer.ai/api-reference/datasets/create-from-zip.md): Create a seed dataset by uploading a ZIP file - [Delete seed dataset](https://docs.dataframer.ai/api-reference/datasets/delete.md): Delete a seed dataset and all its files - [Get seed dataset](https://docs.dataframer.ai/api-reference/datasets/get.md): Retrieve a specific seed dataset by ID - [List seed datasets](https://docs.dataframer.ai/api-reference/datasets/list.md): Get all seed datasets for the user's company - [Create evaluation](https://docs.dataframer.ai/api-reference/evaluations/create.md): Start a new evaluation for a completed run - [Get evaluation](https://docs.dataframer.ai/api-reference/evaluations/get.md): Retrieve full evaluation details including distribution analysis - [List evaluations for a run](https://docs.dataframer.ai/api-reference/evaluations/list.md): Retrieve all evaluations for a specific run - [Download seed file](https://docs.dataframer.ai/api-reference/files/download.md): Get a URL for downloading a seed dataset file - [Create red team run](https://docs.dataframer.ai/api-reference/red-teaming/create-run.md): Create a new red team run to generate adversarial prompts - [Create red team spec](https://docs.dataframer.ai/api-reference/red-teaming/create-spec.md): Create a new red team specification - [Delete red team run](https://docs.dataframer.ai/api-reference/red-teaming/delete-run.md): Delete a red team run - [Delete red team spec](https://docs.dataframer.ai/api-reference/red-teaming/delete-spec.md): Delete a red team spec - [Get red team run](https://docs.dataframer.ai/api-reference/red-teaming/get-run.md): Get a specific red team run - [Get red team spec](https://docs.dataframer.ai/api-reference/red-teaming/get-spec.md): Get a specific red team spec - [List red team runs](https://docs.dataframer.ai/api-reference/red-teaming/list-runs.md): Get all red team runs for your company - [List red team specs](https://docs.dataframer.ai/api-reference/red-teaming/list-specs.md): Get all red team specs - [Get red team run status](https://docs.dataframer.ai/api-reference/red-teaming/run-status.md): Get the current status of a red team run - [Update red team spec](https://docs.dataframer.ai/api-reference/red-teaming/update-spec.md): Update a red team spec - [Cancel run](https://docs.dataframer.ai/api-reference/runs/cancel.md): Cancel a running or pending generation job - [Delete run](https://docs.dataframer.ai/api-reference/runs/delete.md): Delete a run and its generated files - [Download all generated files](https://docs.dataframer.ai/api-reference/runs/download-all.md): Download all generated files from a run as a single ZIP archive - [Download generated file](https://docs.dataframer.ai/api-reference/runs/download-file.md): Get presigned URL for downloading a generated file - [Start a new generation run](https://docs.dataframer.ai/api-reference/runs/generate.md): Start a new data generation run - [Get run](https://docs.dataframer.ai/api-reference/runs/get.md): Retrieve a specific run by ID - [List runs](https://docs.dataframer.ai/api-reference/runs/list.md): Get all runs for the user's company, ordered by creation time (newest first) - [Create spec](https://docs.dataframer.ai/api-reference/specs/create.md): Generate a new spec (async operation) - [Delete spec](https://docs.dataframer.ai/api-reference/specs/delete.md): Delete a spec by ID - [Get spec](https://docs.dataframer.ai/api-reference/specs/get.md): Retrieve a specific spec by ID - [List specs](https://docs.dataframer.ai/api-reference/specs/list.md): Retrieve information about all existing specs - [Update spec](https://docs.dataframer.ai/api-reference/specs/update.md): Replace a spec with a new version specified using YAML - [Core Concepts](https://docs.dataframer.ai/concepts.md): Understand how DataFramer generates synthetic data - [Data Anonymization](https://docs.dataframer.ai/detection-anonymization-augmentation.md): Detect PII, PHI, financial data, identity documents, and more—then anonymize or augment your datasets using AI models and pattern-based rules - [What is DataFramer?](https://docs.dataframer.ai/index.md): Generate realistic, diverse synthetic datasets at scale — from example data or a text description - [Databricks integration](https://docs.dataframer.ai/integrations/databricks.md): Overview of the DataFramer - Databricks integration. DataFramer is a Validated Databricks Partner. - [Multi-File & Multi-Folder](https://docs.dataframer.ai/integrations/databricks-multi-file.md): Generate high-quality synthetic multi-file and multi-folder datasets within Databricks using DataFramer. - [Single File](https://docs.dataframer.ai/integrations/databricks-single-file.md): Generate high-quality synthetic single-file datasets within Databricks using DataFramer. - [Quickstart](https://docs.dataframer.ai/quickstart.md): Generate your first synthetic dataset in 5 minutes - [Release Notes](https://docs.dataframer.ai/releases/release-notes.md) - [Basic Use of Python SDK](https://docs.dataframer.ai/tutorials/basic-python-workflow.md): End-to-end example: create a spec from a description, generate samples, and download results. - [Financial Bank Statement Extraction with Golden Labels](https://docs.dataframer.ai/tutorials/financial-bank-statement-extraction.md): Generate synthetic financial statement PDFs with co-located Q&A golden labels using a multi-folder seed dataset, then evaluate an LLM's financial data extraction accuracy on the generated documents. - [Fine-Tuning with Synthetic Training Data](https://docs.dataframer.ai/tutorials/finetuning-with-synthetic-data.md): Use DataFramer to generate diverse training data from a handful of examples, then fine-tune an open-source LLM and evaluate style transfer. - [Fraud Detection: Labeled Training Data for Impossible Geographic Jumps](https://docs.dataframer.ai/tutorials/fraud-detection-geo-jump.md): Go from a 10-row unlabeled credit card transaction seed to 100s or 1000s of fraud-labeled training samples by teaching DataFramer a concept that never existed in your data. - [Insurance Underwriting Model Drift Detection](https://docs.dataframer.ai/tutorials/insurance-underwriting-model-drift-detection.md): Simulate a shifted life-insurance applicant population with DataFramer, score it with a frozen underwriting model, and inspect whether the change looks like true risk or calibration drift. - [Folder Generation](https://docs.dataframer.ai/tutorials/multi-folder-workflow.md): Complete guide to working with multi-folder datasets using the DataFramer SDK - [PII/PHI Anonymization](https://docs.dataframer.ai/tutorials/pii-phi-anonymization.md): Detect and mask Personally Identifiable Information and Protected Health Information in your datasets using the DataFramer SDK - [Support Chatbot Broader Evaluations with Contextual Eval Sets](https://docs.dataframer.ai/tutorials/support-chatbot-broader-evals.md): Go from 11 hand-labelled seed rows to a 50-sample (1000s if needed) eval set — complete with golden labels — by using DataFramer to generate targeted edge-case interactions for your support chatbot. - [Complete Generation Guide](https://docs.dataframer.ai/workflow.md): Step-by-step walkthrough of all DataFramer features ## OpenAPI Specs - [openapi.with-code-samples](https://app.stainless.com/api/spec/documented/dataframer/openapi.with-code-samples.json) ## Optional - [Support](mailto:info@dataframer.ai)