Start a new data generation run
GET /api/dataframer/runs/{id}/ until status changes from PENDING/IN_PROGRESS to SUCCEEDED or FAILED.API Key authentication. Format: "Bearer YOUR_API_KEY"
Request body for creating a generation run
ID of the spec to use for generation. Spec must be in SUCCEEDED status.
Number of samples to generate
1 <= x <= 20000Version number to use (optional, defaults to latest)
x >= 1Model for generation. Use -thinking suffix to enable thinking mode. For databricks/ models, you must also provide databricks_client_id, databricks_client_secret, and databricks_api_base.
anthropic/claude-opus-4-6, anthropic/claude-opus-4-6-thinking, anthropic/claude-sonnet-4-6, anthropic/claude-sonnet-4-6-thinking, anthropic/claude-haiku-4-5, anthropic/claude-haiku-4-5-thinking, deepseek-ai/DeepSeek-V3.1, moonshotai/Kimi-K2-Instruct, openai/gpt-oss-120b, deepseek-ai/DeepSeek-R1-0528-tput, Qwen/Qwen2.5-72B-Instruct-Turbo, gemini/gemini-3-pro-preview, gemini/gemini-3-pro-preview-thinking, databricks/databricks-claude-3-7-sonnet, databricks/databricks-claude-haiku-4-5, databricks/databricks-claude-opus-4-1, databricks/databricks-claude-opus-4-5, databricks/databricks-claude-opus-4-6, databricks/databricks-claude-sonnet-4, databricks/databricks-claude-sonnet-4-5, databricks/databricks-gemini-2-5-flash, databricks/databricks-gemini-2-5-pro, databricks/databricks-gemini-3-flash, databricks/databricks-gemini-3-pro, databricks/databricks-gpt-5 Model for outline generation
anthropic/claude-opus-4-6, anthropic/claude-opus-4-6-thinking, anthropic/claude-sonnet-4-6, anthropic/claude-sonnet-4-6-thinking, anthropic/claude-haiku-4-5, anthropic/claude-haiku-4-5-thinking, deepseek-ai/DeepSeek-V3.1, moonshotai/Kimi-K2-Instruct, openai/gpt-oss-120b, deepseek-ai/DeepSeek-R1-0528-tput, Qwen/Qwen2.5-72B-Instruct-Turbo, gemini/gemini-3-pro-preview, gemini/gemini-3-pro-preview-thinking, databricks/databricks-claude-3-7-sonnet, databricks/databricks-claude-haiku-4-5, databricks/databricks-claude-opus-4-1, databricks/databricks-claude-opus-4-5, databricks/databricks-claude-opus-4-6, databricks/databricks-claude-sonnet-4, databricks/databricks-claude-sonnet-4-5, databricks/databricks-gemini-2-5-flash, databricks/databricks-gemini-2-5-pro, databricks/databricks-gemini-3-flash, databricks/databricks-gemini-3-pro, databricks/databricks-gpt-5 Model for revisions and filtering (only used if revision_types or filtering_types is set)
anthropic/claude-opus-4-6, anthropic/claude-opus-4-6-thinking, anthropic/claude-sonnet-4-6, anthropic/claude-sonnet-4-6-thinking, anthropic/claude-haiku-4-5, anthropic/claude-haiku-4-5-thinking, deepseek-ai/DeepSeek-V3.1, moonshotai/Kimi-K2-Instruct, openai/gpt-oss-120b, deepseek-ai/DeepSeek-R1-0528-tput, Qwen/Qwen2.5-72B-Instruct-Turbo, gemini/gemini-3-pro-preview, gemini/gemini-3-pro-preview-thinking, databricks/databricks-claude-3-7-sonnet, databricks/databricks-claude-haiku-4-5, databricks/databricks-claude-opus-4-1, databricks/databricks-claude-opus-4-5, databricks/databricks-claude-opus-4-6, databricks/databricks-claude-sonnet-4, databricks/databricks-claude-sonnet-4-5, databricks/databricks-gemini-2-5-flash, databricks/databricks-gemini-2-5-pro, databricks/databricks-gemini-3-flash, databricks/databricks-gemini-3-pro, databricks/databricks-gpt-5 List of revision types to apply. Valid values: 'coherence_flow' (fix formatting/flow issues), 'consistency' (fix internal contradictions), 'distinguishability' (seeded only — blend with seed style), 'conformance' (verify property compliance). Defaults to empty (no revisions).
coherence_flow, consistency, distinguishability, conformance List of filtering quality gates. Valid values: 'structural' (reject severe format issues), 'conformance' (reject property violations). Documents that fail are regenerated. Defaults to empty (no filtering).
structural, conformance Maximum number of revision cycles. 2-3 is a solid pick for complex documents requiring internal consistency, e.g. financial reports, invoices, etc.; increase to 3-5 for highest quality or when generated data has issues.
1 <= x <= 5(advanced) How to shuffle seed examples between samples
none, sample, field, prompt (advanced) Maximum number of seed examples to include in prompts. By default, only as many seeds as fit in 10K tokens are used. Use this to override the default.
x >= 1(advanced) Use unified multifield generation. This helps to reduce the generation cost by processing all fields together rather than one by one.
Token budget for extended thinking during generation. Only applies to models with -thinking suffix.
x >= 1024Token budget for extended thinking during outline generation. Only applies to models with -thinking suffix.
x >= 1024Token budget for extended thinking during revisions. Only applies to models with -thinking suffix.
x >= 1024Databricks service principal application (client) ID. Required when using databricks/models.
Databricks service principal secret. Required when using databricks/models.
Databricks Model Serving endpoint URL (e.g. https://adb-xxx.azuredatabricks.net/serving-endpoints). Required when using databricks/models.
Skip outline and part-by-part generation. Generates a document draft directly in a single call. Faster and cheaper but not suitable for very long documents.
List of tools available to the LLM during generation. Currently supported: 'calculator' (sandboxed Python for numerical verification). Roughly doubles cost and time. Defaults to empty.
Use revision_types and filtering_types instead. When set to true without those fields, enables all revision types and structural filtering.
Run created and submitted
Run ID for polling status