Generate synthetic data samples based on a spec using AI models.
The generation process:
Supports two sample types:
API Key authentication. Format: "Bearer YOUR_API_KEY"
ID of the spec to use for generation
Number of samples to generate
1 <= x <= 20000AI model to use for generation
anthropic/claude-sonnet-4-5, anthropic/claude-sonnet-4-5-thinking, anthropic/claude-haiku-4-5, deepseek-ai/DeepSeek-V3.1, moonshotai/Kimi-K2-Instruct, openai/gpt-oss-120b, deepseek-ai/DeepSeek-R1-0528-tput, Qwen/Qwen2.5-72B-Instruct-Turbo Specific version ID to use (optional, defaults to latest version)
AI model for evaluation (short samples only)
anthropic/claude-sonnet-4-5, anthropic/claude-sonnet-4-5-thinking, anthropic/claude-haiku-4-5, deepseek-ai/DeepSeek-V3.1, moonshotai/Kimi-K2-Instruct, openai/gpt-oss-120b, deepseek-ai/DeepSeek-R1-0528-tput, Qwen/Qwen2.5-72B-Instruct-Turbo AI model for outline generation (long samples only)
anthropic/claude-sonnet-4-5, anthropic/claude-sonnet-4-5-thinking, anthropic/claude-haiku-4-5, deepseek-ai/DeepSeek-V3.1, moonshotai/Kimi-K2-Instruct, openai/gpt-oss-120b, deepseek-ai/DeepSeek-R1-0528-tput, Qwen/Qwen2.5-72B-Instruct-Turbo AI model for revisions (long samples only)
anthropic/claude-sonnet-4-5, anthropic/claude-sonnet-4-5-thinking, anthropic/claude-haiku-4-5, deepseek-ai/DeepSeek-V3.1, moonshotai/Kimi-K2-Instruct, openai/gpt-oss-120b, deepseek-ai/DeepSeek-R1-0528-tput, Qwen/Qwen2.5-72B-Instruct-Turbo Enable revision cycles
Type of samples to generate
short, long Max feedback iterations (short samples only)
0 <= x <= 20Use staged generation approach (short samples only)
Use historical feedback (short samples only)
Number of examples to include in prompt (short samples only)
1 <= x <= 50Max revision cycles (long samples only)
1 <= x <= 5Thinking budget for generation model (tokens)
x >= 1024Thinking budget for evaluation model (tokens, short samples)
x >= 1024Thinking budget for outline model (tokens, long samples)
x >= 1024Thinking budget for revision model (tokens, long samples)
x >= 1024Seed shuffling level for long samples. Controls trade-off between prompt caching efficiency and data diversity.
none, sample, field, prompt SQL validation level for long samples with SQL content
syntax, syntax+schema, syntax+schema+execute Maximum number of seed examples to include in prompts (long samples only). If not set, all seeds are used (subject to token limits).
x >= 1