
Common Issues

Authentication Errors

Cause: Missing or incorrect Authorization header
Solution:
# Correct format
curl -H "Authorization: Bearer YOUR_API_KEY" ...

# NOT these:
# Authorization: YOUR_API_KEY
# Bearer YOUR_API_KEY
Verify your API key is correct and not expired.
Cause: API key is invalid, revoked, or expired
Solution:
  1. Log in to https://app.aimon.ai
  2. Go to Account → API Key
  3. Generate a new API key
  4. Update your application with the new key
Cause: Exceeded rate limits (100 requests/minute)
Solution:
  • Implement exponential backoff
  • Add delays between requests
  • Cache responses when possible
  • Contact support for higher limits
import requests
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry

# Retry up to 3 times on rate-limit and transient server errors,
# doubling the wait between attempts (backoff_factor=2)
retry_strategy = Retry(
    total=3,
    status_forcelist=[429, 500, 502, 503, 504],
    backoff_factor=2
)

session = requests.Session()
session.mount("https://", HTTPAdapter(max_retries=retry_strategy))

Dataset Issues

Possible causes:
  • File too large (>100 MB)
  • File corrupted
  • Incorrect encoding (not UTF-8)
  • Unsupported file format
Solutions:
  • Check file size: ls -lh yourfile.csv
  • Verify file opens correctly
  • Convert to UTF-8: iconv -f ISO-8859-1 -t UTF-8 input.csv > output.csv
  • Ensure correct file extension
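The size and encoding checks above can also be scripted. A minimal sketch in Python (the 100 MB limit and UTF-8 requirement are taken from the list above; the function name is illustrative):

```python
from pathlib import Path

MAX_BYTES = 100 * 1024 * 1024  # 100 MB per-file limit

def check_dataset_file(path):
    """Return a list of problems found with a dataset file."""
    problems = []
    p = Path(path)
    # Reject files over the documented 100 MB limit
    if p.stat().st_size > MAX_BYTES:
        problems.append("file larger than 100 MB")
    # Reject files that are not valid UTF-8
    try:
        p.read_bytes().decode("utf-8")
    except UnicodeDecodeError:
        problems.append("file is not valid UTF-8")
    return problems
```

Running this over each file before upload catches the two most common rejection causes early.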
Cause: Incorrect folder structure in ZIP
Solution:
  • Folders must be at ZIP root, not nested
  • No empty folders
  • No files at root level
# Correct way to create ZIP
cd your_data_folder
zip -r ../data.zip .

# Verify structure
unzip -l ../data.zip
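The layout rule above (folders at the ZIP root, no loose files at root) can be verified programmatically as well; a small sketch using Python's standard zipfile module:

```python
import zipfile

def check_zip_layout(zip_path):
    """Flag files sitting at the ZIP root; every file should live
    inside a top-level folder, per the layout rules above."""
    problems = []
    with zipfile.ZipFile(zip_path) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue
            # A filename with no "/" is at the archive root
            if "/" not in info.filename:
                problems.append(f"file at ZIP root: {info.filename}")
    return problems
```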
Problem: Created SINGLE_FILE but need MULTI_FILE
Solution:
  • Dataset type cannot be changed after creation
  • Delete the dataset
  • Create new dataset with correct type
  • Re-upload files

Analysis & Specification Issues

Normal duration: 2-10 minutes
If longer than 15 minutes:
  • Check status endpoint for errors
  • Verify dataset files are not corrupt
  • Try with smaller dataset first
  • Contact support if persistent
Common causes:
  • Dataset files corrupted or unreadable
  • Unsupported file format
  • Files not UTF-8 encoded
  • Dataset too small (<3 samples)
Solution:
  • Check error message in status response
  • Verify all files are valid
  • Ensure at least 3-5 samples in dataset
  • Test with known-good data first
Problem: Specification misses important patterns
Solution:
  • Manually edit the specification
  • Add specific requirements explicitly
  • Provide more diverse seed data
  • Add examples of edge cases
Problem: Invalid YAML when updating specification
Solution:
  • Validate YAML: https://www.yamllint.com/
  • Use spaces for indentation, not tabs
  • Escape special characters
  • Use | for multiline strings:
requirements: |
  - First requirement
  - Second requirement
  - Third requirement
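You can also validate the specification before submitting it. A minimal sketch, assuming the PyYAML package is installed (the function name is illustrative):

```python
import yaml

def validate_spec_yaml(text):
    """Return (True, None) if text parses as YAML,
    else (False, error message)."""
    try:
        yaml.safe_load(text)
        return True, None
    except yaml.YAMLError as exc:
        return False, str(exc)
```

This catches the tab-indentation and unescaped-character errors listed above before the API does.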

Generation Issues

Normal times:
  • Short samples: 5-30 seconds each
  • Long samples: 2-10 minutes each
If much slower:
  • Check status for progress updates
  • High system load may slow processing
  • Large batches take longer
  • Contact support if no progress for 30+ minutes
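Checking status periodically, as suggested above, is easiest with a small polling loop. A sketch with the status fetch injected as a callable so you can plug in your own endpoint call; the COMPLETED/FAILED status strings and the `fetch_status` callable are placeholders, not confirmed API values:

```python
import time

def poll_until_done(fetch_status, interval=10, timeout=1800, sleep=time.sleep):
    """Poll a status-returning callable until it reports a
    terminal state, or raise TimeoutError after `timeout` seconds."""
    waited = 0
    while waited < timeout:
        status = fetch_status()
        if status in ("COMPLETED", "FAILED"):
            return status
        sleep(interval)
        waited += interval
    raise TimeoutError(f"no terminal status after {timeout} seconds")
```

With a 30-minute timeout (the default above), a task that never reaches a terminal state surfaces as an exception instead of hanging forever.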
Common causes:
  • Specification not in READY status
  • Wrong sample type for dataset
  • Invalid parameters
  • Model timeout
Solution:
  • Verify spec status is READY
  • Use long samples for structured data (CSV, JSON)
  • Check error message in status response
  • Try with smaller batch (5-10 samples)
Cause: Short samples incompatible with dataset type
Solution:
  • Use sample_type: "long" instead
  • Short samples don’t work for:
    • Single-file CSV/JSON/JSONL datasets
    • Complex structured data
Problem: Samples don’t match requirements
Solutions:
  1. Review and clarify specification requirements
  2. Add more specific constraints
  3. Provide better quality seed data
  4. Try different model (e.g., claude-sonnet vs haiku)
  5. Adjust temperature (0.5-0.8 for balance)
Problem: Generated samples lack diversity
Solutions:
  • Increase temperature (try 0.8-0.9)
  • Add more variation axes to specification
  • Ensure variation values are distinct
  • Generate larger batch for better distribution
  • Provide more diverse seed data

Evaluation Issues

Cause: Samples don’t meet requirements
Solutions:
  • Review failed samples to identify patterns
  • Clarify vague requirements in spec
  • Simplify complex or contradictory requirements
  • Add format examples to specification
  • Try different model
Cause: Some variation values rarely appear
Solutions:
  • Clarify variation axis descriptions
  • Make all variation values equally specific
  • Reduce number of variation axes
  • Generate larger batch
  • Regenerate with adjusted spec
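To see which variation values are under-represented, tally them across a generated batch. A minimal sketch; it assumes each sample is a dict that carries its variation value under the axis name, which may differ from your actual sample schema:

```python
from collections import Counter

def variation_distribution(samples, axis):
    """Return the fraction of samples taking each value of a
    variation axis, so under-represented values stand out."""
    counts = Counter(s.get(axis) for s in samples)
    total = sum(counts.values())
    return {value: count / total for value, count in counts.items()}
```

A heavily skewed distribution (one value dominating) is the signal to clarify that axis's descriptions and regenerate.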
Normal time: 5-10 minutes for 100 samples
If longer:
  • Wait up to 15 minutes
  • Check evaluation status periodically
  • Large batches take proportionally longer
  • Contact support if stuck >30 minutes

Local Development Issues

Causes:
  • Docker not running
  • Port conflicts
  • Insufficient resources
Solutions:
# Check Docker is running
docker info

# Check for port conflicts
lsof -i :3001
lsof -i :8000

# View container logs
docker compose logs service-name

# Rebuild containers
docker compose build --no-cache
docker compose up
Problem: SQS/S3 services unavailable
Solutions:
# Restart LocalStack
docker compose restart localstack

# Check LocalStack health
curl http://localhost:4566/_localstack/health

# Verify queues exist
aws --endpoint-url=http://localhost:4566 sqs list-queues

# Recreate queues if needed
cd dataframer-in-box
docker exec sqs-setup /bin/sh /sqs-setup.sh
Problem: Django migrations fail
Solutions:
# Access backend container
docker compose exec ui-backend bash

# Run migrations manually
python manage.py makemigrations
python manage.py migrate

# If persistent, clean volumes
./dataframer_manager.sh
# Select: "Clean data volumes and restart"
Cause: Network or configuration issues
Solutions:
  • Verify all services are running: docker compose ps
  • Check service logs: docker compose logs postgres redis
  • Verify /etc/hosts configuration
  • Restart affected services: docker compose restart service-name

Error Messages Reference

Common Error Codes

Error                  Status  Meaning                     Solution
Authentication failed  401     Invalid API key             Check API key format and validity
Permission denied      403     Insufficient permissions    Verify API key has access
Resource not found     404     Dataset/spec doesn’t exist  Check resource ID is correct
Invalid parameters     400     Bad request data            Review API documentation
Rate limit exceeded    429     Too many requests           Implement backoff/retry logic
Server error           500     Internal error              Retry or contact support
Timeout                504     Request took too long       Reduce batch size or retry
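The table above splits into two groups: transient errors worth retrying and client errors that a retry cannot fix. A small sketch of that decision logic (the retryable set matches the status codes used in the Retry example earlier):

```python
# Transient statuses worth retrying, per the table above
RETRYABLE = {429, 500, 502, 503, 504}

def should_retry(status_code, attempt, max_attempts=3):
    """Retry transient errors; give up on client errors like
    401/403/404, which a retry cannot fix."""
    return status_code in RETRYABLE and attempt < max_attempts

def backoff_seconds(attempt, base=2):
    """Exponential backoff: wait 2, 4, 8... seconds
    between successive attempts."""
    return base ** attempt
```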

Getting Help

Before Contacting Support

Gather this information:
  1. Error message (complete error text)
  2. Request details (endpoint, parameters)
  3. Resource IDs (dataset_id, spec_id, task_id)
  4. Timestamp of when error occurred
  5. Steps to reproduce the issue

Debugging Tips

Enable verbose logging:
import logging
logging.basicConfig(level=logging.DEBUG)
Check API response:
import requests

# url, headers, and data are the request you are debugging
response = requests.post(url, headers=headers, json=data)
print(f"Status: {response.status_code}")
print(f"Response: {response.text}")
Verify environment:
# Check API connectivity
curl -I https://df-api.dataframer.ai/health

# Test authentication
curl -H "Authorization: Bearer $API_KEY" \
  https://df-api.dataframer.ai/api/users/me/

FAQ

How long does sample generation take?
  • Short samples: 5-30 seconds each
  • Long samples: 2-10 minutes each
  • Total time depends on batch size and available workers
  • Large batches (100+) may take several hours
What are the dataset size limits?
  • Maximum file size: 100 MB per file
  • Maximum samples: 1000 per dataset
  • Recommended: 10-50 samples for best results
  • Minimum: 3-5 samples for meaningful analysis
Can I cancel a generation once it has started?
Currently, generation cannot be canceled once started. The task will run to completion or failure.
How can I improve sample quality?
  1. Clarify specification requirements
  2. Provide higher quality seed data
  3. Use appropriate sample type (short vs long)
  4. Adjust temperature (0.7 is default)
  5. Try different models
  6. Iterate: generate small batches, review, refine
Which models are available?
  • anthropic/claude-sonnet-4-5 (default, recommended)
  • anthropic/claude-haiku-4-5 (faster, lower cost)
  • gemini/gemini-2.5-pro (complex reasoning)
  • openai/gpt-4.1 (alternative high quality)

Still Need Help?