mirror of
https://github.com/langgenius/dify.git
synced 2026-05-13 08:57:28 +08:00
The dataset_queries table grows without bound because every RAG retrieval and hit-test inserts a row. This adds a configurable Celery Beat task (clean_dataset_queries_task) that deletes rows older than a retention period (default 60 days) in batches, gated by ENABLE_CLEAN_DATASET_QUERIES_TASK. Retention is clamped to max(config, PLAN_SANDBOX_CLEAN_DAY_SETTING) to avoid breaking clean_unused_datasets_task which reads DatasetQuery.created_at. Also adds a created_at index on dataset_queries via alembic migration to keep the delete scan performant as the table grows. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| deploy | ||
| enterprise | ||
| extra | ||
| feature | ||
| middleware | ||
| observability | ||
| packaging | ||
| remote_settings_sources | ||
| __init__.py | ||
| app_config.py | ||