opensource/dify

Commit Graph

Author	SHA1	Message	Date
GareArc	4e3112bd7f	feat(telemetry): add enterprise OTEL telemetry with gateway, traces, metrics, and logs	2026-02-06 01:02:19 -08:00
GareArc	4a9b74f86b	refactor(telemetry): simplify by eliminating TelemetryFacade Problem: The telemetry system had unnecessary abstraction layers and bad practices from the last 3 commits introducing the gateway implementation: - TelemetryFacade class wrapper around emit() function - String literals instead of SignalType enum - Dictionary mapping enum → string instead of enum → enum - Unnecessary ENTERPRISE_TELEMETRY_GATEWAY_ENABLED feature flag - Duplicate guard checks scattered across files - Non-thread-safe TelemetryGateway singleton pattern - Missing guard in ops_trace_task.py causing RuntimeError spam Solution: 1. Deleted TelemetryFacade - replaced with thin emit() function in core/telemetry/__init__.py 2. Added SignalType enum ('trace' | 'metric_log') to enterprise/telemetry/contracts.py 3. Replaced CASE_TO_TRACE_TASK_NAME dict with CASE_TO_TRACE_TASK: dict[TelemetryCase, TraceTaskName] 4. Deleted is_gateway_enabled() and _emit_legacy() - using existing ENTERPRISE_ENABLED + ENTERPRISE_TELEMETRY_ENABLED instead 5. Extracted _should_drop_ee_only_event() helper to eliminate duplicate checks 6. Moved TelemetryGateway singleton to ext_enterprise_telemetry.py: - Init once in init_app() for thread-safety - Access via get_gateway() function 7. Re-added guard to ops_trace_task.py to prevent RuntimeError when EE=OFF but CE tracing enabled 8. Updated 11 caller files to import 'emit as telemetry_emit' instead of 'TelemetryFacade' Result: - 322 net lines deleted (533 removed, 211 added) - All 91 tests pass - Thread-safe singleton pattern - Cleaner API surface: from TelemetryFacade.emit() to telemetry_emit() - Proper enum usage throughout - No RuntimeError spam in EE=OFF + CE=ON scenario	2026-02-05 22:41:09 -08:00
GareArc	4d47339ce6	feat: Add parent trace context propagation for workflow-as-tool hierarchy Enables distributed tracing for nested workflows across all trace providers (Langfuse, LangSmith, community providers). When a workflow invokes another workflow via workflow-as-tool, the child workflow now includes parent context attributes that allow trace systems to reconstruct the full execution tree. Changes: - Add parent_trace_context field to WorkflowTool - Set parent context in tool node when invoking workflow-as-tool - Extract and pass parent context through app generator This is a community enhancement (ungated) that improves distributed tracing for all users. Parent context includes: trace_id, node_execution_id, workflow_run_id, and app_id.	2026-02-05 20:19:29 -08:00
GareArc	55c0fe503d	fix(telemetry): correct enterprise-only trace filtering logic The logic was inverted - we were blocking all CE traces and only allowing enterprise traces. The correct logic should be: - Allow all CE traces (workflow, message, tool, etc.) - Only block enterprise-only traces when enterprise telemetry is disabled Before: if event.name not in _ENTERPRISE_ONLY_TRACES: return After: if event.name in _ENTERPRISE_ONLY_TRACES and not is_enterprise_telemetry_enabled(): return	2026-02-05 20:15:12 -08:00
GareArc	adadf1ec5f	refactor(telemetry): migrate to type-safe enum-based event routing with centralized enterprise filtering Changes: - Change TelemetryEvent.name from str to TraceTaskName enum for type safety - Remove hardcoded trace_task_name_map from facade (no mapping needed) - Add centralized enterprise-only filter in TelemetryFacade.emit() - Rename is_telemetry_enabled() to is_enterprise_telemetry_enabled() - Update all 11 call sites to pass TraceTaskName enum values - Remove redundant enterprise guard from draft_trace.py - Add unit tests for TelemetryFacade.emit() routing (6 tests) - Add unit tests for TraceQueueManager telemetry guard (5 tests) - Fix test fixture scoping issue for full test suite compatibility - Fix tenant_id handling in agent tool callback handler Benefits: - 100% type-safe: basedpyright catches errors at compile time - No string literals: eliminates entire class of typo bugs - Single point of control: centralized filtering in facade - All guards removed except facade - Zero regressions: 4887 tests passing Verification: - make lint: PASS - make type-check: PASS (0 errors, 0 warnings) - pytest: 4887 passed, 8 skipped	2026-02-05 20:15:12 -08:00
GareArc	ed222945aa	refactor(telemetry): introduce TelemetryFacade to centralize event emission Migrate from direct TraceQueueManager.add_trace_task calls to TelemetryFacade.emit with TelemetryEvent abstraction. This reduces CE code invasion by consolidating telemetry logic in core/telemetry/ with a single guard in ops_trace_manager.py.	2026-02-05 20:15:11 -08:00
GareArc	2d60be311d	fix: extract model_provider from model_config in prompt generation trace The model_provider field in prompt generation traces was being incorrectly extracted by parsing the model name (e.g., 'deepseek-chat'), which resulted in an empty string when the model name didn't contain a '/' character. Now extracts the provider directly from the model_config parameter, with a fallback to the old parsing logic for backward compatibility. Changes: - Update _emit_prompt_generation_trace to accept model_config parameter - Extract provider from model_config.get('provider') when available - Update all 6 call sites to pass model_config - Maintain backward compatibility with fallback logic	2026-02-05 20:15:11 -08:00
GareArc	80ee2e982e	fix(telemetry): prevent UUID validation error for tenant-prefixed storage IDs - get_ops_trace_instance was trying to query App table with storage_id format "tenant-{uuid}" - This caused psycopg2.errors.InvalidTextRepresentation when app_id is None - Added early return for tenant-prefixed storage identifiers to skip App lookup - Enterprise telemetry still works correctly with these storage IDs	2026-02-05 20:15:11 -08:00
GareArc	5bbc938a0d	fix(telemetry): add prompt generation trace emission for no_variable=false path - The no_variable=false code path in generate_rule_config was missing trace emission - Added timing wrapper and _emit_prompt_generation_trace call to ensure metrics/logs are captured - Trace now emitted on both success and failure cases for consistency with no_variable=true path	2026-02-05 20:15:10 -08:00
GareArc	052f50805f	feat(telemetry): add node_execution_id and app_id support to trace metadata - Forward kwargs to message_trace to preserve node_execution_id - Add node_execution_id extraction to all trace methods - Add app_id parameter to prompt generation API endpoints - Enable app_id tracing for rule_generate, code_generate, and structured_output operations	2026-02-05 20:15:10 -08:00
GareArc	f5043a8ac8	fix(telemetry): enable metrics and logs for standalone prompt generation Remove app_id parameter from three endpoints and update trace manager to use tenant_id as storage identifier when app_id is unavailable. This allows standalone prompt generation utilities to emit telemetry. Changes: - controllers/console/app/generator.py: Remove app_id=None from 3 endpoints (RuleGenerateApi, RuleCodeGenerateApi, RuleStructuredOutputGenerateApi) - core/ops/ops_trace_manager.py: Use tenant_id fallback in send_to_celery - Extract tenant_id from task.kwargs when app_id is None - Use 'tenant-{tenant_id}' format as storage identifier - Skip traces only if neither app_id nor tenant_id available The trace metadata still contains the actual tenant_id, so enterprise telemetry correctly emits metrics and logs grouped by tenant.	2026-02-05 20:15:10 -08:00
GareArc	22c8d8d772	feat(telemetry): add prompt generation telemetry to Enterprise OTEL - Add PromptGenerationTraceInfo trace entity with operation_type field - Implement telemetry for rule-generate, code-generate, structured-output, instruction-modify operations - Emit metrics: tokens (total/input/output), duration histogram, requests counter, errors counter - Emit structured logs with model info and operation context - Content redaction controlled by ENTERPRISE_INCLUDE_CONTENT env var - Fix user_id propagation in TraceTask kwargs - Fix latency calculation when llm_result is None No spans exported - metrics and logs only for lightweight observability.	2026-02-05 20:14:49 -08:00
GareArc	8ceb1ed96f	feat(telemetry): add input/output token split to enterprise OTEL traces - Add PROMPT_TOKENS and COMPLETION_TOKENS to WorkflowNodeExecutionMetadataKey - Store prompt/completion tokens in node execution metadata JSON (no schema change) - Calculate workflow-level token split by summing node executions on-the-fly - Export gen_ai.usage.input_tokens and output_tokens to enterprise telemetry - Add semantic convention constants for token attributes - Maintain backward compatibility (historical data shows null) BREAKING: None MIGRATION: None (uses JSON metadata, no schema changes)	2026-02-05 20:12:30 -08:00
GareArc	701f02f853	feat(telemetry): add invoked_by user tracking to enterprise OTEL	2026-02-05 20:12:29 -08:00
GareArc	3461c3a8ef	feat(enterprise): Add OTEL telemetry with slim traces, metrics, and structured logs - Add EnterpriseOtelTrace handler with span emission for workflows and nodes - Implement minimal-span strategy: slim spans + detailed companion logs - Add deterministic span/trace IDs for cross-workflow trace correlation - Add metric collection at 100% accuracy (counters & histograms) - Add event handlers for app lifecycle and feedback telemetry - Add cross-workflow trace linking with parent context propagation - Add OTEL exporter with configurable sampling and privacy controls - Wire enterprise telemetry into workflow execution pipeline - Add telemetry configuration in enterprise configs	2026-02-05 20:12:28 -08:00
Asuka Minato	491fa9923b	refactor: port api/controllers/console/datasets/data_source.py /datasets/metadata.py /service_api/dataset/metadata.py /nodes/agent/agent_node.py api/core/workflow/nodes/datasource/datasource_node.py api/services/dataset_service.py to match case (#31836)	2026-02-02 21:03:16 +09:00
Asuka Minato	ce2c41bbf5	refactor: port api/controllers/console/datasets/datasets_document.py api/controllers/service_api/app/annotation.py api/core/app/app_config/easy_ui_based_app/agent/manager.py api/core/app/apps/pipeline/pipeline_generator.py api/core/workflow/nodes/knowledge_retrieval/knowledge_retrieval_node.py to match case (#31832)	2026-02-02 19:07:30 +09:00
FFXN	41177757e6	fix: summary index bug (#31810) Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jyong <76649700+JohnJyong@users.noreply.github.com> Co-authored-by: zxhlyh <jasonapring2015@outlook.com> Co-authored-by: Yansong Zhang <916125788@qq.com> Co-authored-by: hj24 <mambahj24@gmail.com> Co-authored-by: CodingOnStar <hanxujiang@dify.ai> Co-authored-by: CodingOnStar <hanxujiang@dify.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-02-02 09:45:17 +08:00
yyh	4f826b4641	refactor(typing): use enum types for workflow status fields (#31792)	2026-02-02 09:41:34 +08:00
Asuka Minato	7828508b30	refactor: remove all reqparser (#29289) Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Stephen Zhou <38493346+hyoban@users.noreply.github.com>	2026-02-01 13:43:14 +09:00
盐粒 Yanli	b8cb5f5ea2	refactor(typing): Fixup typing A2 - workflow engine & nodes (#31723) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Asuka Minato <i@asukaminato.eu.org>	2026-01-31 18:00:56 +09:00
盐粒 Yanli	5bc99995fc	fix(api): align graph protocols for response streaming (#31777)	2026-01-31 01:57:36 +09:00
lif	24b280a0ed	fix(i18n): improve Chinese translation of Max Tokens (#31771) Signed-off-by: majiayu000 <1835304752@qq.com>	2026-01-30 20:19:35 +08:00
QuantumGhost	90fe9abab7	revert: revert human input relevant code (#31766) Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2026-01-30 19:18:49 +08:00
QuantumGhost	f90fa2b186	fix(api): fix workflow state persistence issue (#31752) Ensure workflow pause configuration is correctly set for all entrypoints.	2026-01-30 17:44:29 +08:00
盐粒 Yanli	5a7dfd15b8	fix: Drain non-stream plugin chunk iterator (#31564)	2026-01-30 16:54:56 +08:00
Asuka Minato	89abea26f9	refactor: rm some dict api/controllers/console/app/generator.py api/core/llm_generator/llm_generator.py (#31709) Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2026-01-30 17:37:20 +09:00
Jax	95d68437d1	fix(redis): Redis Cluster eval errors by adding hash tags to trigger debug keys (#31701)	2026-01-30 16:05:02 +08:00
QuantumGhost	03e3acfc71	feat(api): Human Input Node (backend part) (#31646) The backend part of the human in the loop (HITL) feature and relevant architecture / workflow engine changes. Signed-off-by: yihong0618 <zouzou0208@gmail.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: -LAN- <laipz8200@outlook.com> Co-authored-by: 盐粒 Yanli <yanli@dify.ai> Co-authored-by: CrabSAMA <40541269+CrabSAMA@users.noreply.github.com> Co-authored-by: Stephen Zhou <38493346+hyoban@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: yihong <zouzou0208@gmail.com> Co-authored-by: Joel <iamjoel007@gmail.com>	2026-01-30 10:18:49 +08:00
盐粒 Yanli	5bf0251554	chore(typing): reduce ty excludes for A1 (#31721) Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2026-01-30 02:38:57 +08:00
Nie Ronghua	ceb6914793	refactor(model): Refactor plugin model schema cache to be process-global to prevent redundant Daemon API calls (#31689) Signed-off-by: -LAN- <laipz8200@outlook.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: -LAN- <laipz8200@outlook.com>	2026-01-29 14:31:15 +08:00
盐粒 Yanli	dbfc47e8b0	fix: SSRF in WordExtractor URL download (credit to @EaEa0001) (#31678) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-01-29 14:01:21 +08:00
FFXN	c2473d85dc	feat: Add summary index for knowledge. (#31625) Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Jyong <76649700+JohnJyong@users.noreply.github.com> Co-authored-by: zxhlyh <jasonapring2015@outlook.com> Co-authored-by: Yansong Zhang <916125788@qq.com> Co-authored-by: hj24 <mambahj24@gmail.com> Co-authored-by: CodingOnStar <hanxujiang@dify.ai> Co-authored-by: CodingOnStar <hanxujiang@dify.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-01-29 13:47:35 +08:00
eux	b48a10d7ec	feat(qdrant): implement full-text search with multi-keyword support (#31658)	2026-01-29 11:12:18 +08:00
fenglin	91532ef429	fix: add list type support for ToolInput constant value in tool node (#31276) Co-authored-by: qiaofenglin <qiaofenglin@baidu.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-01-29 10:49:29 +08:00
-LAN-	24ebe2f5c6	refactor(graph_engine): Add a Config class for graph engine. (#31663) Signed-off-by: -LAN- <laipz8200@outlook.com>	2026-01-28 19:57:55 +08:00
-LAN-	3d414678e3	fix(graph_engine): Cannot run single iteration or loop node (#31470) Signed-off-by: -LAN- <laipz8200@outlook.com> Co-authored-by: Yeuoly <45712896+Yeuoly@users.noreply.github.com>	2026-01-28 01:05:59 +08:00
-LAN-	d76ad15fca	refactor(graph_engine): move observability layer and persistence laye… (#31620)	2026-01-28 00:54:21 +08:00
heyszt	eca26a9b9b	feat: Enhances OpenTelemetry node parsers (#30706) Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2026-01-27 15:30:21 +08:00
E.G	f6be9cd90d	refactor: replace request.args.get with Pydantic BaseModel validation (#31104) Co-authored-by: GlobalStar117 <GlobalStar117@users.noreply.github.com> Co-authored-by: Asuka Minato <i@asukaminato.eu.org> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-01-27 10:48:42 +08:00
wangxiaolei	e48419937b	feat: chatflow support multimodal (#31293) Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2026-01-27 00:24:48 +08:00
盐粒 Yanli	92011d0a31	refactor: LLM plugin invoke parsing (#31499) Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2026-01-26 14:59:57 +08:00
Asuka Minato	b9f1d65d4f	refactor: example of refine dict / Mapping (#31498)	2026-01-26 10:23:38 +08:00
TomoOkuyama	0772d49257	fix(api): fix IRIS hybrid search returning zero results (#31309) Co-authored-by: Tomo Okuyama <tomo.okuyama@intersystems.com>	2026-01-24 10:29:19 +08:00
-LAN-	67eb8c052d	refactor: single-node workflow runner helpers (#31472)	2026-01-24 10:27:44 +08:00
fenglin	e8f9d64651	fix(tools): fix ToolInvokeMessage Union type parsing issue (#31450) Co-authored-by: qiaofenglin <qiaofenglin@baidu.com>	2026-01-24 10:18:06 +08:00
-LAN-	c575c34ca6	refactor: Move workflow node factory to app workflow (#31385) Signed-off-by: -LAN- <laipz8200@outlook.com>	2026-01-22 18:08:21 +08:00
wangxiaolei	a112caf5ec	fix: use thread local isolation the context (#31410)	2026-01-22 18:02:54 +08:00
zejiewang	811e43d0d4	fix: non-auto variable type params of agent node tool are not correctly parsed (#31128) Co-authored-by: wangzejie <wangzejie@meicai.cn> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>	2026-01-21 21:19:11 +08:00
wangxiaolei	211c57f7b6	fix: remove _try_resolve_user_from_request (#31360)	2026-01-21 21:19:11 +08:00
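
Note on commit 55c0fe503d: its message describes an inverted guard that dropped every CE trace and let only enterprise-only traces through. The two conditions it quotes can be compared side by side in a minimal sketch. Only the names TraceTaskName, TelemetryEvent, and _ENTERPRISE_ONLY_TRACES come from the commit messages; the enum members, the helper function names, and the enterprise_enabled parameter (standing in for is_enterprise_telemetry_enabled()) are hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass
from enum import Enum


class TraceTaskName(Enum):
    # Hypothetical members; the real enum's members are not shown in the log.
    WORKFLOW = "workflow"
    MESSAGE = "message"
    EE_ONLY_EXAMPLE = "ee_only_example"


# Assumed contents: which traces count as enterprise-only is not listed in the log.
_ENTERPRISE_ONLY_TRACES = frozenset({TraceTaskName.EE_ONLY_EXAMPLE})


@dataclass(frozen=True)
class TelemetryEvent:
    name: TraceTaskName


def should_drop_before_fix(event: TelemetryEvent) -> bool:
    """Inverted guard: drops everything that is NOT enterprise-only."""
    return event.name not in _ENTERPRISE_ONLY_TRACES


def should_drop_after_fix(event: TelemetryEvent, enterprise_enabled: bool) -> bool:
    """Corrected guard: drops only enterprise-only traces, and only when
    enterprise telemetry is disabled."""
    return event.name in _ENTERPRISE_ONLY_TRACES and not enterprise_enabled


ce_event = TelemetryEvent(TraceTaskName.WORKFLOW)
ee_event = TelemetryEvent(TraceTaskName.EE_ONLY_EXAMPLE)

# Old condition: the CE workflow trace is dropped, the enterprise-only one passes.
old_result = (should_drop_before_fix(ce_event), should_drop_before_fix(ee_event))

# New condition with enterprise telemetry off: CE passes, enterprise-only is dropped.
new_result_off = (
    should_drop_after_fix(ce_event, enterprise_enabled=False),
    should_drop_after_fix(ee_event, enterprise_enabled=False),
)
```

With enterprise telemetry enabled, the corrected guard drops neither event, which matches the "allow all CE traces" behaviour the commit message states as the intent.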


3137 Commits