Frederick2313072
626e71cb3b
feat: implement content-based deduplication for document segments
- Add database index on (dataset_id, index_node_hash) for efficient deduplication queries
- Add deduplication check in SegmentService.create_segment and multi_create_segment methods
- Add deduplication check in DatasetDocumentStore.add_documents method to prevent duplicate embedding processing
- Skip creating segments with identical content hashes across the entire dataset
This prevents duplicate content from being re-processed and re-embedded when uploading documents with repeated content, improving efficiency and reducing unnecessary compute costs.
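A minimal sketch of the deduplication idea described above, not the code from this commit. It assumes the content hash is a SHA-256 of the segment text; how index_node_hash is actually computed is not stated here. `content_hash` and `InMemorySegmentStore` are hypothetical names, and a dict keyed on (dataset_id, hash) stands in for the database index.

```python
import hashlib
from dataclasses import dataclass, field


def content_hash(text: str) -> str:
    """Hypothetical stand-in for however index_node_hash is computed."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


@dataclass
class InMemorySegmentStore:
    # Maps (dataset_id, hash) -> content, mimicking a composite index lookup.
    _by_key: dict[tuple[str, str], str] = field(default_factory=dict)

    def add_segments(self, dataset_id: str, contents: list[str]) -> list[str]:
        """Store unseen contents and return the hashes that were created."""
        created: list[str] = []
        for text in contents:
            key = (dataset_id, content_hash(text))
            if key in self._by_key:
                # Same content already exists in this dataset: skip it,
                # so it would not be embedded a second time.
                continue
            self._by_key[key] = text
            created.append(key[1])
        return created
```

In this sketch the check is scoped per dataset, so identical text in a different dataset is still stored.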
2025-09-20 06:28:14 +08:00
Xiyuan Chen
9f138ef246
Refactor WorkflowService to handle missing default credentials gracef… ( #25960 )
2025-09-19 00:45:35 -07:00
Yongtao Huang
b3dafd913b
Chore: correct inconsistent logging and typo ( #25945 )
2025-09-19 10:36:16 +08:00
-LAN-
a173546c8d
Fix: replace stdout prints with debug logging ( #25931 )
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-09-18 21:03:20 +08:00
-LAN-
4ba1292455
refactor: replace print statements with proper logging ( #25773 )
2025-09-18 20:35:47 +08:00
Maries
bb01c31f30
fix(api): enhance data handling in RagPipelineDslService to filter credentials ( #25926 )
2025-09-18 18:36:49 +08:00
quicksand
680eb7a9f6
fix(datasets): retrieval_model null issue when updating dataset info ( #25907 )
2025-09-18 17:58:06 +08:00
crazywoola
878420463c
fix: Message => str ( #25876 )
2025-09-18 17:57:57 +08:00
Novice
8f86f5749d
chore: Update the value of sys.dialogue_count to start from 1. ( #25905 )
2025-09-18 15:52:52 +08:00
Novice
21230a8eb2
fix: handle None description in MCP tool transformation ( #25872 )
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2025-09-18 13:11:38 +08:00
-LAN-
85cda47c70
feat: knowledge pipeline ( #25360 )
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: twwu <twwu@dify.ai>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
Co-authored-by: jyong <718720800@qq.com>
Co-authored-by: Wu Tianwei <30284043+WTW0313@users.noreply.github.com>
Co-authored-by: QuantumGhost <obelisk.reg+git@gmail.com>
Co-authored-by: lyzno1 <yuanyouhuilyz@gmail.com>
Co-authored-by: quicksand <quicksandzn@gmail.com>
Co-authored-by: Jyong <76649700+JohnJyong@users.noreply.github.com>
Co-authored-by: lyzno1 <92089059+lyzno1@users.noreply.github.com>
Co-authored-by: zxhlyh <jasonapring2015@outlook.com>
Co-authored-by: Yongtao Huang <yongtaoh2022@gmail.com>
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: Joel <iamjoel007@gmail.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: nite-knite <nkCoding@gmail.com>
Co-authored-by: Hanqing Zhao <sherry9277@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Harry <xh001x@hotmail.com>
2025-09-18 12:49:10 +08:00
zyssyz123
7dadb33003
fix: remove billing cache when add or delete app or member ( #25885 )
2025-09-18 12:18:07 +08:00
Xiyuan Chen
8635aacb46
Enhance LLM model configuration validation to include active status c… ( #25759 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
2025-09-15 23:15:53 -07:00
quicksand
bdbe078630
fix(mcp): prevent masked headers from overwriting real values ( #25722 )
2025-09-15 19:24:12 +08:00
Garfield Dai
88d5e27fe8
Release/e-1.8.1 ( #25613 )
Co-authored-by: zxhlyh <jasonapring2015@outlook.com>
Co-authored-by: GareArc <chen4851@purdue.edu>
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: hjlarry <hjlarry@163.com>
2025-09-15 14:49:23 +08:00
-LAN-
bab4975809
chore: add ast-grep rule to convert Optional[T] to T | None ( #25560 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
2025-09-15 13:06:33 +08:00
Krito.
a13d7987e0
chore: adopt StrEnum and auto() for some string-typed enums ( #25129 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2025-09-12 21:14:26 +08:00
kenwoodjw
c91253d05d
fix segment deletion race condition ( #24408 )
Signed-off-by: kenwoodjw <blackxin55+@gmail.com>
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2025-09-12 15:29:57 +08:00
zyssyz123
c2fcd2895b
Feat/email register refactor ( #25369 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
Co-authored-by: Joel <iamjoel007@gmail.com>
2025-09-12 10:24:54 +08:00
Yongtao Huang
4511f4f537
Remove redundant parse_args call in WorkflowByIdApi.patch ( #25498 )
2025-09-12 09:40:41 +08:00
Yongtao Huang
a1322ddb5d
Fix: correct has_more pagination logic in get_conversational_variable ( #25484 )
Signed-off-by: Yongtao Huang<yongtaoh2022@gmail.com>
2025-09-12 09:32:22 +08:00
Eric Guo
70e4d6be34
Fix 500 in dataset page. ( #25474 )
2025-09-10 15:57:04 +08:00
Asuka Minato
cbc0e639e4
update sql in batch ( #24801 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
2025-09-10 13:00:17 +08:00
Newton José
6574e9f0b2
Fix: Add Password Validation to Account Creation ( #25382 )
2025-09-10 08:58:39 +08:00
-LAN-
08dd3f7b50
Fix basedpyright type errors ( #25435 )
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
2025-09-10 01:54:26 +08:00
Yongtao Huang
2ac7a9c8fc
Chore: thanks to bump-pydantic ( #25437 )
2025-09-09 20:07:17 +08:00
Asuka Minato
38057b1b0e
add typing to all wraps ( #25405 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
2025-09-09 16:48:33 +08:00
ttz12345
d2e50a508c
Fix:About the error problem of creating an empty knowledge base interface in service_api ( #25398 )
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2025-09-09 15:18:31 +08:00
Xiyuan Chen
64c9a2f678
Feat/credential policy ( #25151 )
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
2025-09-08 23:45:05 -07:00
zyssyz123
ea61420441
Revert "feat: email register refactor" ( #25367 )
2025-09-08 19:20:09 +08:00
kenwoodjw
598ec07c91
feat: enable dsl export encrypt dataset id or not ( #25102 )
Signed-off-by: kenwoodjw <blackxin55+@gmail.com>
2025-09-08 18:03:24 +08:00
zyssyz123
860ee20c71
feat: email register refactor ( #25344 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2025-09-08 17:51:43 +08:00
Cluas
f891c67eca
feat: add MCP server headers support #22718 ( #24760 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
Co-authored-by: Novice <novice12185727@gmail.com>
2025-09-08 14:10:55 +08:00
Asuka Minato
16a3e21410
more assert ( #24996 )
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2025-09-08 09:59:43 +08:00
-LAN-
9b8a03b53b
[Chore/Refactor] Improve type annotations in models module ( #25281 )
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2025-09-08 09:42:27 +08:00
Asuka Minato
a78339a040
remove bare list, dict, Sequence, None, Any ( #25058 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
2025-09-06 03:32:23 +08:00
-LAN-
a2e0f80c01
[Chore/Refactor] Improve type checking configuration ( #25185 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
2025-09-05 08:34:18 +08:00
Will
de768af099
fix: reset password ( #25172 )
2025-09-04 20:34:56 +08:00
Will
d36ce782b7
fix: update account profile ( #25150 )
2025-09-04 18:32:51 +08:00
Tonlo
d5aaee614f
fix recommended apps reading from db logic ( #25071 )
2025-09-04 11:14:37 +08:00
-LAN-
53c4a8787f
[Chore/Refactor] Improve type safety and resolve type checking issues ( #25104 )
2025-09-04 09:35:32 +08:00
非法操作
b673560b92
feat: improve multi model credentials ( #25009 )
Co-authored-by: Claude <noreply@anthropic.com>
2025-09-03 13:52:31 +08:00
-LAN-
9d5956cef8
[Chore/Refactor] Switch from MyPy to Basedpyright for type checking ( #25047 )
Signed-off-by: -LAN- <laipz8200@outlook.com>
2025-09-03 11:52:26 +08:00
Will
b5216df4fe
fix: xxx is not bound to a Session ( #24966 )
2025-09-02 21:37:06 +08:00
GuanMu
25a11bfafc
Export DSL from history ( #24939 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
2025-09-02 21:36:52 +08:00
Bowen Liang
7b379e2a61
chore: apply ty checks on api code with script and ci action ( #24653 )
2025-09-02 16:05:13 +08:00
ZalterCitty
bd482eb8ef
fix wrong filter handle for saved messages ( #24891 )
Co-authored-by: zhuqingchao <zhuqingchao@xiaomi.com>
2025-09-01 16:32:08 +08:00
Frederick2313072
5b3cc560d5
fix:hard-coded top-k fallback issue. ( #24879 )
2025-09-01 15:46:37 +08:00
willzhao
ffba341258
[CHORE]: remove redundant-cast ( #24807 )
2025-09-01 14:05:32 +08:00
Asuka Minato
24e2b72b71
Update ast-grep pattern for session.query ( #24828 )
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
2025-08-31 17:03:51 +08:00