dify/extractor at 00935fe52619e53970ab6cf483eaa67662b15a3b - dify

mirror of https://github.com/langgenius/dify.git synced 2026-03-11 03:29:44 +08:00

History

盐粒 Yanli dbfc47e8b0 fix: SSRF in WordExtractor URL download (credit to @EaEa0001 ) (#31678 ) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>		2026-01-29 14:01:21 +08:00
..
blob
entity	Feat/update notion preview (#29345 )	2025-12-16 16:43:45 +08:00
firecrawl	fix: fix firecrawl url concat (#30008 )	2025-12-24 09:40:32 +08:00
unstructured	refactor: use dynamic max characters for chunking in extractors (#26782 )	2025-10-13 10:22:59 +08:00
watercrawl
csv_extractor.py
excel_extractor.py	perf(core/rag): optimize Excel extractor performance and memory usage (#29551 )	2025-12-12 12:15:03 +08:00
extract_processor.py	feat: support image extraction in PDF RAG extractor (#30399 )	2025-12-31 15:49:06 +08:00
extractor_base.py
helpers.py	fix: detect_file_encodings TypeError: tuple indices must be integers or slices, not str (#29595 )	2025-12-17 13:58:05 +08:00
html_extractor.py
jina_reader_extractor.py
markdown_extractor.py
notion_extractor.py	fix: handle missing `credential_id` (#30051 )	2025-12-24 11:21:51 +08:00
pdf_extractor.py	feat: support image extraction in PDF RAG extractor (#30399 )	2025-12-31 15:49:06 +08:00
text_extractor.py
word_extractor.py	fix: SSRF in WordExtractor URL download (credit to @EaEa0001 ) (#31678 )	2026-01-29 14:01:21 +08:00