dify/api/core/rag/extractor
李龙飞 8dc5d98174 Fix: Correctly handle merged cells in DOCX tables to prevent content duplication and loss (#27871)
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: crazywoola <100913391+crazywoola@users.noreply.github.com>
2025-11-14 14:43:40 +08:00
..
blob
entity
firecrawl
unstructured
watercrawl
csv_extractor.py
excel_extractor.py
extract_processor.py
extractor_base.py
helpers.py
html_extractor.py
jina_reader_extractor.py
markdown_extractor.py
notion_extractor.py
pdf_extractor.py
text_extractor.py
word_extractor.py