Commit History
Remove use of eval() from recognizer.py (#4480)
1b54e4d
Mathias Panzenböck
commited on
Remove use of eval() from ocr.py (#4481)
ba2fa9c
Mathias Panzenböck
commited on
Accelerate titles' embeddings. (#4492)
b07ff91
Kevin Hu
commited on
Refactor. (#4487)
83ee116
Kevin Hu
commited on
fix:t_recognizer TypeError: 'super' object is not callable (#4404)
b57c1d6
Zhi-Qiang You
youzhiqiang
Kevin Hu
commited on
Fix t_recognizer issue. (#4387)
957cf28
Kevin Hu
commited on
Fix param error. (#4355)
2820402
Kevin Hu
commited on
Fix t_recognizer.py after model updating. (#4330)
1955ace
Kevin Hu
commited on
Catch the exception while parsing pptx. (#4202)
74b229a
Kevin Hu
commited on
Add Llama3.3 (#4174)
84758ca
Kevin Hu
commited on
fix chunk method "Table" losing content when the Excel file has multi… (#4123)
75faa66
Upgrades Document Layout Analysis model. (#4054)
0129457
Kevin Hu
commited on
Fix json file parse (#4004)
a11ffc5
Add back beartype (#3967)
63cb972
Added static check at PR CI (#3921)
fe9b6b3
Fix errors detected by Ruff (#3918)
0404a52
Fix parsing JSON file error (#3829)
43bceb7
Fix: page_chars attribute does not exist in some formats of PDF (#3796)
3c857ed
Fix out of boundary. (#3786)
1275b47
Kevin Hu
commited on
Edit chunk shall update instead of insert it (#3709)
1b2aab6
Always open text file for write with UTF-8 (#3688)
9a73da2
Added kb_id filter to knn. Fix #3458 (#3513)
aebd986
Introduced beartype (#3460)
eae0334
Move settings initialization after module init phase (#3438)
6101699
Use consistent log file names, introduced initLogger (#3403)
8bc2fc9
fix bug about fetching knowledge graph (#3394)
6a9fa6b
Kevin Hu
commited on
Rework logging (#3358)
22fe41e
Fix:#3230 When parsing a docx file using the Book parsing method, to_page is always -1, resulting in a block count of 0 even if parsing is successful (#3249)
ff43695
kuschzzp
Kevin Hu
commited on
search between multiple indiices for team function (#3079)
95863fc
Kevin Hu
commited on
bigger resolution for OCR (#2919)
7b6220c
Kevin Hu
commited on
fix: torch dependency start error (#2777)
0de98c4
chore: update operators.py (#2724)
b2a5c0f
trival (#2650)
03a3e60
Kevin Hu
commited on
rename get_txt to get_text (#2649)
d624a4e
Kevin Hu
commited on
Add get_txt function (#2639)
82adb1c
yqkcn
Kevin Hu
commited on
Fix: renrank_model and pdf_parser bugs | Update: session API (#2601)
678763e
liuhua
liuhua
commited on
add lighten control (#2567)
dbcbb17
Kevin Hu
commited on
fix parsing spaces in russian language PDFs (#1987) (#2427)
bac5213
optimize text parser (#2144)
13b285d
Kevin Hu
commited on
Format file format from Windows/dos to Unix (#1949)
aeb6dbc
fix parameter error (#1925)
1f75d02
Kevin Hu
commited on
add sql to naive parser (#1908)
ef2a724
Kevin Hu
commited on
add support for eml file parser (#1768)
6ed07a9
Fix docx parser line bug (#1715)
dda4c86
H
Kevin Hu
commited on
fix: When parsing the bold content in PDF, the result is duplicated. (#1729)
971f83c
Fix pdfparser error (#1707)
ea41e57
H
commited on
Fix pdfparser content confusion (#1700)
eec0415
H
commited on
pypdf2 to pypdf (#1684)
10534c3
Kevin Hu
commited on
fix bug of ragflowdocxpparser (#1642)
fe5404c
Kevin Hu
commited on