Go to file
Blade He 276ff93a1d Optimize drilldown algorithm
Share class names with currency
Reason
The currency in document not next to share name
Solution
If can't get relevant text from PDF page contents, and the last word of share class name belongs to currency, remove currency from share class name, then try again.
After implementing this solution, recall is from 95% to 96%
Can't find relevant text from current PDF page text
Reason
Hence apply try to merge previous page text into current page, perhaps the text is from previous page text.
Solution
Try to get previous page and search relevant value.
After implementing this solution, recall is from 96% to 98%.
2024-11-26 16:35:07 -06:00
configuration dynamic loading instructions for multilingual. 2024-11-20 17:00:22 -06:00
core dynamic loading instructions for multilingual. 2024-11-20 17:00:22 -06:00
instructions dynamic loading instructions for multilingual. 2024-11-20 17:00:22 -06:00
utils Optimize drilldown algorithm 2024-11-26 16:35:07 -06:00
yml Create EMEA AR API code file. 2024-11-26 11:24:29 -06:00
.gitignore Create EMEA AR API code file. 2024-11-26 11:24:29 -06:00
app_emea_ar.py Create EMEA AR API code file. 2024-11-26 11:24:29 -06:00
drilldown_practice.py 1. optimize drilldown algorithm 2024-11-25 15:11:03 -06:00
main.py Optimize drilldown algorithm 2024-11-26 16:35:07 -06:00
pdf_table_extraction.py auto-fix json data format 2024-08-19 17:59:32 -05:00
playground.ipynb update general prompts 2024-08-29 17:05:58 -05:00
playground.py 1. Text-and-image all in one chat function by ChatGPT4o 2024-08-26 17:17:39 -05:00
prepare_data.py update for more statistics methods 2024-11-06 16:39:42 -06:00
requirements.txt update python libraries 2024-11-25 11:11:02 -06:00