Share class names with currency Reason The currency in document not next to share name Solution If can't get relevant text from PDF page contents, and the last word of share class name belongs to currency, remove currency from share class name, then try again. After implementing this solution, recall is from 95% to 96% Can't find relevant text from current PDF page text Reason Hence apply try to merge previous page text into current page, perhaps the text is from previous page text. Solution Try to get previous page and search relevant value. After implementing this solution, recall is from 96% to 98%. |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| biz_utils.py | ||
| gpt_utils.py | ||
| logger.py | ||
| pdf_download.py | ||
| pdf_util.py | ||
| s3_util.py | ||
| similarity.py | ||
| sql_query_util.py | ||
| sys_util.py | ||