Commit Graph

8 Commits

Author SHA1 Message Date
Blade He d25bae936c Optimize investment mapping algorithm. 2024-09-26 12:18:37 -05:00
Blade He 0f14bf4a7a 1. get document/ provider mapping data
2. optimize metrics algorithm
3. Expand max token length since switch ChatGPT4o to 2024-08-06 version.
2024-09-23 17:21:02 -05:00
Blade He 98e86a6cfd realize to calculate data extraction metrics. 2024-09-18 17:10:54 -05:00
Blade He 878383a72c support extract the continuous page(s) for not missing next page data which without table header. 2024-09-06 16:29:35 -05:00
Blade He 6519dc23d4 support filter pages by data point keywords 2024-08-23 16:38:11 -05:00
Blade He 993664cf78 a lot of functions to prepare data. 2024-08-22 10:37:56 -05:00
Blade He f91e0cf1a8 auto-fix json data format 2024-08-19 17:59:32 -05:00
Blade He fa46b45ad5 support output tables as markdown format from pdf documents 2024-08-19 15:49:45 -05:00