|
432 | 432 | - [https://github.com/javpower/JavaVision](https://github.com/javpower/JavaVision) |
433 | 433 | - 数据管道 [https://github.com/orchest](https://github.com/orchest) |
434 | 434 | - 数据科学Web [https://github.com/plotly/dash](https://github.com/plotly/dash) |
435 | | -- 扫描PDF [https://github.com/baicunko/scanyourpdf](https://github.com/baicunko/scanyourpdf) |
436 | | -- [https://github.com/eosphoros-ai/DB-GPT](https://github.com/eosphoros-ai/DB-GPT) |
437 | | -- [https://github.com/oomol-lab/pdf-craft](https://github.com/oomol-lab/pdf-craft) |
438 | | -- PDF翻译 [https://github.com/Byaidu/PDFMathTranslate](https://github.com/Byaidu/PDFMathTranslate) |
439 | | -- [https://github.com/Feather-2/paper-burner-x](https://github.com/Feather-2/paper-burner-x) |
440 | | -- [https://github.com/xunbu/docutranslate](https://github.com/xunbu/docutranslate) |
441 | | -- PDF转Markdown [https://github.com/jorben/markpdfdown](https://github.com/jorben/markpdfdown) |
442 | 435 | - 签名 [https://github.com/SigmaHQ/sigma](https://github.com/SigmaHQ/sigma) |
443 | 436 | - [https://github.com/Alic-yuan/nlp-beginner-finish](https://github.com/Alic-yuan/nlp-beginner-finish) |
444 | 437 | - [https://github.com/heartexlabs/label-studio](https://github.com/heartexlabs/label-studio) |
|
492 | 485 |
|
493 | 486 | + [https://github.com/topics/ocr](https://github.com/topics/ocr) |
494 | 487 | + [https://github.com/topics/ocr-recognition](https://github.com/topics/ocr-recognition) |
| 488 | ++ [https://github.com/topics/scene-text-recognition](https://github.com/topics/scene-text-recognition) |
| 489 | ++ [https://github.com/topics/scene-text-detection](https://github.com/topics/scene-text-detection) |
495 | 490 | + [https://github.com/search?q=ocr](https://github.com/search?q=ocr) |
| 491 | ++ 评估基准 [https://github.com/opendatalab/OmniDocBench](https://github.com/opendatalab/OmniDocBench) |
496 | 492 |
|
497 | 493 |
|
498 | 494 | - `ImportError: libGL.so.1: cannot open shared object file: No such file or directory` |
|
530 | 526 | * [https://sourceforge.net/projects/jocr](https://sourceforge.net/projects/jocr) |
531 | 527 | * [https://github.com/cdli-gh/Cuneiform-OCR](https://github.com/cdli-gh/Cuneiform-OCR) |
532 | 528 | * [https://www.gnu.org/software/ocrad](https://www.gnu.org/software/ocrad) |
| 529 | +* [https://github.com/Topdu/OpenOCR](https://github.com/Topdu/OpenOCR) |
| 530 | + * [https://www.gitpp.com/open-embodied/openocr](https://www.gitpp.com/open-embodied/openocr) |
533 | 531 | * [https://github.com/tesseract-ocr/tesseract](https://github.com/tesseract-ocr/tesseract) |
534 | 532 | * [https://github.com/nguyenq/tess4j](https://github.com/nguyenq/tess4j) |
535 | | - * [https://sourceforge.net/projects/tess4j](https://sourceforge.net/projects/tess4j) |
| 533 | + * [https://sourceforge.net/projects/tess4j](https://sourceforge.net/projects/tess4j) |
536 | 534 | * [https://github.com/manisandro/gImageReader](https://github.com/manisandro/gImageReader) |
537 | 535 | * [https://github.com/datalab-to/surya](https://github.com/datalab-to/surya) |
538 | 536 | * [https://github.com/scribeocr/scribeocr](https://github.com/scribeocr/scribeocr) |
539 | 537 | * [https://github.com/plantree/ocr-pwa](https://github.com/plantree/ocr-pwa) |
540 | 538 | * [https://github.com/ianzhao05/textshot](https://github.com/ianzhao05/textshot) |
541 | 539 | * [https://github.com/mittagessen/kraken](https://github.com/mittagessen/kraken) |
542 | 540 | * [https://github.com/ai-ng/2txt](https://github.com/ai-ng/2txt) |
543 | | -* [https://github.com/facebookresearch/nougat](https://github.com/facebookresearch/nougat) |
544 | 541 | * [https://github.com/Greedysky/TTKOCR](https://github.com/Greedysky/TTKOCR) |
545 | 542 | * [https://github.com/robbyzhaox/myocr](https://github.com/robbyzhaox/myocr) |
546 | 543 | * [https://github.com/Calamari-OCR/calamari](https://github.com/Calamari-OCR/calamari) |
547 | 544 | * [https://github.com/JaidedAI/EasyOCR](https://github.com/JaidedAI/EasyOCR) |
548 | 545 | * [https://github.com/mindee/doctr](https://github.com/mindee/doctr) |
549 | 546 | * [https://github.com/xushengfeng/eSearch](https://github.com/xushengfeng/eSearch) |
550 | 547 | * [https://github.com/Nutlope/llama-ocr](https://github.com/Nutlope/llama-ocr) |
551 | | -* [https://github.com/Ucas-HaoranWei/GOT-OCR2.0](https://github.com/Ucas-HaoranWei/GOT-OCR2.0) |
552 | 548 | * [https://github.com/OpenGVLab/InternVL](https://github.com/OpenGVLab/InternVL) |
553 | 549 | * [https://github.com/allenai/olmocr](https://github.com/allenai/olmocr) |
554 | 550 | * [https://github.com/getomni-ai/zerox](https://github.com/getomni-ai/zerox) |
555 | | -* [https://github.com/ocrmypdf/OCRmyPDF](https://github.com/ocrmypdf/OCRmyPDF) |
556 | 551 | * [https://github.com/vikParuchuri/marker](https://github.com/vikParuchuri/marker) |
557 | 552 | * [https://github.com/open-mmlab/mmocr](https://github.com/open-mmlab/mmocr) |
558 | 553 | * [https://github.com/clovaai/donut](https://github.com/clovaai/donut) |
559 | 554 | * [https://github.com/chineseocr](https://github.com/chineseocr) |
560 | 555 | * [https://github.com/docling-project/docling](https://github.com/docling-project/docling) |
| 556 | + * [https://huggingface.co/ibm-granite/granite-docling-258M](https://huggingface.co/ibm-granite/granite-docling-258M) |
561 | 557 | * [https://github.com/breezedeus/CnOCR](https://github.com/breezedeus/CnOCR) |
| 558 | + * [https://github.com/breezedeus/pix2text](https://github.com/breezedeus/pix2text) |
| 559 | +* [https://github.com/Unstructured-IO/unstructured](https://github.com/Unstructured-IO/unstructured) |
562 | 560 | * [https://github.com/allenai/olmocr](https://github.com/allenai/olmocr) |
563 | 561 | * [https://github.com/xyTom/snippai](https://github.com/xyTom/snippai) |
564 | 562 | * [https://github.com/ouyanghuiyu/chineseocr_lite](https://github.com/ouyanghuiyu/chineseocr_lite) |
|
582 | 580 | * 说图解画 [https://github.com/ShurshanX/AI-Image-Description](https://github.com/ShurshanX/AI-Image-Description) |
583 | 581 |
|
584 | 582 |
|
| 583 | +- 扫描PDF [https://github.com/baicunko/scanyourpdf](https://github.com/baicunko/scanyourpdf) |
| 584 | +- [https://github.com/eosphoros-ai/DB-GPT](https://github.com/eosphoros-ai/DB-GPT) |
| 585 | +- [https://github.com/oomol-lab/pdf-craft](https://github.com/oomol-lab/pdf-craft) |
| 586 | +- PDF翻译 [https://github.com/Byaidu/PDFMathTranslate](https://github.com/Byaidu/PDFMathTranslate) |
| 587 | +- [https://github.com/Feather-2/paper-burner-x](https://github.com/Feather-2/paper-burner-x) |
| 588 | +- [https://github.com/xunbu/docutranslate](https://github.com/xunbu/docutranslate) |
| 589 | +- PDF转Markdown [https://github.com/jorben/markpdfdown](https://github.com/jorben/markpdfdown) |
| 590 | +- PDF转文本 [https://github.com/datalab-to/marker](https://github.com/datalab-to/marker) |
| 591 | +- [https://github.com/ApurveKaranwal/pyOCR](https://github.com/ApurveKaranwal/pyOCR) |
| 592 | +- [https://github.com/facebookresearch/nougat](https://github.com/facebookresearch/nougat) |
| 593 | +- [https://github.com/ocrmypdf/OCRmyPDF](https://github.com/ocrmypdf/OCRmyPDF) |
| 594 | +- [https://github.com/Filimoa/open-parse](https://github.com/Filimoa/open-parse) |
| 595 | + |
| 596 | + |
| 597 | + |
| 598 | +**Vision Language Models(VLM)** |
| 599 | + |
| 600 | +* [https://github.com/rednote-hilab/dots.ocr](https://github.com/rednote-hilab/dots.ocr) |
| 601 | +* [https://github.com/allenai/olmocr](https://github.com/allenai/olmocr) |
| 602 | +* [https://github.com/datalab-to/chandra](https://github.com/datalab-to/chandra) |
| 603 | +* [https://github.com/opendatalab/MinerU](https://github.com/opendatalab/MinerU) |
| 604 | +* [https://github.com/Ucas-HaoranWei/GOT-OCR2.0](https://github.com/Ucas-HaoranWei/GOT-OCR2.0) |
| 605 | +* [https://github.com/EvolvingLMMs-Lab/lmms-eval](https://github.com/EvolvingLMMs-Lab/lmms-eval) |
| 606 | + |
| 607 | + |
585 | 608 |
|
586 | 609 |
|
587 | 610 | ## 通用聊天机器人 |
|
0 commit comments