Skip to content
Change the repository type filter

All

    Repositories list

    • docling

      Public
      Get your documents ready for gen AI
      Python
      MIT License
      2915.7k345Updated Nov 6, 2024Nov 6, 2024
    • Simple package to extract text with coordinates from programmatic PDFs
      C++
      MIT License
      72111Updated Nov 5, 2024Nov 5, 2024
    • A python library to define and validate data types in Docling.
      Python
      MIT License
      52422Updated Nov 1, 2024Nov 1, 2024
    • Python
      MIT License
      63342Updated Nov 1, 2024Nov 1, 2024
    • Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
      C++
      MIT License
      32011Updated Oct 23, 2024Oct 23, 2024
    • Interact with the Deep Search platform for new knowledge explorations and discoveries
      Python
      MIT License
      19131811Updated Oct 17, 2024Oct 17, 2024
    • Running Docling as an API service
      Makefile
      MIT License
      31201Updated Oct 11, 2024Oct 11, 2024
    • MolGrapher: Graph-based Visual Recognition of Chemical Structures
      Python
      MIT License
      14600Updated Oct 9, 2024Oct 9, 2024
    • CSS
      MIT License
      1700Updated Oct 8, 2024Oct 8, 2024
    • ci-tester

      Public
      0100Updated Sep 20, 2024Sep 20, 2024
    • quackling

      Public archive
      Build document-native LLM applications
      Python
      MIT License
      14900Updated Sep 11, 2024Sep 11, 2024
    • Mognet is a fast, simple framework to build distributed applications using task queues.
      Python
      MIT License
      2901Updated Aug 7, 2024Aug 7, 2024
    • Examples using the Deep Search functionalities
      Python
      MIT License
      144104Updated Aug 7, 2024Aug 7, 2024
    • PatCID

      Public
      Python
      MIT License
      02320Updated Aug 2, 2024Aug 2, 2024
    • Python
      MIT License
      0500Updated Jul 8, 2024Jul 8, 2024
    • Python
      MIT License
      0600Updated Jul 8, 2024Jul 8, 2024
    • SemTabNet

      Public
      Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"
      Python
      MIT License
      0400Updated Jul 1, 2024Jul 1, 2024
    • .github

      Public
      0100Updated Jun 24, 2024Jun 24, 2024
    • MolGrapher: Graph-based Visual Recognition of Chemical Structures
      Python
      MIT License
      0500Updated Mar 25, 2024Mar 25, 2024
    • Repository to detect scientific software in documents for Chan Zuckerberg Initiative workshop
      Python
      MIT License
      0200Updated Oct 26, 2023Oct 26, 2023
    • langchain

      Public
      ⚡ Building applications with LLMs through composability ⚡
      Python
      MIT License
      15k100Updated May 18, 2023May 18, 2023
    • Website of the ICDAR 2023 DocLayNet competition
      1100Updated Apr 26, 2023Apr 26, 2023
    • DocLayNet

      Public
      DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
      Other
      1526530Updated Feb 1, 2023Feb 1, 2023
    • Example NLP Annotator API used for integrating with the IBM DeepSearch CPS platform
      Python
      Apache License 2.0
      31000Updated Sep 8, 2022Sep 8, 2022