Skip to content
Change the repository type filter

All

    Repositories list

    • AI-SIM

      Public
      0000Updated Jul 25, 2025Jul 25, 2025
    • Jupyter Notebook
      0000Updated Jul 23, 2025Jul 23, 2025
    • A project of automatically extracting migration records from Finnish church books.
      0100Updated Jul 15, 2025Jul 15, 2025
    • Handwritten text recognition annotations
      Python
      0000Updated Jul 14, 2025Jul 14, 2025
    • Turku NLP list of publications
      TeX
      3000Updated Jul 8, 2025Jul 8, 2025
    • Jupyter Notebook
      0000Updated Jun 27, 2025Jun 27, 2025
    • A Jekyll version of the "Editorial" theme by HTML5 UP.
      JavaScript
      156300Updated Jun 25, 2025Jun 25, 2025
    • Handwritten text recognition pipeline for table data
      Jupyter Notebook
      0000Updated Jun 18, 2025Jun 18, 2025
    • TCBLex

      Public
      Jupyter Notebook
      0000Updated Jun 16, 2025Jun 16, 2025
    • collab on elephant health score prediction
      Python
      0000Updated Jun 13, 2025Jun 13, 2025
    • Python
      0000Updated Jun 9, 2025Jun 9, 2025
    • Code for FinerWeb-10BT – tools for cleaning web data line by line using LLMs
      Python
      2000Updated Jun 4, 2025Jun 4, 2025
    • Jupyter Notebook
      0000Updated Jun 4, 2025Jun 4, 2025
    • Jupyter Notebook
      0000Updated Jun 3, 2025Jun 3, 2025
    • Python
      1400Updated May 30, 2025May 30, 2025
    • Python
      1100Updated May 15, 2025May 15, 2025
    • Introduction to Natural Language Processing
      Jupyter Notebook
      26500Updated May 14, 2025May 14, 2025
    • 0300Updated May 9, 2025May 9, 2025
    • Code for the paper "Analyzing register variation in web texts through automatic segmentation"
      Python
      0000Updated May 2, 2025May 2, 2025
    • HTML
      0460Updated Apr 30, 2025Apr 30, 2025
    • 0000Updated Apr 2, 2025Apr 2, 2025
    • Jupyter Notebook
      0400Updated Mar 8, 2025Mar 8, 2025
    • 0600Updated Mar 5, 2025Mar 5, 2025
    • 0000Updated Jan 31, 2025Jan 31, 2025
    • 0000Updated Jan 30, 2025Jan 30, 2025
    • FinCORE

      Public
      Finnish Corpus of Online REgisters
      Python
      0200Updated Jan 29, 2025Jan 29, 2025
    • Stuff for the Text Mining course
      Jupyter Notebook
      92800Updated Jan 28, 2025Jan 28, 2025
    • Code for the large LUMI run of ECCO ocr correction
      Python
      0000Updated Jan 16, 2025Jan 16, 2025
    • Clusters with keywords grouped based on their word embeddings
      0100Updated Jan 14, 2025Jan 14, 2025
    • Code to try out ocr postcorrection with language models
      Jupyter Notebook
      0210Updated Dec 16, 2024Dec 16, 2024