    Repositories list

    • Repository containing the open source code of works published at the FBK MT unit.
      Python
      44700 Updated Jul 8, 2025
    • Repository containing the open source code of the subtitler developed by the FBK MT unit.
      Python
      0200 Updated Jun 11, 2025
    • This repository contains the code associated with the Interspeech 2025 paper "Echoes of Phonetics: Unveiling Relevant Acoustic Cues for ASR via Feature Attribution".
      Jupyter Notebook
      1200 Updated May 28, 2025
    • This repository contains the code associated with the ACL 2025 paper "Different Speech Translation Models Encode and Translate Speaker Gender Differently".
      Python
      0100 Updated May 28, 2025
    • fbk-llm

      Public
      This repository contains all the code for the works of the FBK MT Unit on foundation models and LLMs.
      Python
      0710 Updated May 7, 2025
    • Python
      1300 Updated Apr 17, 2025
    • This repository contains the software to run the anonymization service developed within the CEF Data MarketPlace project.
      Dockerfile
      4110 Updated Nov 25, 2024
    • mosel

      Public
      Collection of Open Source Speech Data
      615900 Updated Nov 8, 2024
    • Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSIVE textual corpus.
      Python
      22200 Updated Aug 28, 2024
    • subsonar

      Public
      Evaluate the quality of SRT files using the multilingual multimodal SONAR model.
      Python
      01410 Updated May 18, 2024
    • pangolinn

      Public
      As a pangolin looks for bugs and catches them, the goal of this library is to help developers find bugs in their neural networks and newly created models.
      Python
      01310 Updated May 18, 2024
    • smarterp

      Public
      Python
      0000 Updated Feb 3, 2023
    • TMOP

      Public
      Translation Memory Open-source Purifier
      Python
      103420 Updated Nov 6, 2022
    • This repository contains the software to run the TM cleaning service developed within the CEF Data MarketPlace project.
      Dockerfile
      1200 Updated Aug 9, 2021
    • Dockerfile
      0000 Updated Jun 30, 2021
    • WIT3

      Public
      Stores some small data for the WIT3 website.
      0000 Updated Jan 12, 2021
    • This repository contains the software to run the TM clustering service developed within the CEF Data MarketPlace project.
      Dockerfile
      1000 Updated Dec 20, 2020
    • modernmt

      Public
      Neural Adaptive Machine Translation that adapts to context and learns from corrections.
      Java
      73000 Updated Feb 25, 2020
    • An open-source tool for automatic speech recognition (ASR) quality estimation.
      Python
      92340 Updated Dec 12, 2019
    • azure

      Public
      Shell
      0000 Updated Oct 3, 2017
    • NeuralMT

      Public
      FBK Neural Machine Translation Toolkit with instance-based adaptation
      Python
      2110 Updated Aug 11, 2017
    • Adaptive MT server
      Python
      0000 Updated Mar 11, 2017
    • nematus

      Public
      Python
      5510 Updated Jul 21, 2016
    • Computation using data flow graphs for scalable machine learning
      C++
      75k000 Updated Feb 1, 2016
    • CPMT

      Public
      Context Prediction Modelling Tool
      C++
      1000 Updated Oct 14, 2015
    • CSWA

      Public
      Continuous Space Word Alignment Model
      C++
      0100 Updated Sep 3, 2015
    • DTAMT

      Public
      Dynamic Topic Adaptation in Machine Translation
      Perl
      0100 Updated May 19, 2015