If you have any questions about the library, please feel free to contact us. Email: [email protected]
A comprehensive list of papers about '[Knowledge Fusion: The Integration of Model Capabilities.]'.
As the comprehensive capabilities of foundational large models rapidly improve, similar general abilities have emerged across different models, making capability transfer and fusion between them more feasible. Knowledge fusion aims to integrate existing LLMs of diverse architectures and capabilities into a more powerful model through efficient methods such as knowledge distillation, model merging, mixture of experts, and PEFT, thereby reducing the need for costly LLM development and adaptation. We provide a comprehensive overview of model merging methods and theories, covering their applications across various fields and scenarios, including LLMs, MLLMs, image generation, model compression, continual learning, and more. Finally, we highlight the challenges of knowledge fusion and explore future research directions.
- Awesome-Knowledge-Fuse
| Paper Title | Pub & Date | Key Word |
|---|---|---|
| Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic | 2024 | Arxiv |
| Tangent Transformers for Composition,Privacy and Removal | 2024 | ICLR |
| Parameter Efficient Multi-task Model Fusion with Partial Linearization | 2024 | ICLR |
| Task Arithmetic in the Tangent Space: Improved Editing of Pre-Trained Models | 2023 | NeurIPS |
| Paper Title | Year | Conference/Journal |
|---|---|---|
| Composing parameter-efficient modules with arithmetic operation | 2023 | NeurIPS |
| Editing models with task arithmetic | 2023 | ICLR |
| Model fusion via optimal transport | 2020 | NeurIPS |
| Weight averaging for neural networks and local resampling schemes | 1996 | AAAI Workshop |
Star History
We invite all researchers to contribute to this repository, 'Knowledge Fusion: The Integration of Model Capabilities'. If you have any questions about the library, please feel free to contact us.
Email: [email protected]
