
ms-swift

Stars: 10,790 | Forks: 935 | Language: Python

Project Description

Use PEFT or full-parameter training to run CPT/SFT/DPO/GRPO on 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).


Project Title

ms-swift — Scalable and Lightweight Infrastructure for Fine-Tuning Large Language Models

Overview

ms-swift is an official framework provided by the ModelScope community designed for fine-tuning and deploying large language models (LLMs) and multi-modal large models. It supports training, inference, evaluation, quantization, and deployment for over 500 LLMs and 200 multi-modal large models, incorporating the latest training technologies and lightweight techniques.

Key Features

  • Supports training and deployment of 500+ LLMs and 200+ multi-modal models
  • Incorporates parameter-efficient training techniques such as LoRA and QLoRA, and alignment methods such as DPO and GRPO
  • Accelerates inference, evaluation, and deployment with vLLM, SGLang, and LMDeploy
  • Offers model quantization with GPTQ, AWQ, and BNB (bitsandbytes)
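
To make the LoRA item above more concrete, here is a minimal sketch in plain Python of the low-rank update that LoRA adds to a frozen linear layer, and of how many parameters it trains. This is not ms-swift code and does not use its API. The function names (`lora_param_counts`, `matvec`, `lora_forward`) and the layer sizes are hypothetical and chosen only for illustration.

```python
# Illustrative sketch of LoRA for a single linear layer (not ms-swift code).
# All names and sizes here are assumptions made for the example.

def lora_param_counts(d_in, d_out, rank):
    """Return (full, lora) trainable-parameter counts for one linear layer."""
    full = d_in * d_out            # dense weight W with shape (d_out, d_in)
    lora = rank * (d_in + d_out)   # A: (rank, d_in) plus B: (d_out, rank)
    return full, lora


def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]


def lora_forward(W, A, B, x, alpha, rank):
    """Compute y = W x + (alpha / rank) * B (A x).

    W stays frozen during fine-tuning; only A and B would be trained.
    """
    base = matvec(W, x)
    update = matvec(B, matvec(A, x))
    scale = alpha / rank
    return [b + scale * u for b, u in zip(base, update)]


if __name__ == "__main__":
    full, lora = lora_param_counts(4096, 4096, 8)
    # Share of the layer's parameters that LoRA trains at rank 8.
    print(full, lora, f"{lora / full:.2%}")
```

At rank 8 on a hypothetical 4096 x 4096 layer, the adapter trains 65,536 parameters against 16,777,216 in the dense weight, which is where the memory savings of such methods come from. QLoRA follows the same idea but keeps the frozen weight in a quantized form.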

Use Cases

  • Researchers and developers fine-tuning and deploying large language models for various applications
  • Enterprises leveraging multi-modal models for tasks like image and text processing
  • Academia using pre-trained models for research and development in natural language processing

Advantages

  • Supports a wide range of model types and dataset types for comprehensive training and deployment
  • Compatible with various hardware, including CPUs, GPUs, and NPUs
  • Provides a Gradio-based Web UI for easy interaction and a wealth of best practices

Limitations / Considerations

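  • The framework itself is released under the Apache License 2.0, but each model and dataset carries its own license, which may restrict use in commercial applications
  • The breadth of supported training methods, inference backends, and quantization options means new users face a learning curve before they can use all features effectively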

Similar / Related Projects

  • Hugging Face Transformers: A library of pre-trained models for Natural Language Processing. It covers a wide array of NLP tasks and model types rather than focusing on fine-tuning and deployment workflows.
  • TensorFlow: An open-source machine learning library whose scope extends well beyond LLMs and multi-modal models.
  • PyTorch: An open-source, general-purpose machine learning library for Python, known for its flexibility and dynamic computation graph.

Basic Information


📊 Project Information

  • Project Name: ms-swift
  • GitHub URL: https://github.com/modelscope/ms-swift
  • Programming Language: Python
  • ⭐ Stars: 10,076
  • 🍴 Forks: 885
  • 📅 Created: 2023-08-01
  • 🔄 Last Updated: 2025-09-25

🏷️ Project Topics

Topics: deepseek-r1, embedding, grpo, internvl, liger, llama, llama4, llm, lora, megatron, moe, multimodal, open-r1, peft, qwen3, qwen3-next, qwen3-omni, qwen3-vl, reranker, sft



This article is automatically generated by AI based on GitHub project information and README content analysis


Project Information

Created on 8/1/2023
Updated on 10/31/2025