mistral-finetune

3.1k Stars · +8 Stars/month · 0 Releases (6m)

Star Growth

[Chart: ~3.0k → 3.1k stars, Mar 27 to Apr 1]

Overview

mistral-finetune is a lightweight codebase designed for memory-efficient fine-tuning of Mistral models. Built on the LoRA (Low-Rank Adaptation) training paradigm, it freezes most of the model's weights and trains only 1-2% additional weights, in the form of low-rank matrix perturbations. This approach significantly lowers the memory requirements and compute cost of fine-tuning while maintaining strong performance. The codebase is optimized for multi-GPU, single-node training setups and supports Mistral models at scales from 7B to 123B. The latest version adds support for Mistral Large v2 and Mistral Nemo, giving users a complete fine-tuning solution from small to large models. The tool is especially well suited to researchers and developers who need customized large language models on a limited hardware budget.
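
To make the low-rank perturbation idea concrete, here is a minimal PyTorch-style sketch of a LoRA linear layer. It illustrates the general technique rather than mistral-finetune's actual implementation; the class and variable names are hypothetical:

    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        """A frozen base linear layer plus a trainable low-rank update B @ A."""
        def __init__(self, base: nn.Linear, rank: int = 16, alpha: float = 32.0):
            super().__init__()
            self.base = base
            for p in self.base.parameters():
                p.requires_grad_(False)          # freeze the pretrained weights
            # Only these two small matrices are trained during fine-tuning.
            self.lora_a = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
            self.lora_b = nn.Parameter(torch.zeros(base.out_features, rank))
            self.scale = alpha / rank

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            # y = W x + scale * B(A x); B starts at zero, so training begins
            # from the unmodified pretrained model.
            return self.base(x) + self.scale * (x @ self.lora_a.T @ self.lora_b.T)

    layer = LoRALinear(nn.Linear(4096, 4096), rank=16)
    trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
    total = sum(p.numel() for p in layer.parameters())
    print(f"trainable: {trainable / total:.2%} of parameters")  # ~0.8% at rank 16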

Deep Analysis

Key Differentiator

vs torchtune / Axolotl / generic LoRA: official Mistral fine-tuning codebase optimized specifically for Mistral model architectures — supports the full Mistral family from 7B to 123B with strict data validation

Capabilities

  • Memory-efficient LoRA fine-tuning of Mistral models
  • Support for 7B, 8x7B, 8x22B, 12B (Nemo), and 123B (Large v2) models
  • Multi-GPU single-node training optimization
  • Instruct and pretrain data format support
  • Function calling fine-tuning with tool definitions
  • Strict data formatting validation
  • Per-turn loss weighting for conversation data (see the data sketch after this list)
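
The data-format bullets above correspond to JSONL training files. The sketch below writes instruct-format samples in the shape described by the repository's README; the exact field names (messages, weight, and so on) should be verified against the current docs:

    import json

    samples = [
        # Plain instruct sample: assistant turns are trained on by default.
        {"messages": [
            {"role": "user", "content": "Summarize LoRA in one sentence."},
            {"role": "assistant",
             "content": "LoRA freezes the base model and trains small low-rank adapters."},
        ]},
        # Per-turn weighting: weight 0 excludes an assistant turn from the loss.
        {"messages": [
            {"role": "user", "content": "Say hi."},
            {"role": "assistant", "content": "Hi!", "weight": 0},
            {"role": "user", "content": "Now explain fine-tuning."},
            {"role": "assistant",
             "content": "Fine-tuning adapts a pretrained model to new data.",
             "weight": 1},
        ]},
    ]

    with open("train.jsonl", "w") as f:
        for sample in samples:
            f.write(json.dumps(sample) + "\n")

Per the README, function-calling data follows the same structure, with an added top-level tools list of function definitions and assistant messages that carry tool calls.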

🔗 Integrations

Mistral models · PyTorch · Tekken tokenizer · Colab notebooks

Best For

  • Teams fine-tuning Mistral models for domain-specific tasks
  • Production LoRA training with official Mistral tooling
  • Function calling and instruction-following fine-tuning

Not Ideal For

  • Fine-tuning non-Mistral models (use torchtune instead)
  • Users without access to high-end GPUs
  • Full parameter fine-tuning requirements

Languages

Python

Deployment

pip install -r requirements.txt (A100/H100 GPU recommended)
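
Per the repository README, the strict data validation is exposed as utility scripts that can be run before training; a typical invocation looks roughly like this (module names taken from the README and worth verifying against the current repo):

    # Normalize common formatting issues, then validate against the schema.
    python -m utils.reformat_data /path/to/data.jsonl
    python -m utils.validate_data --train_yaml example/7B.yaml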

Known Limitations

  • Optimized for A100/H100 GPUs — limited on consumer hardware
  • Mistral models only (not general-purpose fine-tuning)
  • Strict data format requirements (JSONL with specific schemas)
  • Large models require significant VRAM (123B needs multi-GPU; see the rough estimate after this list)
  • LoRA only — no full fine-tuning support
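
A rough back-of-envelope for the VRAM limitation above, assuming bf16 base weights and fp32 Adam states kept only for the LoRA parameters (illustrative numbers, not measurements):

    # Rough VRAM estimate for LoRA fine-tuning (illustrative only).
    def vram_gb(params_billion: float, lora_fraction: float = 0.02) -> float:
        weights = params_billion * 2            # bf16 base weights: 2 bytes/param
        lora = params_billion * lora_fraction   # trainable adapter params
        adam = lora * 4 * 3                     # fp32 param copy + two moments
        return weights + adam                   # activations/gradients excluded

    for size in (7, 12, 123):
        print(f"{size}B: ~{vram_gb(size):.0f} GB before activations")
    # 7B fits on a single 80 GB A100; 123B clearly needs multiple GPUs.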

Pros

  • + Extremely memory-efficient: LoRA trains only 1-2% of the parameters, sharply lowering hardware requirements
  • + Supports the complete Mistral model family, from 7B to 123B, covering a wide range of scenarios
  • + Optimized for multi-GPU training, with excellent performance on high-end GPUs such as the A100/H100

Cons

  • - Fairly rigid, opinionated implementation, particularly around data formats, with limited flexibility
  • - High peak memory requirements for some models (such as Mistral Nemo)
  • - Focused on the Mistral model family; other architectures are not supported

Use Cases

  • Fine-tuning Mistral models for domain-specific tasks such as finance, healthcare, or legal text processing
  • Customized training of large language models in resource-constrained environments
  • Targeted optimization and deployment of Mistral models within research institutions or enterprises

Getting Started

1. Clone the repository and install dependencies: git clone https://github.com/mistralai/mistral-finetune.git && pip install -r requirements.txt
2. Download the target Mistral model weights to a local directory
3. Configure the training parameters and run the fine-tuning script to start training
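
For step 3, training is driven by a YAML config in the style of the repository's example/7B.yaml. The fragment below is a sketch with field names assumed from those examples (verify against the repo), followed by the multi-GPU launch command shown in the README:

    # config.yaml - sketch in the style of example/7B.yaml (field names assumed)
    data:
      instruct_data: "/path/to/train.jsonl"       # validated training data
      eval_instruct_data: "/path/to/eval.jsonl"   # optional eval split
    model_id_or_path: "/path/to/mistral-7b"       # weights downloaded in step 2
    lora:
      rank: 64
    seq_len: 32768
    batch_size: 1
    max_steps: 300
    optim:
      lr: 6.e-5
      weight_decay: 0.1
    run_dir: "/path/to/run_output"

    # Launch on one node with 8 GPUs:
    torchrun --nproc-per-node 8 -m train config.yaml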
