An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
Aligning Large Language Models with Human: A Survey
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
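As background, the "prompting with nothing" idea can be sketched directly: give an aligned chat model only its pre-query template (everything that normally precedes the user's text), let it complete the template with a plausible user instruction, then ask it to answer that instruction. The model ID, the Llama-3 template string, and the sampling settings below are assumptions for illustration, not the repository's exact pipeline.

```python
# Minimal sketch, assuming a Llama-3-style chat template. The aligned model
# completes the empty user turn with a plausible instruction of its own.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"  # assumed; any aligned chat model
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Pre-query template: everything that normally precedes the user's text.
pre_query = "<|begin_of_text|><|start_header_id|>user<|end_header_id|>\n\n"
inputs = tok(pre_query, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=1.0)
instruction = tok.decode(out[0, inputs["input_ids"].shape[1]:], skip_special_tokens=True)

# Second pass: the same model answers the instruction it just wrote.
prompt_ids = tok.apply_chat_template(
    [{"role": "user", "content": instruction}],
    add_generation_prompt=True, return_tensors="pt",
).to(model.device)
resp = model.generate(prompt_ids, max_new_tokens=512, do_sample=True, temperature=0.7)
response = tok.decode(resp[0, prompt_ids.shape[1]:], skip_special_tokens=True)
print({"instruction": instruction, "response": response})
```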
A curated collection of open-source SFT datasets, updated continuously
The official implementation of InstructERC
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
We introduce ScaleQuest, a novel, scalable, and cost-effective data synthesis method that unleashes the reasoning capability of LLMs.
Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models
[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"
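As a point of reference, the contrast with standard SFT can be shown in a few lines: conventional recipes mask instruction tokens out of the loss, while "loss over instructions" keeps them as targets. Below is a minimal sketch of that masking choice, assuming the usual PyTorch convention of -100 as the ignore index; the function and variable names are hypothetical, not the paper's code.

```python
# Minimal sketch of the masking choice, using PyTorch's -100 ignore index.
# `instruction_len` and `loss_over_instructions` are hypothetical names.
import torch

IGNORE_INDEX = -100  # tokens with this label are excluded from cross-entropy

def build_labels(input_ids: torch.Tensor, instruction_len: int,
                 loss_over_instructions: bool) -> torch.Tensor:
    """input_ids: 1-D tensor of token ids for instruction + response."""
    labels = input_ids.clone()
    if not loss_over_instructions:
        # Standard SFT: supervise only the response tokens.
        labels[:instruction_len] = IGNORE_INDEX
    # Otherwise keep instruction tokens, so the LM loss covers the prompt too.
    return labels

ids = torch.tensor([11, 12, 13, 104, 105])  # toy ids; first 3 = instruction
print(build_labels(ids, 3, loss_over_instructions=False))  # [-100, -100, -100, 104, 105]
print(build_labels(ids, 3, loss_over_instructions=True))   # [11, 12, 13, 104, 105]
```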
Official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning"
EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud
Demo code for fine-tuning multimodal large language models with LLaMA-Factory
[AAAI 2025] Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
Finetuning Google's Gemma Model for Translating Natural Language into SQL
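As an illustration of what such a setup can look like, here is a hedged sketch of supervised fine-tuning a Gemma base model on (schema, question, SQL) triples with the HuggingFace Trainer. The model size, the b-mc2/sql-create-context dataset, and every hyperparameter are stand-ins for the sketch, not necessarily the repository's actual configuration.

```python
# Hedged sketch: causal-LM fine-tuning of Gemma on text-to-SQL pairs.
import torch
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

model_id = "google/gemma-2b"  # assumed base model; the repo may use another size
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

# A public text-to-SQL dataset with (context, question, answer) fields.
ds = load_dataset("b-mc2/sql-create-context", split="train")

def serialize(ex):
    # One training string per example: schema, question, then the target SQL.
    text = (f"{ex['context']}\n-- Question: {ex['question']}\n"
            f"-- SQL: {ex['answer']}{tok.eos_token}")
    return tok(text, truncation=True, max_length=512)

ds = ds.map(serialize, remove_columns=ds.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="gemma-nl2sql",
                           per_device_train_batch_size=2,
                           gradient_accumulation_steps=8,
                           num_train_epochs=1,
                           learning_rate=2e-5,
                           bf16=True,
                           logging_steps=50),
    train_dataset=ds,
    # mlm=False puts the collator in causal-LM mode: labels = input_ids.
    data_collator=DataCollatorForLanguageModeling(tok, mlm=False),
)
trainer.train()
```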
Code for the paper "Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity"
A case study of fine-tuning Qwen2-VL with LLaMA-Factory for the culture-and-tourism domain (historical literature and museums)
Various LMs/LLMs below 3B parameters (for now), trained with SFT (supervised fine-tuning) for several downstream tasks
An LLM challenge to (i) fine-tune a pre-trained HuggingFace transformer model into a code-generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain
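For part (ii), the RAG pattern itself is compact: embed a document collection, retrieve the nearest entries for a query, and prepend them to the generation prompt. Below is a minimal sketch using LangChain's FAISS vector store and a sentence-transformers embedding model; the documents and query are toy placeholders, and the resulting prompt would be sent to the fine-tuned code-generation model from part (i).

```python
# Minimal RAG sketch (requires langchain-community, faiss-cpu,
# sentence-transformers). Documents and query are toy placeholders.
from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import FAISS

docs = [
    "def add(a, b): return a + b",
    "def connect(url): ...  # opens an HTTP connection",
]
emb = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
store = FAISS.from_texts(docs, emb)

query = "write a function that sums two numbers"
hits = store.as_retriever(search_kwargs={"k": 1}).invoke(query)

# Augment the generation prompt with the retrieved context.
context = "\n".join(d.page_content for d in hits)
prompt = f"Context:\n{context}\n\nTask: {query}\nAnswer:"
print(prompt)  # would be passed to the fine-tuned code-generation LLM
```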