June 13, 2023, 13:15:06

LLM Fine Tuning - Supervised Fine Tuning Trainer (SFTTrainer) vs transformers Trainer

Question

When should one choose the Supervised Fine Tuning Trainer (SFTTrainer) over the regular Transformers Trainer for instruction fine-tuning of large language models (LLMs)? From what I gather, the regular Transformers Trainer typically refers to unsupervised fine-tuning, often used for tasks such as input-output schema formatting after supervised fine-tuning has been done. There seem to be various examples of fine-tuning tasks with similar characteristics, some using the SFTTrainer and others using the regular Trainer. Which factors should be considered when choosing between the two approaches?

I am looking to fine-tune an LLM to perform JSON-to-JSON transformation (matching texts in JSON) using the Hugging Face and trl libraries.

Answer 1

Score: 0

SFTTrainer is the same as Trainer, but it also accepts a PEFT config, so it can run LoRA fine-tuning.
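
To make that concrete, here is a minimal sketch of passing a PEFT config to SFTTrainer for the JSON-to-JSON task from the question. It is an illustration under assumptions, not a reference setup: it assumes a recent trl release that provides SFTConfig and reads a "text" column by default. The helper names (format_example, build_trainer), the "### Input:" / "### Output:" prompt layout, the LoRA hyperparameters, and the output directory are all hypothetical choices made for this example.

```python
import json


def format_example(source: dict, target: dict) -> dict:
    """Render one JSON-to-JSON pair as a single training string.

    The "### Input:" / "### Output:" layout is an assumed convention,
    not something required by trl.
    """
    prompt = "### Input:\n" + json.dumps(source, sort_keys=True)
    completion = "### Output:\n" + json.dumps(target, sort_keys=True)
    return {"text": prompt + "\n" + completion}


def build_trainer(model_name: str, pairs: list, output_dir: str = "sft-out"):
    """Build an SFTTrainer that applies LoRA through peft_config.

    Assumes trl, peft and datasets are installed; the imports are local
    so that format_example can be used without them.
    """
    from datasets import Dataset
    from peft import LoraConfig
    from trl import SFTConfig, SFTTrainer

    # One row per (source, target) pair, stored in a "text" column.
    dataset = Dataset.from_list([format_example(s, t) for s, t in pairs])

    # Illustrative LoRA settings; tune them for the actual model and data.
    lora = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05, task_type="CAUSAL_LM")

    return SFTTrainer(
        model=model_name,
        args=SFTConfig(output_dir=output_dir),
        train_dataset=dataset,
        peft_config=lora,
    )


example = format_example({"name": "Ada"}, {"full_name": "Ada"})
print(example["text"])
```

Calling build_trainer("some-causal-lm", pairs).train() would then start the LoRA run, where "some-causal-lm" stands in for whichever model ID is actually used. With the plain Trainer, the tokenization and the PEFT wrapping would have to be done by hand before training.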
