"The model 'MPTForCausalLM' is not supported for text-generation"- The following warning is coming when trying to use MPT-7B instruct


Question

I am using a GCP VM (e2-highmem-4: Efficient Instance, 4 vCPUs, 32 GB RAM) to load the model and use it. Here is the code I have written:

import torch
import transformers
from transformers import AutoTokenizer, pipeline

# Load the model config; trust_remote_code is needed because MPT ships
# custom modeling code on the Hugging Face Hub.
config = transformers.AutoConfig.from_pretrained(
  'mosaicml/mpt-7b-instruct',
  trust_remote_code=True,
)
# config.attn_config['attn_impl'] = 'flash'  # optional: FlashAttention (GPU only)

model = transformers.AutoModelForCausalLM.from_pretrained(
  'mosaicml/mpt-7b-instruct',
  config=config,
  torch_dtype=torch.bfloat16,
  trust_remote_code=True,
  cache_dir="./cache",
)

# MPT-7B was trained with the EleutherAI/gpt-neox-20b tokenizer.
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b", cache_dir="./cache")

text_gen = pipeline("text-generation", model=model, tokenizer=tokenizer)
text_gen(text_inputs="what is 2+2?")

Now the code takes far too long to generate text. Am I doing something wrong, or is there any way to make it faster?
Also, when creating the pipeline, I get the following warning:

The model 'MPTForCausalLM' is not supported for text-generation

I tried generating text with it, but it got stuck for a long time.


Answer 1

Score: 1


You might want to try a GPU instance, because trying to run large LLMs like this on CPUs is pretty much a lost cause.
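
For reference, here is a minimal sketch of loading the model onto a GPU instead, reusing the model id, dtype, and cache directory from the question (this assumes a CUDA device is available and the accelerate package is installed for device_map):

import torch
import transformers

# Sketch: load MPT-7B-Instruct onto the available GPU(s).
# device_map="auto" requires the `accelerate` package;
# use torch.float16 on GPUs without bfloat16 support.
model = transformers.AutoModelForCausalLM.from_pretrained(
  'mosaicml/mpt-7b-instruct',
  torch_dtype=torch.bfloat16,
  trust_remote_code=True,
  device_map="auto",
  cache_dir="./cache",
)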

Anyhow, I also got the "The model 'MPTForCausalLM' is not supported for text-generation" warning, which is why I ended up in this thread. Text generation did work for me despite the warning.
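
Since the warning comes from the pipeline's check of the model class rather than from generation itself, one way to sidestep it entirely is to skip pipeline() and call model.generate() directly. A minimal sketch, assuming the model and tokenizer from the question are already loaded:

import torch
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b", cache_dir="./cache")

# Tokenize the prompt and move it to the same device as the model.
inputs = tokenizer("what is 2+2?", return_tensors="pt").to(model.device)

# Generate without tracking gradients; max_new_tokens caps the output length.
with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=32)

print(tokenizer.decode(output_ids[0], skip_special_tokens=True))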
