using llama_index with mac m1

Question #1:

Is there a way to use llama_index on a Mac with an M1 CPU?

I cannot get past the assertion error below:

AssertionError                            Traceback (most recent call last)
<ipython-input-1-f2d62b66882b> in <module>
      6 from transformers import pipeline
      7 
----> 8 class customLLM(LLM):
      9     model_name = "google/flan-t5-large"
     10     pipeline = pipeline("text2text-generation", model=model_name, device=0, model_kwargs={"torch_dtype":torch.bfloat16})

<ipython-input-1-f2d62b66882b> in customLLM()
      8 class customLLM(LLM):
      9     model_name = "google/flan-t5-large"
---> 10     pipeline = pipeline("text2text-generation", model=model_name, device=0, model_kwargs={"torch_dtype":torch.bfloat16})
     11 
     12     def _call(self, prompt, stop=None):

~/Library/Python/3.9/lib/python/site-packages/transformers/pipelines/__init__.py in pipeline(task, model, config, tokenizer, feature_extractor, framework, revision, use_fast, use_auth_token, device, device_map, torch_dtype, trust_remote_code, model_kwargs, pipeline_class, **kwargs)
    868         kwargs["device"] = device
    869 
--> 870     return pipeline_class(model=model, framework=framework, task=task, **kwargs)

~/Library/Python/3.9/lib/python/site-packages/transformers/pipelines/text2text_generation.py in __init__(self, *args, **kwargs)
     63 
     64     def __init__(self, *args, **kwargs):
---> 65         super().__init__(*args, **kwargs)
     66 
     67         self.check_model_type(

~/Library/Python/3.9/lib/python/site-packages/transformers/pipelines/base.py in __init__(self, model, tokenizer, feature_extractor, modelcard, framework, task, args_parser, device, binary_output, **kwargs)
    776         # Special handling
    777         if self.framework == "pt" and self.device.type != "cpu":
--> 778             self.model = self.model.to(self.device)
    779 
    780         # Update config with task specific parameters

~/Library/Python/3.9/lib/python/site-packages/transformers/modeling_utils.py in to(self, *args, **kwargs)
   1680             )
   1681         else:
-> 1682             return super().to(*args, **kwargs)
   1683 
   1684     def half(self, *args):

~/Library/Python/3.9/lib/python/site-packages/torch/nn/modules/module.py in to(self, *args, **kwargs)
   1143             return t.to(device, dtype if t.is_floating_point() or t.is_complex() else None, non_blocking)
   1144 
-> 1145         return self._apply(convert)
   1146 
   1147     def register_full_backward_pre_hook(

~/Library/Python/3.9/lib/python/site-packages/torch/nn/modules/module.py in _apply(self, fn)
    795     def _apply(self, fn):
    796         for module in self.children():
--> 797             module._apply(fn)
    798 
    799         def compute_should_use_set_data(tensor, tensor_applied):

~/Library/Python/3.9/lib/python/site-packages/torch/nn/modules/module.py in _apply(self, fn)
    818             # `with torch.no_grad():`
    819             with torch.no_grad():
--> 820                 param_applied = fn(param)
    821             should_use_set_data = compute_should_use_set_data(param, param_applied)
    822             if should_use_set_data:

~/Library/Python/3.9/lib/python/site-packages/torch/nn/modules/module.py in convert(t)
   1141                 return t.to(device, dtype if t.is_floating_point() or t.is_complex() else None,
   1142                             non_blocking, memory_format=convert_to_format)
-> 1143             return t.to(device, dtype if t.is_floating_point() or t.is_complex() else None, non_blocking)
   1144 
   1145         return self._apply(convert)

~/Library/Python/3.9/lib/python/site-packages/torch/cuda/__init__.py in _lazy_init()
    237                 "multiprocessing, you must use the 'spawn' start method")
    238         if not hasattr(torch._C, '_cuda_getDeviceCount'):
--> 239             raise AssertionError("Torch not compiled with CUDA enabled")
    240         if _cudart is None:
    241             raise AssertionError(

AssertionError: Torch not compiled with CUDA enabled

Obviously I have no Nvidia card, but I've read that PyTorch now supports Apple Silicon (M1) Macs as well.
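
A quick way to confirm that the local PyTorch build actually ships the Apple Silicon (MPS) backend, before trying to move work off the CPU:

import torch

# Report whether this PyTorch build includes the MPS backend and
# whether it is usable on the current machine.
print("MPS built:    ", torch.backends.mps.is_built())
print("MPS available:", torch.backends.mps.is_available())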

I'm trying to run the below example:

from llama_index import SimpleDirectoryReader, LangchainEmbedding, GPTListIndex, GPTSimpleVectorIndex, PromptHelper
from langchain.embeddings.huggingface import HuggingFaceEmbeddings
from llama_index import LLMPredictor, ServiceContext
import torch
from langchain.llms.base import LLM
from transformers import pipeline

class customLLM(LLM):
    model_name = "google/flan-t5-large"
    # device=0 requests the first CUDA GPU, which is what triggers the AssertionError above
    pipeline = pipeline("text2text-generation", model=model_name, device=0, model_kwargs={"torch_dtype": torch.bfloat16})

    def _call(self, prompt, stop=None):
        return self.pipeline(prompt, max_length=9999)[0]["generated_text"]
 
    def _identifying_params(self):
        return {"name_of_model": self.model_name}

    def _llm_type(self):
        return "custom"


llm_predictor = LLMPredictor(llm=customLLM())

Question #2:

Assuming the answer to the above is no: I don't mind using Google Colab with a GPU, but once the index has been built, will it be possible to download it and use it on my Mac?

i.e. something like:

on Google Colab:

service_context = ServiceContext.from_defaults(llm_predictor=llm_predictor, embed_model=embed_model)
index = GPTSimpleVectorIndex.from_documents(documents, service_context=service_context)
index.save_to_disk('index.json')

... and later on my Mac use load_from_file
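
A rough sketch of the Mac-side load I have in mind (this assumes the counterpart of save_to_disk in this llama_index version is load_from_disk, and that a locally working service_context with a CPU-based llm_predictor and embed_model is recreated first):

from llama_index import GPTSimpleVectorIndex, ServiceContext

# Recreate a service_context that runs locally, then load the index
# that was built and saved on Colab.
service_context = ServiceContext.from_defaults(llm_predictor=llm_predictor, embed_model=embed_model)
index = GPTSimpleVectorIndex.load_from_disk('index.json', service_context=service_context)

response = index.query("What does the document say about the topic?")
print(response)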

Answer 1

Score: 1

Why are you passing device=0? If isinstance(device, int), PyTorch will assume device is the index of a CUDA device, hence the error. Try device="cpu" (or maybe simply removing the device kwarg), and this issue should disappear.
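
For reference, a minimal sketch of the questioner's pipeline with only the device argument changed as suggested. device="cpu" always works; on Apple Silicon you could also try device="mps" when torch.backends.mps.is_available() returns True, though bfloat16 support on MPS varies by PyTorch version:

import torch
from transformers import pipeline

model_name = "google/flan-t5-large"

# Pass a device string instead of a CUDA index; "cpu" sidesteps the
# "Torch not compiled with CUDA enabled" assertion entirely.
# torch_dtype can be dropped if bfloat16 is slow or unsupported locally.
pipe = pipeline(
    "text2text-generation",
    model=model_name,
    device="cpu",
    model_kwargs={"torch_dtype": torch.bfloat16},
)

print(pipe("What is the capital of France?", max_length=64)[0]["generated_text"])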
