How to use MLflow to load the logged/saved model in Azure ML?
Question
I want to deploy a trained ML model via Azure ML online endpoints.
I have already registered my model in the workspace.
Now I am getting the following error when I try to load the model with mlflow.pyfunc.load_model() in a custom score.py.
This is my code -
model_path = os.path.join(os.getenv("AZUREML_MODEL_DIR"), "use-case1-model")
model = mlflow.pyfunc.load_model(model_path)
score.py
import logging
import os
import json
import mlflow
from io import StringIO
from mlflow.pyfunc.scoring_server import infer_and_parse_json_input, predictions_to_json
import sys
from time import strftime, localtime
from collections import Counter
from pytorch_transformers import BertTokenizer
import random
import numpy as np
import torch
from tqdm import tqdm
def init():
    global model
    # "model" is the path of the mlflow artifacts when the model was registered. For automl
    # models, this is generally "mlflow-model".
    model_path = os.path.join(os.getenv("AZUREML_MODEL_DIR"), "use-case1-model")
    model = mlflow.pyfunc.load_model(model_path)
    logging.info("Init complete")

def run(raw_data):
    data = json.loads(raw_data)
    title = json.dumps(data["title"])
    att = json.dumps(data["attributes"])
    output = model.predict([tensor_t, tensor_a])
    predict_list = output.tolist()[0]
    result = StringIO()
    predictions_to_json(predict_list, result)
    return result.getvalue()
Error that I am getting -
  File "/azureml-envs/azureml_9a3b1e0a66d72d612aebc12b4a285f72/lib/python3.9/site-packages/azureml_inference_server_http/server/user_script.py", line 117, in invoke_init
    self._user_init()
  File "/var/azureml-app/dependencies/score.py", line 21, in init
    model = mlflow.pyfunc.load_model(model_path)
  File "/azureml-envs/azureml_9a3b1e0a66d72d612aebc12b4a285f72/lib/python3.9/site-packages/mlflow/pyfunc/__init__.py", line 735, in load_model
    model_impl = importlib.import_module(conf[MAIN])._load_pyfunc(data_path)
  File "/azureml-envs/azureml_9a3b1e0a66d72d612aebc12b4a285f72/lib/python3.9/site-packages/mlflow/pytorch/__init__.py", line 735, in _load_pyfunc
    return _PyTorchWrapper(_load_model(path, **kwargs))
  File "/azureml-envs/azureml_9a3b1e0a66d72d612aebc12b4a285f72/lib/python3.9/site-packages/mlflow/pytorch/__init__.py", line 643, in _load_model
    return torch.load(model_path, **kwargs)
  File "/azureml-envs/azureml_9a3b1e0a66d72d612aebc12b4a285f72/lib/python3.9/site-packages/torch/serialization.py", line 809, in load
    return _load(opened_zipfile, map_location, pickle_module, **pickle_load_args)
  File "/azureml-envs/azureml_9a3b1e0a66d72d612aebc12b4a285f72/lib/python3.9/site-packages/torch/serialization.py", line 1172, in _load
    result = unpickler.load()
  File "/azureml-envs/azureml_9a3b1e0a66d72d612aebc12b4a285f72/lib/python3.9/site-packages/torch/serialization.py", line 1142, in persistent_load
    typed_storage = load_tensor(dtype, nbytes, key, _maybe_decode_ascii(location))
  File "/azureml-envs/azureml_9a3b1e0a66d72d612aebc12b4a285f72/lib/python3.9/site-packages/torch/serialization.py", line 1116, in load_tensor
    wrap_storage=restore_location(storage, location),
  File "/azureml-envs/azureml_9a3b1e0a66d72d612aebc12b4a285f72/lib/python3.9/site-packages/torch/serialization.py", line 217, in default_restore_location
    result = fn(storage, location)
  File "/azureml-envs/azureml_9a3b1e0a66d72d612aebc12b4a285f72/lib/python3.9/site-packages/torch/serialization.py", line 182, in _cuda_deserialize
    device = validate_cuda_device(location)
  File "/azureml-envs/azureml_9a3b1e0a66d72d612aebc12b4a285f72/lib/python3.9/site-packages/torch/serialization.py", line 166, in validate_cuda_device
    raise RuntimeError('Attempting to deserialize object on a CUDA '
RuntimeError: Attempting to deserialize object on a CUDA device but torch.cuda.is_available() is False. If you are running on a CPU-only machine, please use torch.load with map_location=torch.device('cpu') to map your storages to the CPU.
How and where can I update map_location=torch.device('cpu')? mlflow.pyfunc.load_model() doesn't have a parameter to access map_location, and since the package is installed in the Docker image I cannot make changes to serialization.py.
Answer 1
Score: 0
As per the error logs, you are attempting to deserialize an object on a CUDA device, but torch.cuda.is_available() is returning False because you are running on a CPU-only machine. To resolve this, the underlying torch.load call needs map_location=torch.device('cpu') to map the storages to the CPU.

The mlflow.pyfunc.load_model() function does not have a map_location parameter, and it does not forward extra keyword arguments to torch.load(). In particular, a call such as

model = mlflow.pyfunc.load_model(model_path, *{'map_location': torch.device('cpu')})

does not work: unpacking a dict with a single * passes only its keys as positional arguments, so map_location never reaches torch.load().
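To see why the single-`*` form cannot work, here is a minimal stdlib sketch; the function below is a toy stand-in (not MLflow's real signature) just to show the unpacking semantics:

```python
# Toy stand-in for a loader whose second parameter is NOT map_location:
def load_model(path, suppress_warnings=False):
    return path, suppress_warnings

# Unpacking a dict with a single * yields only its KEYS, positionally,
# so the string 'map_location' lands in suppress_warnings:
result = load_model("model_path", *{"map_location": "cpu"})
print(result)  # → ('model_path', 'map_location')
```

The device object in the dict's value is silently discarded, which is why the CUDA deserialization error persists with this call.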
Instead, use the code below (updated solution): load the model through the PyTorch flavor, whose load_model() forwards extra keyword arguments to torch.load().

model = mlflow.pytorch.load_model(model_path, map_location=torch.device('cpu'))

Example:

import mlflow
import torch

path = "./deploy/credit_defaults_model/"
model = mlflow.pytorch.load_model(path, map_location=torch.device('cpu'))
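Applied to the scoring script, a revised init() might look like the sketch below. This is a sketch under assumptions: it reuses the "use-case1-model" layout from the question, and mlflow/torch are imported lazily inside init() so the module stays importable without them.

```python
import os

def resolve_model_path(model_dirname="use-case1-model"):
    # AZUREML_MODEL_DIR is set by the Azure ML inference server;
    # fall back to the current directory for local testing.
    base = os.getenv("AZUREML_MODEL_DIR", ".")
    return os.path.join(base, model_dirname)

def init():
    global model
    import mlflow.pytorch  # assumed available in the scoring environment
    import torch
    # Load through the PyTorch flavor so map_location reaches torch.load().
    model = mlflow.pytorch.load_model(
        resolve_model_path(), map_location=torch.device("cpu")
    )
```

Note that mlflow.pytorch.load_model() returns the underlying torch.nn.Module rather than a PyFuncModel, so run() would call the model directly (e.g. model(inputs)) instead of model.predict().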