英文:
Problems running Dl4j with CUDA
问题
我正在尝试在GPU上运行dl4j模型,而不是使用CPU。该模型在CPU上运行得非常好。
因此,我决定尝试使用CUDA来利用我的GPU。我按照这里中描述的每个步骤进行操作,并且在CUDA安装方面,我按照NVIDIA的指示从这里安装了CUDA Toolkit。代码编译正常,但是我遇到了一个错误:
Skipped [JCublasBackend] backend (unavailable): java.lang.UnsatisfiedLinkError: C:\Users\albertb\.javacpp\cache\cuda-10.2-7.6-1.5.3-windows-x86_64.jar\org\bytedeco\cuda\windows-x86_64\jnicudart.dll: 找不到依赖库
Exception in thread "main" java.lang.ExceptionInInitializerError
at TestCUDA.run(TestCUDA.java:12)
at TestCUDA.main(TestCUDA.java:7)
Caused by: java.lang.RuntimeException: org.nd4j.linalg.factory.Nd4jBackend$NoAvailableBackendException: 请确保您的类路径上有nd4j后端。请参阅:https://deeplearning4j.konduit.ai/nd4j/backend
at org.nd4j.linalg.factory.Nd4j.initContext(Nd4j.java:5094)
at org.nd4j.linalg.factory.Nd4j.<clinit>(Nd4j.java:270)
... 2 more
Caused by: org.nd4j.linalg.factory.Nd4jBackend$NoAvailableBackendException: 请确保您的类路径上有nd4j后端。请参阅:https://deeplearning4j.konduit.ai/nd4j/backend
at org.nd4j.linalg.factory.Nd4jBackend.load(Nd4jBackend.java:221)
at org.nd4j.linalg.factory.Nd4j.initContext(Nd4j.java:5091)
... 3 more
我的项目是在IntelliJ上使用Maven创建的。我认为所有的依赖关系都没问题,否则代码就无法编译。某个后端依赖项、库或安装可能是缺失的。
也许版本不兼容是问题所在。
消息“请确保您的类路径上有nd4j后端”给了我一些关于必须存在某个后端jar的提示。但据我所知,Maven应该会处理这个问题……
我在互联网上搜索了一些解决方案、示例或教程…… 什么都有!
所有的链接都指向相同的指令。
一些技术信息:
- Windows 10
- Java 1.8 IntelliJ + Maven
- dl4j 1.0.0-beta7
- nd4j-cuda-10.2
- CUDA 11.1 - 截至今天的最新版本
我会继续努力解决这个问题,但我会非常感谢任何帮助。
谢谢。
英文:
I'm trying to run a dl4j model using GPU instead of CPU. The model runs perfectly well using CPU.
So I decided to try CUDA to be able to to take advantage of the my GPU. I followed every step described in here and for the CUDA install I followed the instructions from NVIDIA to install CUDA Toolkit from here. The code compiles ok but I get an error:
Skipped [JCublasBackend] backend (unavailable): java.lang.UnsatisfiedLinkError: C:\Users\albertb\.javacpp\cache\cuda-10.2-7.6-1.5.3-windows-x86_64.jar\org\bytedeco\cuda\windows-x86_64\jnicudart.dll: Can't find dependent libraries
Exception in thread "main" java.lang.ExceptionInInitializerError
at TestCUDA.run(TestCUDA.java:12)
at TestCUDA.main(TestCUDA.java:7)
Caused by: java.lang.RuntimeException: org.nd4j.linalg.factory.Nd4jBackend$NoAvailableBackendException: Please ensure that you have an nd4j back-end on your classpath. Please see: https://deeplearning4j.konduit.ai/nd4j/backend
at org.nd4j.linalg.factory.Nd4j.initContext(Nd4j.java:5094)
at org.nd4j.linalg.factory.Nd4j.<clinit>(Nd4j.java:270)
... 2 more
Caused by: org.nd4j.linalg.factory.Nd4jBackend$NoAvailableBackendException: Please ensure that you have an nd4j backend on your classpath. Please see: https://deeplearning4j.konduit.ai/nd4j/backend
at org.nd4j.linalg.factory.Nd4jBackend.load(Nd4jBackend.java:221)
at org.nd4j.linalg.factory.Nd4j.initContext(Nd4j.java:5091)
... 3 more
My project was created using Maven on IntelliJ. All the dependencies are ok I guess otherwise the code would not compile. Some back-end dependency or library or install must be missing.
May be a version incompatibility is the problem.
The message Please ensure that you have an nd4j backend on your classpath
gives me some hint about some back-end jar that must be there. But Maven is supposed to take care of that... as far as I know.
I searched internet for some solution or example or tutorial.. anything !!
There are all colones of the same instruction from the links above.
Some technical info:
- Windows 10
- Java 1.8 IntelliJ + Maven
- dl4j 1.0,0-beta7
- nd4j-cuda-10.2
- CUDA 11.1 - latest version today
I'll keep trying to solve the problem but I would appreciate any help.
Thanks.
答案1
得分: 1
你可能安装了错误的CUDA版本。ND4J CUDA版本在构件ID中指定,如:nd4j-cuda-11.0/10.2。
如果你愿意,你可以使用javacpp提供的redist构件:https://search.maven.org/artifact/org.bytedeco/cuda-platform-redist/11.0-8.0-1.5.4/jar
这是适用于CUDA 11的最新版本。Nd4j只是在其所有本地交互操作下使用javacpp。
英文:
You likely have the wrong cuda version installed. The nd4j cuda version is specified in the artifact id like: nd4j-cuda-11.0/10.2.
If you want, you can use the redist artifacts from javacpp for this: https://search.maven.org/artifact/org.bytedeco/cuda-platform-redist/11.0-8.0-1.5.4/jar
This is the latest version for cuda 11. Nd4j just uses javacpp underneath for all of its native interop.
答案2
得分: 0
我解决了!现在我的示例在GeForce 3090上运行!
我的POM的一部分:
<dependency>
<groupId>org.deeplearning4j</groupId>
<artifactId>deeplearning4j-cuda-11.0</artifactId>
<version>1.0.0-SNAPSHOT</version>
</dependency>
<dependency>
<groupId>org.deeplearning4j</groupId>
<artifactId>deeplearning4j-core</artifactId>
<version>1.0.0-SNAPSHOT</version>
</dependency>
另外,您需要从源代码构建nd4j-cuda-11.0,并将所有的 *jar 和 dll 文件添加到您的项目中!
英文:
I’m resolved it! Now my example working on GeForce3090!!!
Part of My POM:
org.example
dl4jCuda
1.0-SNAPSHOT
<dependency>
<groupId>org.deeplearning4j</groupId>
<artifactId>deeplearning4j-cuda-11.0</artifactId>
<version>1.0.0-SNAPSHOT</version>
</dependency>
<dependency>
<groupId>org.deeplearning4j</groupId>
<artifactId>deeplearning4j-core</artifactId>
<version>1.0.0-SNAPSHOT</version>
</dependency>
Also you need build nd4j-cuda-11.0 from sources and add all *jar and dll files to your project!
通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库,让每个人都能够通过互相帮助和分享经验来进步。
评论