我使用intel core i7-13700K,RTX4090。我在windows11下使用jittor会识别gpu并下载cuda11.2包,但使用cpu运算,ubuntu22.04下同样识别gpu并下载cuda11.2包,但报错。
RTX4090原生支持cuda12,且旧版本nvcc可能不兼容新架构的编译,此前编译时遇到过此类问题。可能是这个原因吗,或者是由于其他可能的操作配置问题。
临时更换RTX2080ti可以运行。但我希望尝试使用4090的设备。
ubuntu22.04记录:
[i 0408 18:28:02.514394 08 compiler.py:955] Jittor(1.3.7.12) src: /home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor
[i 0408 18:28:02.515433 08 compiler.py:956] g++ at /usr/bin/g++(11.3.0)
[i 0408 18:28:02.515458 08 compiler.py:957] cache_path: /home/loping151/.cache/jittor/jt1.3.7/g++11.3.0/py3.9.16/Linux-5.15.0-5x44/13thGenIntelRCx2d/default
[i 0408 18:28:02.526062 08 install_cuda.py:93] cuda_driver_version: [12, 0]
[i 0408 18:28:02.527497 08 __init__.py:411] Found /home/loping151/.cache/jittor/jtcuda/cuda11.2_cudnn8_linux/bin/nvcc(11.2.152) at /home/loping151/.cache/jittor/jtcuda/cuda11.2_cudnn8_linux/bin/nvcc.
[i 0408 18:28:02.548540 08 __init__.py:411] Found gdb(12.1) at /usr/bin/gdb.
[i 0408 18:28:02.549770 08 __init__.py:411] Found addr2line(2.38) at /usr/bin/addr2line.
[i 0408 18:28:02.607237 08 compiler.py:1010] cuda key:cu11.2.152_sm_89
[i 0408 18:28:02.671440 08 __init__.py:227] Total mem: 62.59GB, using 16 procs for compiling.
/usr/include/stdio.h(189): error: attribute "__malloc__" does not take arguments
/usr/include/stdio.h(201): error: attribute "__malloc__" does not take arguments
/usr/include/stdio.h(223): error: attribute "__malloc__" does not take arguments
/usr/include/stdio.h(260): error: attribute "__malloc__" does not take arguments
/usr/include/stdio.h(285): error: attribute "__malloc__" does not take arguments
/usr/include/stdio.h(294): error: attribute "__malloc__" does not take arguments
/usr/include/stdio.h(303): error: attribute "__malloc__" does not take arguments
/usr/include/stdio.h(309): error: attribute "__malloc__" does not take arguments
/usr/include/stdio.h(315): error: attribute "__malloc__" does not take arguments
/usr/include/stdio.h(830): error: attribute "__malloc__" does not take arguments
/usr/include/stdlib.h(566): error: attribute "__malloc__" does not take arguments
/usr/include/stdlib.h(570): error: attribute "__malloc__" does not take arguments
/usr/include/stdlib.h(799): error: attribute "__malloc__" does not take arguments
/usr/include/c++/11/type_traits(1406): error: type name is not allowed
/usr/include/c++/11/type_traits(1406): error: type name is not allowed
/usr/include/c++/11/type_traits(1406): error: identifier "__is_same" is undefined
/usr/include/wchar.h(155): error: attribute "__malloc__" does not take arguments
/usr/include/wchar.h(582): error: attribute "__malloc__" does not take arguments
/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/src/misc/cstr.h(19): error: no instance of overloaded function "std::unique_ptr<_Tp [], _Dp>::reset [with _Tp=char, _Dp=std::default_delete<char []>]" matches the argument list
argument types are: (char *)
object type is: jittor::unique_ptr<char []>
/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/src/misc/cstr.h(25): error: no instance of overloaded function "std::unique_ptr<_Tp [], _Dp>::reset [with _Tp=char, _Dp=std::default_delete<char []>]" matches the argument list
argument types are: (char *)
object type is: jittor::unique_ptr<char []>
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const long, std::is_same<int, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=long, _Ret=int, _CharT=char, _Base=<int>]"
/usr/include/c++/11/bits/basic_string.h(6620): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const long, std::is_same<long, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=long, _Ret=long, _CharT=char, _Base=<int>]"
/usr/include/c++/11/bits/basic_string.h(6625): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const unsigned long, std::is_same<unsigned long, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=unsigned long, _Ret=unsigned long, _CharT=char, _Base=<int>]"
/usr/include/c++/11/bits/basic_string.h(6630): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const long long, std::is_same<long long, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=long long, _Ret=long long, _CharT=char, _Base=<int>]"
/usr/include/c++/11/bits/basic_string.h(6635): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const unsigned long long, std::is_same<unsigned long long, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=unsigned long long, _Ret=unsigned long long, _CharT=char, _Base=<int>]"
/usr/include/c++/11/bits/basic_string.h(6640): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const float, std::is_same<float, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=float, _Ret=float, _CharT=char, _Base=<>]"
/usr/include/c++/11/bits/basic_string.h(6646): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const double, std::is_same<double, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=double, _Ret=double, _CharT=char, _Base=<>]"
/usr/include/c++/11/bits/basic_string.h(6650): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const long double, std::is_same<long double, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=long double, _Ret=long double, _CharT=char, _Base=<>]"
/usr/include/c++/11/bits/basic_string.h(6654): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const long, std::is_same<int, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=long, _Ret=int, _CharT=wchar_t, _Base=<int>]"
/usr/include/c++/11/bits/basic_string.h(6751): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const long, std::is_same<long, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=long, _Ret=long, _CharT=wchar_t, _Base=<int>]"
/usr/include/c++/11/bits/basic_string.h(6756): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const unsigned long, std::is_same<unsigned long, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=unsigned long, _Ret=unsigned long, _CharT=wchar_t, _Base=<int>]"
/usr/include/c++/11/bits/basic_string.h(6761): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const long long, std::is_same<long long, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=long long, _Ret=long long, _CharT=wchar_t, _Base=<int>]"
/usr/include/c++/11/bits/basic_string.h(6766): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const unsigned long long, std::is_same<unsigned long long, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=unsigned long long, _Ret=unsigned long long, _CharT=wchar_t, _Base=<int>]"
/usr/include/c++/11/bits/basic_string.h(6771): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const float, std::is_same<float, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=float, _Ret=float, _CharT=wchar_t, _Base=<>]"
/usr/include/c++/11/bits/basic_string.h(6777): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const double, std::is_same<double, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=double, _Ret=double, _CharT=wchar_t, _Base=<>]"
/usr/include/c++/11/bits/basic_string.h(6781): here
/usr/include/c++/11/ext/string_conversions.h(85): error: no instance of overloaded function "_Range_chk::_S_chk" matches the argument list
argument types are: (const long double, std::is_same<long double, int>)
detected during instantiation of "_Ret __gnu_cxx::__stoa(_TRet (*)(const _CharT *, _CharT **, _Base...), const char *, const _CharT *, std::size_t *, _Base...) [with _TRet=long double, _Ret=long double, _CharT=wchar_t, _Base=<>]"
/usr/include/c++/11/bits/basic_string.h(6785): here
36 errors detected in the compilation of "/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/src/misc/nan_checker.cu".
multiprocessing.pool.RemoteTraceback:
"""
Traceback (most recent call last):
File "/home/loping151/.conda/envs/dl2023/lib/python3.9/multiprocessing/pool.py", line 125, in worker
result = (True, func(*args, **kwds))
File "/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor_utils/__init__.py", line 197, in do_compile
return cc.cache_compile(cmd, cache_path, jittor_path)
RuntimeError: [f 0408 18:28:03.202718 08 log.cc:608] Check failed ret(256) == 0(0) Run cmd failed: "/home/loping151/.cache/jittor/jtcuda/cuda11.2_cudnn8_linux/bin/nvcc" "/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/src/misc/nan_checker.cu" -std=c++14 -Xcompiler -fPIC -Xcompiler -march=native -Xcompiler -fdiagnostics-color=always -I"/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/src" -I/home/loping151/.conda/envs/dl2023/include/python3.9 -I/home/loping151/.conda/envs/dl2023/include/python3.9 -DHAS_CUDA -DIS_CUDA -I"/home/loping151/.cache/jittor/jtcuda/cuda11.2_cudnn8_linux/include" -I"/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/extern/cuda/inc" -I"/home/loping151/.cache/jittor/jt1.3.7/g++11.3.0/py3.9.16/Linux-5.15.0-5x44/13thGenIntelRCx2d/default/cu11.2.152_sm_89" -O2 -c -o "/home/loping151/.cache/jittor/jt1.3.7/g++11.3.0/py3.9.16/Linux-5.15.0-5x44/13thGenIntelRCx2d/default/cu11.2.152_sm_89/obj_files/nan_checker.cu.o" -x cu --cudart=shared -ccbin="/usr/bin/g++" -w -I"/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/extern/cuda/inc"
"""
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/home/loping151/Documents/dl2023/main.py", line 1, in <module>
import jittor as jt
File "/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/__init__.py", line 18, in <module>
from . import compiler
File "/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/compiler.py", line 1353, in <module>
compile(cc_path, cc_flags+opt_flags, files, 'jittor_core'+extension_suffix)
File "/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/compiler.py", line 151, in compile
jit_utils.run_cmds(cmds, cache_path, jittor_path, "Compiling "+base_output)
File "/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor_utils/__init__.py", line 251, in run_cmds
for i,_ in enumerate(p.imap_unordered(do_compile, cmds)):
File "/home/loping151/.conda/envs/dl2023/lib/python3.9/multiprocessing/pool.py", line 870, in next
raise value
RuntimeError: [f 0408 18:28:03.202718 08 log.cc:608] Check failed ret(256) == 0(0) Run cmd failed: "/home/loping151/.cache/jittor/jtcuda/cuda11.2_cudnn8_linux/bin/nvcc" "/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/src/misc/nan_checker.cu" -std=c++14 -Xcompiler -fPIC -Xcompiler -march=native -Xcompiler -fdiagnostics-color=always -I"/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/src" -I/home/loping151/.conda/envs/dl2023/include/python3.9 -I/home/loping151/.conda/envs/dl2023/include/python3.9 -DHAS_CUDA -DIS_CUDA -I"/home/loping151/.cache/jittor/jtcuda/cuda11.2_cudnn8_linux/include" -I"/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/extern/cuda/inc" -I"/home/loping151/.cache/jittor/jt1.3.7/g++11.3.0/py3.9.16/Linux-5.15.0-5x44/13thGenIntelRCx2d/default/cu11.2.152_sm_89" -O2 -c -o "/home/loping151/.cache/jittor/jt1.3.7/g++11.3.0/py3.9.16/Linux-5.15.0-5x44/13thGenIntelRCx2d/default/cu11.2.152_sm_89/obj_files/nan_checker.cu.o" -x cu --cudart=shared -ccbin="/usr/bin/g++" -w -I"/home/loping151/.conda/envs/dl2023/lib/python3.9/site-packages/jittor/extern/cuda/inc"
进程已结束,退出代码1