Jittor Cuda

I am getting the following error. Is it a problem with Jittor?

INFO: Created TensorFlow Lite XNNPACK delegate for CPU.
/usr/local/envs/env/lib/python3.9/site-packages/jittor/src/misc/cuda_atomic.h(138): error: no instance of overloaded function “atomicCAS” matches the argument list
argument types are: (unsigned short *, unsigned short, unsigned short)

/usr/local/envs/env/lib/python3.9/site-packages/jittor/src/misc/cuda_atomic.h(153): error: no instance of overloaded function “atomicCAS” matches the argument list
argument types are: (unsigned short *, unsigned short, unsigned short)

2 errors detected in the compilation of “/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master/cu11.2.152_sm_61/jit/__opkey0_unary__Tx_int32__Ty_float32__OP_cast__JIT_1__JIT_cuda_1__graph_010000___var_info____hash_b4fa5e27e794d05_op.cc”.
/usr/local/envs/env/lib/python3.9/site-packages/jittor/src/misc/cuda_atomic.h(138): error: no instance of overloaded function “atomicCAS” matches the argument list
argument types are: (unsigned short *, unsigned short, unsigned short)

/usr/local/envs/env/lib/python3.9/site-packages/jittor/src/misc/cuda_atomic.h(153): error: no instance of overloaded function “atomicCAS” matches the argument list
argument types are: (unsigned short *, unsigned short, unsigned short)

2 errors detected in the compilation of “/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master/cu11.2.152_sm_61/jit/__opkey0_broadcast_to__Tx_float32__DIM_4__BCAST_1__JIT_1__JIT_cuda_1__graph_010000___var_i___hash_793d8265cc491f47_op.cc”.
/usr/local/envs/env/lib/python3.9/site-packages/jittor/src/misc/cuda_atomic.h(138): error: no instance of overloaded function “atomicCAS” matches the argument list
argument types are: (unsigned short *, unsigned short, unsigned short)

/usr/local/envs/env/lib/python3.9/site-packages/jittor/src/misc/cuda_atomic.h(153): error: no instance of overloaded function “atomicCAS” matches the argument list
argument types are: (unsigned short *, unsigned short, unsigned short)

2 errors detected in the compilation of “/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master/cu11.2.152_sm_61/jit/__opkey0_broadcast_to__Tx_float32__DIM_2__BCAST_3__JIT_1__JIT_cuda_1__graph_010000___var_i___hash_26a8a325bd5964d3_op.cc”.
/usr/local/envs/env/lib/python3.9/site-packages/jittor/src/misc/cuda_atomic.h(138): error: no instance of overloaded function “atomicCAS” matches the argument list
argument types are: (unsigned short *, unsigned short, unsigned short)

/usr/local/envs/env/lib/python3.9/site-packages/jittor/src/misc/cuda_atomic.h(153): error: no instance of overloaded function “atomicCAS” matches the argument list
argument types are: (unsigned short *, unsigned short, unsigned short)

2 errors detected in the compilation of “/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master/cu11.2.152_sm_61/jit/__opkey0_broadcast_to__Tx_float32__DIM_3__BCAST_1__JIT_1__JIT_cuda_1__graph_010000___var_i___hash_483c49bd49d07258_op.cc”.
Exception in thread Thread-4:
Traceback (most recent call last):
File “/usr/local/envs/env/lib/python3.9/threading.py”, line 980, in _bootstrap_inner
self.run()
File “/dataset_preprocessing/ffhq/faceverse/fit_imgs_offline_cuda.py”, line 66, in run
rigid_optimizer.backward(loss)
File “/usr/local/envs/env/lib/python3.9/site-packages/jittor/optim.py”, line 170, in backward
jt.sync(params_has_grad)
RuntimeError: Wrong inputs arguments, Please refer to examples(help(jt.sync)).

Types of your inputs are:
self = module,
args = (list, ),

The function declarations are:
void sync(const vector<VarHolder*>& vh=vector<VarHolder*>(), bool device_sync=false, bool weak_sync=true)

Failed reason:[f 0624 06:22:44.413856 36 parallel_compiler.cc:330] Error happend during compilation:
[Error] source file location:/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master/cu11.2.152_sm_61/jit/__opkey0_broadcast_to__Tx_float32__DIM_3__BCAST_1__JIT_1__JIT_cuda_1__graph_010000___var_i___hash_483c49bd49d07258_op.cc
Compile fused operator(0/6)failed:[Op(38:0:1:1:i1:o1:s0,broadcast_to->39),]

Reason: [f 0624 06:22:42.803444 84:C1 log.cc:608] Check failed ret(256) == 0(0) Run cmd failed: “/root/.cache/jittor/jtcuda/cuda11.2_cudnn8_linux/bin/nvcc” “/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master/cu11.2.152_sm_61/jit/__opkey0_broadcast_to__Tx_float32__DIM_3__BCAST_1__JIT_1__JIT_cuda_1__graph_010000___var_i___hash_483c49bd49d07258_op.cc” -std=c++14 -Xcompiler -fPIC -Xcompiler -march=native -Xcompiler -fdiagnostics-color=always -lstdc++ -ldl -shared -I"/usr/local/envs/env/lib/python3.9/site-packages/jittor/src" -I/usr/local/envs/env/include/python3.9 -I/usr/local/envs/env/include/python3.9 -DHAS_CUDA -DIS_CUDA -I"/root/.cache/jittor/jtcuda/cuda11.2_cudnn8_linux/include" -I"/usr/local/envs/env/lib/python3.9/site-packages/jittor/extern/cuda/inc" -lcudart -L"/root/.cache/jittor/jtcuda/cuda11.2_cudnn8_linux/lib64" -Xlinker -rpath=“/root/.cache/jittor/jtcuda/cuda11.2_cudnn8_linux/lib64” -I"/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master/cu11.2.152_sm_61" -L"/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master/cu11.2.152_sm_61" -Xlinker -rpath=“/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master/cu11.2.152_sm_61” -L"/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master" -Xlinker -rpath=“/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master” -l:“jit_utils_core.cpython-39-x86_64-linux-gnu”.so -l:“jittor_core.cpython-39-x86_64-linux-gnu”.so -x cu --cudart=shared -ccbin=“/usr/bin/g++” --use_fast_math -w -I"/usr/local/envs/env/lib/python3.9/site-packages/jittor/extern/cuda/inc" -arch=compute_61 -code=sm_61 -o “/root/.cache/jittor/jt1.3.8/g++7.5.0/py3.9.16/Linux-4.15.0-2x73/IntelRXeonRCPUxbf/master/cu11.2.152_sm_61/jit/__opkey0_broadcast_to__Tx_float32__DIM_3__BCAST_1__JIT_1__JIT_cuda_1__graph_010000___var_i___hash_483c49bd49d07258_op.so”