问题

使用paddle框架进行相关模型训练的时候,使用GPU时候报错:

C++ Traceback (most recent call last):
--------------------------------------
0 std::thread::_Impl<std::_Bind_simple<ThreadPool::ThreadPool(unsigned long)::{lambda()#1} ()> >::_M_run()
1 std::__future_base::_State_baseV2::_M_do_set(std::function<std::unique_ptr<std::__future_base::_Result_base, std::__future_base::_Result_base::_Deleter> ()>*, bool*)
2 paddle::framework::SignalHandle(char const*, int)
3 paddle::platform::GetCurrentTraceBackString[abi:cxx11]()

----------------------
Error Message Summary:
----------------------
FatalError: `Segmentation fault` is detected by the operating system.
[TimeInfo: *** Aborted at 1639543591 (unix time) try "date -d @1639543591" if you are using GNU date ***]
[SignalInfo: *** SIGSEGV (@0x0) received by PID 168238 (TID 0x7fc05d8c5700) from PID 0 ***]

版本:

  • python=3.8
  • paddlepaddle-gpu=2.2.1
  • cuda=10.1
  • cudnn=7.6

查看issue发现出现这种情况的不少,这里做一个记录

解决方法

使用conda重新安装一遍,参考安装教程:

  • https://www.paddlepaddle/documentation/docs/zh/install/conda/linux-conda.html#anchor-0

执行命令:

conda install paddlepaddle-gpu==2.2.1 cudatoolkit=10.1 --channel https://mirrors.tuna.tsinghua.edu/anaconda/cloud/Paddle/

参考

  • https://www.paddlepaddle/install/quick?docurl=/documentation/docs/zh/install/conda/linux-conda.html
  • https://www.paddlepaddle/documentation/docs/zh/install/conda/linux-conda.html#anchor-0

更多推荐

Bug:paddle:FatalError: `Segmentation fault` is detected by the operating system.