Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DataLoader worker (pid(s)) exited unexpectedly #374

Open
bileltouati opened this issue Oct 30, 2022 · 2 comments
Open

DataLoader worker (pid(s)) exited unexpectedly #374

bileltouati opened this issue Oct 30, 2022 · 2 comments

Comments

@bileltouati
Copy link

When running my code I recieved this error message “ RuntimeError: DataLoader worker (pid(s) 8992) exited unexpectedly " after training start
i use pytorch 1.12.1+cu113
NUM_Worker : 0
batch_size : 1
cuda : True
multi_gpu : True
image_size : 224
can you help me
Model : resnet_split0 Experience : cross_validation
self.dataset : None
videos_split : [[‘SM686-7’ ‘train’]
[‘LYI1079-2’ ‘train’]
[‘GA817-1-8’ ‘train’]

[‘TA239-2’ ‘test’]
[‘GM537-7’ ‘test’]
[‘AM33-2’ ‘test’]]

videos_split : [[‘SM686-7’ ‘train’]
[‘LYI1079-2’ ‘train’]
[‘GA817-1-8’ ‘train’]

Epoch_begin
Epoch 1 : train
im in bloc loop :slight_smile:
0 / 259398
LGA881-1-2 9 0
Traceback (most recent call last):
File “/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py”, line 1163, in _try_get_data
data = self._data_queue.get(timeout=timeout)
c = Client(address, authkey=process.current_process().authkey)
File “/usr/lib/python3.7/multiprocessing/connection.py”, line 492, in Client
c = SocketClient(address)

File “/usr/lib/python3.7/multiprocessing/connection.py”, line 620, in SocketClient
s.connect(address)
FileNotFoundError: [Errno 2] No such file or directory

The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File “/content/drive/MyDrive/memoire-SSD/code/trainVal.py”, line 449, in
main()
File “/content/drive/MyDrive/memoire-SSD/code/trainVal.py”, line 446, in main
run(args)
File “/content/drive/MyDrive/memoire-SSD/code/trainVal.py”, line 348, in run
trainFunc(kwargsTr)
File “/content/drive/MyDrive/memoire-SSD/code/trainVal.py”, line 30, in epochSeqTr
for batch_idx,batch in enumerate(loader):
File “/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py”, line 681, in next
data = self._next_data()
File “/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py”, line 1359, in _next_data
idx, data = self._get_data()
File “/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py”, line 1325, in _get_data
success, data = self._try_get_data()
File “/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py”, line 1176, in _try_get_data
r
aise RuntimeError(‘DataLoader worker (pid(s) {}) exited unexpectedly’.format(pids_str)) from e
RuntimeError: DataLoader worker (pid(s) 8992) exited unexpectedly**

@bileltouati
Copy link
Author

any request ?

@bileltouati
Copy link
Author

error still persists

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant