Skip to content

监督微调PaddleOCR-VL,加载Checkpoint恢复训练的时候失败 #1432

@MYBao217

Description

@MYBao217

CONDA_PREFIX=/home/ubuntu/miniconda3
CUSTOM_DEVICE_ROOT=
PNPM_HOME=/home/ubuntu/.local/share/pnpm
PADDLE_BACKUP_ENV_PATH=/home/ubuntu/workspace/paddles/erniekit_dist_log/backup_env.0.json
JUPYTER_SERVER_URL=http://b5e8e0adfa40:8888/
PADDLE_MASTER=172.172.172.6:47101
FLAGS_selected_gpus=0

0 before shuffle: 5fc8d1286d48c0757e2f1344ad6e4659
0 after shuffle: df2db9428d9faf1abf5420cd29a8380a
0 source epoch switching.
0 before shuffle: df2db9428d9faf1abf5420cd29a8380a
0 after shuffle: 08fe71632b71a664bb91483f594a3c05
0 source epoch switching.
0 before shuffle: 08fe71632b71a664bb91483f594a3c05
0 after shuffle: 1e48c9155942807ed4a06397d866f012

以上是输出结果。
可以成功加载模型的参数,但之后会陷入0 source epoch switching循环。

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions