loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 ***************************************** Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed. ***************************************** loaded library: loaded library: loaded library: loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1/usr/lib/x86_64-linux-gnu/libibverbs.so.1/usr/lib/x86_64-linux-gnu/libibverbs.so.1 /usr/lib/x86_64-linux-gnu/libibverbs.so.1 loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 W20220521 11:25:00.937984 405 rpc_client.cpp:190] LoadServer 10.7.52.7 Failed at 0 times error_code 14 error_message failed to connect to all addresses ------------------------ arguments ------------------------ batches_per_epoch ............................... 834 channel_last .................................... True ddp ............................................. False exit_num ........................................ -1 fuse_bn_add_relu ................................ True fuse_bn_relu .................................... True gpu_stat_file ................................... None grad_clipping ................................... 0.0 graph ........................................... True label_smoothing ................................. 0.1 learning_rate ................................... 1.536 legacy_init ..................................... False load_path ....................................... None lr_decay_type ................................... cosine metric_local .................................... True metric_train_acc ................................ True momentum ........................................ 0.875 nccl_fusion_max_ops ............................. 24 nccl_fusion_threshold_mb ........................ 16 num_classes ..................................... 1000 num_devices_per_node ............................ 8 num_epochs ...................................... 50 num_nodes ....................................... 1 ofrecord_part_num ............................... 256 ofrecord_path ................................... /dataset/79846248 print_interval .................................. 100 print_timestamp ................................. False samples_per_epoch ............................... 1281167 save_init ....................................... False save_path ....................................... None scale_grad ...................................... True skip_eval ....................................... False synthetic_data .................................. False total_batches ................................... -1 train_batch_size ................................ 192 train_global_batch_size ......................... 1536 use_fp16 ........................................ True use_gpu_decode .................................. True val_batch_size .................................. 50 val_batches_per_epoch ........................... 125 val_global_batch_size ........................... 400 val_samples_per_epoch ........................... 50000 warmup_epochs ................................... 5 weight_decay .................................... 3.0517578125e-05 zero_init_residual .............................. True -------------------- end of arguments --------------------- ***** Model Init ***** ***** Model Init Finish, time escapled: 2.96841 s ***** [rank:0] [train], epoch: 0/50, iter: 100/834, loss: 0.86086, top1: 0.00333, throughput: 293.53 | 2022-05-21 11:26:21.610 [rank:2] [train], epoch: 0/50, iter: 100/834, loss: 0.86109, top1: 0.00281, throughput: 293.53 | 2022-05-21 11:26:21.612 [rank:7] [train], epoch: 0/50, iter: 100/834, loss: 0.86120, top1: 0.00349, throughput: 293.56 | 2022-05-21 11:26:21.612 [rank:1] [train], epoch: 0/50, iter: 100/834, loss: 0.86083, top1: 0.00396, throughput: 293.53 | 2022-05-21 11:26:21.613 [rank:3] [train], epoch: 0/50, iter: 100/834, loss: 0.86087, top1: 0.00417, throughput: 293.53 | 2022-05-21 11:26:21.612 [rank:5] [train], epoch: 0/50, iter: 100/834, loss: 0.86111, top1: 0.00365, throughput: 293.55 | 2022-05-21 11:26:21.611 [rank:4] [train], epoch: 0/50, iter: 100/834, loss: 0.86123, top1: 0.00359, throughput: 293.53 | 2022-05-21 11:26:21.612 [rank:6] [train], epoch: 0/50, iter: 100/834, loss: 0.86097, top1: 0.00333, throughput: 293.53 | 2022-05-21 11:26:21.614 timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] 2022/05/21 11:26:21.824, Tesla V100-SXM2-32GB, 470.57.02, 99 %, 75 %, 32510 MiB, 21205 MiB, 11305 MiB timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] 2022/05/21 11:26:21.831, Tesla V100-SXM2-32GB, 470.57.02, 99 %, 75 %, 32510 MiB, 21205 MiB, 11305 MiB 2022/05/21 11:26:21.834, Tesla V100-SXM2-32GB, 470.57.02, 97 %, 73 %, 32510 MiB, 21182 MiB, 11328 MiB 2022/05/21 11:26:21.834, Tesla V100-SXM2-32GB, 470.57.02, 99 %, 75 %, 32510 MiB, 21205 MiB, 11305 MiB timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] 2022/05/21 11:26:21.842, Tesla V100-SXM2-32GB, 470.57.02, 97 %, 73 %, 32510 MiB, 21182 MiB, 11328 MiB timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] 2022/05/21 11:26:21.842, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 62 %, 32510 MiB, 21205 MiB, 11305 MiB 2022/05/21 11:26:21.842, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 62 %, 32510 MiB, 21205 MiB, 11305 MiB timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] 2022/05/21 11:26:21.844, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 66 %, 32510 MiB, 21344 MiB, 11166 MiB 2022/05/21 11:26:21.846, Tesla V100-SXM2-32GB, 470.57.02, 97 %, 73 %, 32510 MiB, 21182 MiB, 11328 MiB 2022/05/21 11:26:21.852, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 66 %, 32510 MiB, 21344 MiB, 11166 MiB 2022/05/21 11:26:21.852, Tesla V100-SXM2-32GB, 470.57.02, 97 %, 73 %, 32510 MiB, 21182 MiB, 11328 MiB 2022/05/21 11:26:21.852, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 62 %, 32510 MiB, 21205 MiB, 11305 MiB 2022/05/21 11:26:21.853, Tesla V100-SXM2-32GB, 470.57.02, 97 %, 73 %, 32510 MiB, 21182 MiB, 11328 MiB 2022/05/21 11:26:21.852, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 62 %, 32510 MiB, 21205 MiB, 11305 MiB 2022/05/21 11:26:21.854, Tesla V100-SXM2-32GB, 470.57.02, 93 %, 62 %, 32510 MiB, 21304 MiB, 11206 MiB 2022/05/21 11:26:21.853, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 62 %, 32510 MiB, 21205 MiB, 11305 MiB 2022/05/21 11:26:21.856, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 66 %, 32510 MiB, 21344 MiB, 11166 MiB 2022/05/21 11:26:21.862, Tesla V100-SXM2-32GB, 470.57.02, 93 %, 62 %, 32510 MiB, 21304 MiB, 11206 MiB 2022/05/21 11:26:21.863, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 66 %, 32510 MiB, 21344 MiB, 11166 MiB 2022/05/21 11:26:21.868, Tesla V100-SXM2-32GB, 470.57.02, 83 %, 58 %, 32510 MiB, 21182 MiB, 11328 MiB 2022/05/21 11:26:21.868, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 66 %, 32510 MiB, 21344 MiB, 11166 MiB 2022/05/21 11:26:21.869, Tesla V100-SXM2-32GB, 470.57.02, 83 %, 58 %, 32510 MiB, 21182 MiB, 11328 MiB 2022/05/21 11:26:21.870, Tesla V100-SXM2-32GB, 470.57.02, 94 %, 62 %, 32510 MiB, 21306 MiB, 11204 MiB 2022/05/21 11:26:21.870, Tesla V100-SXM2-32GB, 470.57.02, 83 %, 58 %, 32510 MiB, 21182 MiB, 11328 MiB 2022/05/21 11:26:21.872, Tesla V100-SXM2-32GB, 470.57.02, 93 %, 62 %, 32510 MiB, 21304 MiB, 11206 MiB 2022/05/21 11:26:21.876, Tesla V100-SXM2-32GB, 470.57.02, 94 %, 62 %, 32510 MiB, 21306 MiB, 11204 MiB 2022/05/21 11:26:21.877, Tesla V100-SXM2-32GB, 470.57.02, 93 %, 62 %, 32510 MiB, 21304 MiB, 11206 MiB 2022/05/21 11:26:21.877, Tesla V100-SXM2-32GB, 470.57.02, 81 %, 59 %, 32510 MiB, 21344 MiB, 11166 MiB 2022/05/21 11:26:21.877, Tesla V100-SXM2-32GB, 470.57.02, 93 %, 62 %, 32510 MiB, 21304 MiB, 11206 MiB 2022/05/21 11:26:21.877, Tesla V100-SXM2-32GB, 470.57.02, 81 %, 59 %, 32510 MiB, 21344 MiB, 11166 MiB 2022/05/21 11:26:21.888, Tesla V100-SXM2-32GB, 470.57.02, 89 %, 63 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/05/21 11:26:21.888, Tesla V100-SXM2-32GB, 470.57.02, 81 %, 59 %, 32510 MiB, 21344 MiB, 11166 MiB 2022/05/21 11:26:21.890, Tesla V100-SXM2-32GB, 470.57.02, 94 %, 62 %, 32510 MiB, 21306 MiB, 11204 MiB 2022/05/21 11:26:21.895, Tesla V100-SXM2-32GB, 470.57.02, 89 %, 63 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/05/21 11:26:21.895, Tesla V100-SXM2-32GB, 470.57.02, 94 %, 62 %, 32510 MiB, 21306 MiB, 11204 MiB 2022/05/21 11:26:21.895, Tesla V100-SXM2-32GB, 470.57.02, 93 %, 62 %, 32510 MiB, 21304 MiB, 11206 MiB 2022/05/21 11:26:21.896, Tesla V100-SXM2-32GB, 470.57.02, 94 %, 62 %, 32510 MiB, 21306 MiB, 11204 MiB 2022/05/21 11:26:21.896, Tesla V100-SXM2-32GB, 470.57.02, 93 %, 62 %, 32510 MiB, 21304 MiB, 11206 MiB 2022/05/21 11:26:21.897, Tesla V100-SXM2-32GB, 470.57.02, 84 %, 59 %, 32510 MiB, 21100 MiB, 11410 MiB 2022/05/21 11:26:21.897, Tesla V100-SXM2-32GB, 470.57.02, 93 %, 62 %, 32510 MiB, 21304 MiB, 11206 MiB 2022/05/21 11:26:21.899, Tesla V100-SXM2-32GB, 470.57.02, 89 %, 63 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/05/21 11:26:21.904, Tesla V100-SXM2-32GB, 470.57.02, 84 %, 59 %, 32510 MiB, 21100 MiB, 11410 MiB 2022/05/21 11:26:21.905, Tesla V100-SXM2-32GB, 470.57.02, 89 %, 63 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/05/21 11:26:21.905, Tesla V100-SXM2-32GB, 470.57.02, 94 %, 62 %, 32510 MiB, 21306 MiB, 11204 MiB 2022/05/21 11:26:21.905, Tesla V100-SXM2-32GB, 470.57.02, 89 %, 63 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/05/21 11:26:21.905, Tesla V100-SXM2-32GB, 470.57.02, 94 %, 62 %, 32510 MiB, 21306 MiB, 11204 MiB 2022/05/21 11:26:21.906, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 61 %, 32510 MiB, 21206 MiB, 11304 MiB 2022/05/21 11:26:21.907, Tesla V100-SXM2-32GB, 470.57.02, 94 %, 62 %, 32510 MiB, 21306 MiB, 11204 MiB 2022/05/21 11:26:21.908, Tesla V100-SXM2-32GB, 470.57.02, 84 %, 59 %, 32510 MiB, 21100 MiB, 11410 MiB 2022/05/21 11:26:21.913, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 61 %, 32510 MiB, 21206 MiB, 11304 MiB 2022/05/21 11:26:21.914, Tesla V100-SXM2-32GB, 470.57.02, 84 %, 59 %, 32510 MiB, 21100 MiB, 11410 MiB 2022/05/21 11:26:21.914, Tesla V100-SXM2-32GB, 470.57.02, 89 %, 63 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/05/21 11:26:21.914, Tesla V100-SXM2-32GB, 470.57.02, 84 %, 59 %, 32510 MiB, 21100 MiB, 11410 MiB 2022/05/21 11:26:21.914, Tesla V100-SXM2-32GB, 470.57.02, 89 %, 63 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/05/21 11:26:21.916, Tesla V100-SXM2-32GB, 470.57.02, 89 %, 63 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/05/21 11:26:21.917, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 61 %, 32510 MiB, 21206 MiB, 11304 MiB 2022/05/21 11:26:21.922, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 61 %, 32510 MiB, 21206 MiB, 11304 MiB 2022/05/21 11:26:21.923, Tesla V100-SXM2-32GB, 470.57.02, 84 %, 59 %, 32510 MiB, 21100 MiB, 11410 MiB 2022/05/21 11:26:21.923, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 61 %, 32510 MiB, 21206 MiB, 11304 MiB 2022/05/21 11:26:21.923, Tesla V100-SXM2-32GB, 470.57.02, 84 %, 59 %, 32510 MiB, 21100 MiB, 11410 MiB 2022/05/21 11:26:21.925, Tesla V100-SXM2-32GB, 470.57.02, 84 %, 59 %, 32510 MiB, 21100 MiB, 11410 MiB 2022/05/21 11:26:21.931, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 61 %, 32510 MiB, 21206 MiB, 11304 MiB 2022/05/21 11:26:21.932, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 61 %, 32510 MiB, 21206 MiB, 11304 MiB 2022/05/21 11:26:21.933, Tesla V100-SXM2-32GB, 470.57.02, 92 %, 61 %, 32510 MiB, 21206 MiB, 11304 MiB [rank:7] [train], epoch: 0/50, iter: 200/834, loss: 0.82971, top1: 0.01266, throughput: 1222.39 | 2022-05-21 11:26:37.319 [rank:3] [train], epoch: 0/50, iter: 200/834, loss: 0.82912, top1: 0.01182, throughput: 1222.34 | 2022-05-21 11:26:37.320 [rank:0] [train], epoch: 0/50, iter: 200/834, loss: 0.82971, top1: 0.01177, throughput: 1222.23[rank:2] [train], epoch: 0/50, iter: 200/834, loss: 0.82869, top1: 0.01187, throughput: 1222.27 | 2022-05-21 11:26:37.319| 2022-05-21 11:26:37.320 [rank:1] [train], epoch: 0/50, iter: 200/834, loss: 0.82857, top1: 0.01297, throughput: 1222.29 | 2022-05-21 11:26:37.321 [rank:4] [train], epoch: 0/50, iter: 200/834, loss: 0.82835, top1: 0.01297, throughput: 1222.19 | 2022-05-21 11:26:37.322 [rank:5] [train], epoch: 0/50, iter: 200/834, loss: 0.82812, top1: 0.01208, throughput: 1222.09 | 2022-05-21 11:26:37.322 [rank:6] [train], epoch: 0/50, iter: 200/834, loss: 0.82842, top1: 0.01505, throughput: 1222.51 | 2022-05-21 11:26:37.320 [rank:2] [train], epoch: 0/50, iter: 300/834, loss: 0.80504, top1: 0.01797, throughput: 1303.26 | 2022-05-21 11:26:52.052 [rank:7] [train], epoch: 0/50, iter: 300/834, loss: 0.80425, top1: 0.01854, throughput: 1303.25 | 2022-05-21 11:26:52.051 [rank:0] [train], epoch: 0/50, iter: 300/834, loss: 0.80464, top1: 0.01875, throughput: 1303.26 | 2022-05-21 11:26:52.052 [rank:3] [train], epoch: 0/50, iter: 300/834, loss: 0.80375, top1: 0.01917, throughput: 1303.24 | 2022-05-21 11:26:52.052 [rank:6] [train], epoch: 0/50, iter: 300/834, loss: 0.80418, top1: 0.01953, throughput: 1303.24 | 2022-05-21 11:26:52.052 [rank:4] [train], epoch: 0/50, iter: 300/834, loss: 0.80536, top1: 0.01740, throughput: 1303.42 | 2022-05-21 11:26:52.052 [rank:1] [train], epoch: 0/50, iter: 300/834, loss: 0.80401, top1: 0.01927, throughput: 1303.26 | 2022-05-21 11:26:52.053 [rank:5] [train], epoch: 0/50, iter: 300/834, loss: 0.80405, top1: 0.01938, throughput: 1303.49 | 2022-05-21 11:26:52.052 [rank:7] [train], epoch: 0/50, iter: 400/834, loss: 0.78906, top1: 0.02417, throughput: 1302.45 | 2022-05-21 11:27:06.793 [rank:2] [train], epoch: 0/50, iter: 400/834, loss: 0.78723, top1: 0.02573, throughput: 1302.39 | 2022-05-21 11:27:06.794 [rank:4] [train], epoch: 0/50, iter: 400/834, loss: 0.78796, top1: 0.02734, throughput: 1302.46[rank:0] [train], epoch: 0/50, iter: 400/834, loss: 0.78769, top1: 0.02677, throughput: 1302.41 | 2022-05-21 11:27:06.794| 2022-05-21 11:27:06.793 [rank:1] [train], epoch: 0/50, iter: 400/834, loss: 0.78671, top1: 0.02474, throughput: 1302.38 | 2022-05-21 11:27:06.795 [rank:3] [train], epoch: 0/50, iter: 400/834, loss: 0.78762, top1: 0.02411, throughput: 1302.27 | 2022-05-21 11:27:06.796 [rank:6] [train], epoch: 0/50, iter: 400/834, loss: 0.78791, top1: 0.02500, throughput: 1302.25 | 2022-05-21 11:27:06.796 [rank:5] [train], epoch: 0/50, iter: 400/834, loss: 0.78823, top1: 0.02458, throughput: 1302.22 | 2022-05-21 11:27:06.796 [rank:2] [train], epoch: 0/50, iter: 500/834, loss: 0.77367, top1: 0.03161, throughput: 1302.00 | 2022-05-21 11:27:21.541 [rank:1] [train], epoch: 0/50, iter: 500/834, loss: 0.77282, top1: 0.03276, throughput: 1302.03 | 2022-05-21 11:27:21.542 [rank:7] [train], epoch: 0/50, iter: 500/834, loss: 0.77430, top1: 0.03120, throughput: 1301.80 | 2022-05-21 11:27:21.542 [rank:3] [train], epoch: 0/50, iter: 500/834, loss: 0.76989, top1: 0.03177, throughput: 1302.15 | 2022-05-21 11:27:21.541 [rank:5] [train], epoch: 0/50, iter: 500/834, loss: 0.77259, top1: 0.03151, throughput: 1302.11 | 2022-05-21 11:27:21.541 [rank:0] [train], epoch: 0/50, iter: 500/834, loss: 0.77120, top1: 0.03141, throughput: 1301.72 | 2022-05-21 11:27:21.543 [rank:6] [train], epoch: 0/50, iter: 500/834, loss: 0.77216, top1: 0.03214, throughput: 1302.00 | 2022-05-21 11:27:21.543 [rank:4] [train], epoch: 0/50, iter: 500/834, loss: 0.77218, top1: 0.03161, throughput: 1301.91 | 2022-05-21 11:27:21.541 [rank:2] [train], epoch: 0/50, iter: 600/834, loss: 0.75723, top1: 0.04068, throughput: 1282.30 | 2022-05-21 11:27:36.514 [rank:5] [train], epoch: 0/50, iter: 600/834, loss: 0.75659, top1: 0.04182, throughput: 1282.28 | 2022-05-21 11:27:36.514 [rank:7] [train], epoch: 0/50, iter: 600/834, loss: 0.75821, top1: 0.03849, throughput: 1282.38 | 2022-05-21 11:27:36.514 [rank:1] [train], epoch: 0/50, iter: 600/834, loss: 0.75698, top1: 0.04146, throughput: 1282.19 | 2022-05-21 11:27:36.516 [rank:6] [train], epoch: 0/50, iter: 600/834, loss: 0.75669, top1: 0.03984, throughput: 1282.22[rank:3] [train], epoch: 0/50, iter: 600/834, loss: 0.75652, top1: 0.04255, throughput: 1282.12 | 2022-05-21 11:27:36.516| 2022-05-21 11:27:36.517 [rank:0] [train], epoch: 0/50, iter: 600/834, loss: 0.75724, top1: 0.03958, throughput: 1282.29 | 2022-05-21 11:27:36.517 [rank:4] [train], epoch: 0/50, iter: 600/834, loss: 0.75677, top1: 0.04203, throughput: 1282.23 | 2022-05-21 11:27:36.515 [rank:7] [train], epoch: 0/50, iter: 700/834, loss: 0.74270, top1: 0.04677, throughput: 1318.68 | 2022-05-21 11:27:51.074 [rank:1] [train], epoch: 0/50, iter: 700/834, loss: 0.74044, top1: 0.05031, throughput: 1318.82 | 2022-05-21 11:27:51.074 [rank:4] [train], epoch: 0/50, iter: 700/834, loss: 0.74182, top1: 0.04984, throughput: 1318.67 | 2022-05-21 11:27:51.075 [rank:2] [train], epoch: 0/50, iter: 700/834, loss: 0.74155, top1: 0.04995, throughput: 1318.44 | 2022-05-21 11:27:51.077 [rank:5] [train], epoch: 0/50, iter: 700/834, loss: 0.74229, top1: 0.04828, throughput: 1318.63 | 2022-05-21 11:27:51.075 [rank:3] [train], epoch: 0/50, iter: 700/834, loss: 0.74426, top1: 0.04740, throughput: 1318.63 | 2022-05-21 11:27:51.077 [rank:6] [train], epoch: 0/50, iter: 700/834, loss: 0.74460, top1: 0.04714, throughput: 1318.60 | 2022-05-21 11:27:51.077 [rank:0] [train], epoch: 0/50, iter: 700/834, loss: 0.74291, top1: 0.04896, throughput: 1318.63 | 2022-05-21 11:27:51.077 [rank:1] [train], epoch: 0/50, iter: 800/834, loss: 0.72671, top1: 0.05984, throughput: 1313.90 | 2022-05-21 11:28:05.687 [rank:2] [train], epoch: 0/50, iter: 800/834, loss: 0.72832, top1: 0.05745, throughput: 1314.08 | 2022-05-21 11:28:05.688 [rank:3] [train], epoch: 0/50, iter: 800/834, loss: 0.72752, top1: 0.05875, throughput: 1314.03 | 2022-05-21 11:28:05.688 [rank:6] [train], epoch: 0/50, iter: 800/834, loss: 0.72392, top1: 0.06161, throughput: 1314.12[rank:7] [train], epoch: 0/50, iter: 800/834, loss: 0.72713, top1: 0.05745, throughput: 1313.70 | 2022-05-21 11:28:05.688 | 2022-05-21 11:28:05.689 [rank:5] [train], epoch: 0/50, iter: 800/834, loss: 0.72945, top1: 0.05870, throughput: 1313.90 | 2022-05-21 11:28:05.688 [rank:0] [train], epoch: 0/50, iter: 800/834, loss: 0.72676, top1: 0.05901, throughput: 1314.04 | 2022-05-21 11:28:05.689 [rank:4] [train], epoch: 0/50, iter: 800/834, loss: 0.72755, top1: 0.06156, throughput: 1313.85 | 2022-05-21 11:28:05.689 [rank:6] [train], epoch: 0/50, iter: 834/834, loss: 0.71850, top1: 0.06495, throughput: 1327.69 | 2022-05-21 11:28:10.605 [rank:2] [train], epoch: 0/50, iter: 834/834, loss: 0.71987, top1: 0.06434, throughput: 1327.44 | 2022-05-21 11:28:10.605 [rank:7] [train], epoch: 0/50, iter: 834/834, loss: 0.71919, top1: 0.06725, throughput: 1327.81 | 2022-05-21 11:28:10.606 [rank:5] [train], epoch: 0/50, iter: 834/834, loss: 0.71989, top1: 0.06173, throughput: 1327.33 | 2022-05-21 11:28:10.606 [rank:1] [train], epoch: 0/50, iter: 834/834, loss: 0.71971, top1: 0.05806, throughput: 1327.08 | 2022-05-21 11:28:10.607 [rank:4] [train], epoch: 0/50, iter: 834/834, loss: 0.72027, top1: 0.05974, throughput: 1327.35 | 2022-05-21 11:28:10.607 [rank:3] [train], epoch: 0/50, iter: 834/834, loss: 0.71874, top1: 0.06327, throughput: 1326.96 | 2022-05-21 11:28:10.608 [rank:0] [train], epoch: 0/50, iter: 834/834, loss: 0.71452, top1: 0.07154, throughput: 1327.05 | 2022-05-21 11:28:10.608 [rank:0] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.06928, throughput: 252.09 | 2022-05-21 11:28:35.401 [rank:7] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.06640, throughput: 251.95 | 2022-05-21 11:28:35.412 [rank:6] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.06672, throughput: 251.29 | 2022-05-21 11:28:35.476 [rank:2] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.07280, throughput: 251.01 | 2022-05-21 11:28:35.505 [rank:4] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.07152, throughput: 250.85 | 2022-05-21 11:28:35.522 [rank:3] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.06752, throughput: 250.74 | 2022-05-21 11:28:35.533 [rank:1] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.07008, throughput: 250.17 | 2022-05-21 11:28:35.589 [rank:5] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.07216, throughput: 248.63 | 2022-05-21 11:28:35.744 [rank:3] [train], epoch: 1/50, iter: 100/834, loss: 0.70637, top1: 0.07234, throughput: 1306.63 | 2022-05-21 11:28:50.228 [rank:6] [train], epoch: 1/50, iter: 100/834, loss: 0.70701, top1: 0.07286, throughput: 1301.65 | 2022-05-21 11:28:50.227 [rank:0] [train], epoch: 1/50, iter: 100/834, loss: 0.70463, top1: 0.07422, throughput: 1295.01 | 2022-05-21 11:28:50.227 [rank:4] [train], epoch: 1/50, iter: 100/834, loss: 0.70839, top1: 0.07016, throughput: 1305.74 | 2022-05-21 11:28:50.226 [rank:7] [train], epoch: 1/50, iter: 100/834, loss: 0.70723, top1: 0.07417, throughput: 1295.93 | 2022-05-21 11:28:50.228 [rank:2] [train], epoch: 1/50, iter: 100/834, loss: 0.70546, top1: 0.07516, throughput: 1304.07 | 2022-05-21 11:28:50.228 [rank:1] [train], epoch: 1/50, iter: 100/834, loss: 0.70605, top1: 0.07302, throughput: 1311.59 | 2022-05-21 11:28:50.228 [rank:5] [train], epoch: 1/50, iter: 100/834, loss: 0.70676, top1: 0.07490, throughput: 1325.62 | 2022-05-21 11:28:50.228 [rank:4] [train], epoch: 1/50, iter: 200/834, loss: 0.69184, top1: 0.08672, throughput: 1311.24 | 2022-05-21 11:29:04.869 [rank:6] [train], epoch: 1/50, iter: 200/834, loss: 0.69216, top1: 0.08385, throughput: 1311.26 | 2022-05-21 11:29:04.869 [rank:7] [train], epoch: 1/50, iter: 200/834, loss: 0.69124, top1: 0.08661, throughput: 1311.38 | 2022-05-21 11:29:04.869 [rank:3] [train], epoch: 1/50, iter: 200/834, loss: 0.69192, top1: 0.08693, throughput: 1311.13 | 2022-05-21 11:29:04.871 [rank:2] [train], epoch: 1/50, iter: 200/834, loss: 0.68886, top1: 0.08698, throughput: 1311.17 | 2022-05-21 11:29:04.872 [rank:1] [train], epoch: 1/50, iter: 200/834, loss: 0.69045, top1: 0.08792, throughput: 1311.20 | 2022-05-21 11:29:04.871 [rank:5] [train], epoch: 1/50, iter: 200/834, loss: 0.69073, top1: 0.08667, throughput: 1311.14 | 2022-05-21 11:29:04.872 [rank:0] [train], epoch: 1/50, iter: 200/834, loss: 0.69102, top1: 0.08693, throughput: 1310.31 | 2022-05-21 11:29:04.880 [rank:2] [train], epoch: 1/50, iter: 300/834, loss: 0.67730, top1: 0.09995, throughput: 1328.80 | 2022-05-21 11:29:19.321 [rank:7] [train], epoch: 1/50, iter: 300/834, loss: 0.67533, top1: 0.09792, throughput: 1328.54 | 2022-05-21 11:29:19.321 [rank:3] [train], epoch: 1/50, iter: 300/834, loss: 0.67597, top1: 0.10318, throughput: 1328.82 | 2022-05-21 11:29:19.320 [rank:4] [train], epoch: 1/50, iter: 300/834, loss: 0.67598, top1: 0.10057, throughput: 1328.54 | 2022-05-21 11:29:19.321 [rank:1] [train], epoch: 1/50, iter: 300/834, loss: 0.67707, top1: 0.09552, throughput: 1328.27 | 2022-05-21 11:29:19.326 [rank:0] [train], epoch: 1/50, iter: 300/834, loss: 0.67774, top1: 0.09917, throughput: 1329.34 | 2022-05-21 11:29:19.323 [rank:6] [train], epoch: 1/50, iter: 300/834, loss: 0.67880, top1: 0.09792, throughput: 1328.25 | 2022-05-21 11:29:19.324 [rank:5] [train], epoch: 1/50, iter: 300/834, loss: 0.67628, top1: 0.09969, throughput: 1328.58 | 2022-05-21 11:29:19.323 [rank:7] [train], epoch: 1/50, iter: 400/834, loss: 0.66091, top1: 0.11219, throughput: 1304.00 | 2022-05-21 11:29:34.045 [rank:1] [train], epoch: 1/50, iter: 400/834, loss: 0.65896, top1: 0.11516, throughput: 1304.47 | 2022-05-21 11:29:34.045 [rank:3] [train], epoch: 1/50, iter: 400/834, loss: 0.66197, top1: 0.11229, throughput: 1303.88 | 2022-05-21 11:29:34.046 [rank:6] [train], epoch: 1/50, iter: 400/834, loss: 0.66332, top1: 0.10906, throughput: 1304.19 | 2022-05-21 11:29:34.046 [rank:2] [train], epoch: 1/50, iter: 400/834, loss: 0.66123, top1: 0.11010, throughput: 1303.78 | 2022-05-21 11:29:34.047 [rank:4] [train], epoch: 1/50, iter: 400/834, loss: 0.66246, top1: 0.11260, throughput: 1303.78 | 2022-05-21 11:29:34.047 [rank:0] [train], epoch: 1/50, iter: 400/834, loss: 0.66057, top1: 0.11089, throughput: 1303.95 | 2022-05-21 11:29:34.048 [rank:5] [train], epoch: 1/50, iter: 400/834, loss: 0.66033, top1: 0.11349, throughput: 1303.94 | 2022-05-21 11:29:34.048 [rank:3] [train], epoch: 1/50, iter: 500/834, loss: 0.64535, top1: 0.12557, throughput: 1315.96 | 2022-05-21 11:29:48.636 [rank:2] [train], epoch: 1/50, iter: 500/834, loss: 0.64687, top1: 0.12500, throughput: 1316.04 | 2022-05-21 11:29:48.636 [rank:6] [train], epoch: 1/50, iter: 500/834, loss: 0.64703, top1: 0.12422, throughput: 1315.96 | 2022-05-21 11:29:48.636 [rank:1] [train], epoch: 1/50, iter: 500/834, loss: 0.64246, top1: 0.13370, throughput: 1315.60 | 2022-05-21 11:29:48.639 [rank:7] [train], epoch: 1/50, iter: 500/834, loss: 0.64706, top1: 0.12599, throughput: 1315.67 | 2022-05-21 11:29:48.638 [rank:4] [train], epoch: 1/50, iter: 500/834, loss: 0.64420, top1: 0.13104, throughput: 1315.75 | 2022-05-21 11:29:48.640 [rank:0] [train], epoch: 1/50, iter: 500/834, loss: 0.64728, top1: 0.12703, throughput: 1315.90 | 2022-05-21 11:29:48.638 [rank:5] [train], epoch: 1/50, iter: 500/834, loss: 0.64488, top1: 0.12568, throughput: 1315.90 | 2022-05-21 11:29:48.638 [rank:5] [train], epoch: 1/50, iter: 600/834, loss: 0.63137, top1: 0.14104, throughput: 1311.35 | 2022-05-21 11:30:03.280 [rank:6] [train], epoch: 1/50, iter: 600/834, loss: 0.63167, top1: 0.14125, throughput: 1311.15 | 2022-05-21 11:30:03.280 [rank:1] [train], epoch: 1/50, iter: 600/834, loss: 0.63461, top1: 0.13865, throughput: 1311.39 | 2022-05-21 11:30:03.280 [rank:3] [train], epoch: 1/50, iter: 600/834, loss: 0.63358, top1: 0.14125, throughput: 1311.10 | 2022-05-21 11:30:03.280 [rank:7] [train], epoch: 1/50, iter: 600/834, loss: 0.63042, top1: 0.14198, throughput: 1311.31 | 2022-05-21 11:30:03.280 [rank:0] [train], epoch: 1/50, iter: 600/834, loss: 0.63106, top1: 0.14464, throughput: 1311.27 | 2022-05-21 11:30:03.281 [rank:4] [train], epoch: 1/50, iter: 600/834, loss: 0.63224, top1: 0.14177, throughput: 1311.30 | 2022-05-21 11:30:03.282 [rank:2] [train], epoch: 1/50, iter: 600/834, loss: 0.63250, top1: 0.13844, throughput: 1311.06 | 2022-05-21 11:30:03.281 [rank:3] [train], epoch: 1/50, iter: 700/834, loss: 0.61622, top1: 0.15948, throughput: 1310.94 | 2022-05-21 11:30:17.926 [rank:5] [train], epoch: 1/50, iter: 700/834, loss: 0.61853, top1: 0.15995, throughput: 1311.08 | 2022-05-21 11:30:17.924 [rank:4] [train], epoch: 1/50, iter: 700/834, loss: 0.61710, top1: 0.15521, throughput: 1311.26 | 2022-05-21 11:30:17.924 [rank:7] [train], epoch: 1/50, iter: 700/834, loss: 0.61858, top1: 0.15719, throughput: 1311.06 | 2022-05-21 11:30:17.924 [rank:6] [train], epoch: 1/50, iter: 700/834, loss: 0.62075, top1: 0.15422, throughput: 1311.01 | 2022-05-21 11:30:17.925 [rank:1] [train], epoch: 1/50, iter: 700/834, loss: 0.61776, top1: 0.15755, throughput: 1310.91 | 2022-05-21 11:30:17.926 [rank:2] [train], epoch: 1/50, iter: 700/834, loss: 0.61688, top1: 0.16068, throughput: 1310.97 | 2022-05-21 11:30:17.927 [rank:0] [train], epoch: 1/50, iter: 700/834, loss: 0.61548, top1: 0.15880, throughput: 1311.00 | 2022-05-21 11:30:17.926 [rank:7] [train], epoch: 1/50, iter: 800/834, loss: 0.60515, top1: 0.17391, throughput: 1310.36 | 2022-05-21 11:30:32.577 [rank:3] [train], epoch: 1/50, iter: 800/834, loss: 0.60303, top1: 0.17130, throughput: 1310.49 | 2022-05-21 11:30:32.577 [rank:4] [train], epoch: 1/50, iter: 800/834, loss: 0.60645, top1: 0.16885, throughput: 1310.32 | 2022-05-21 11:30:32.577 [rank:1] [train], epoch: 1/50, iter: 800/834, loss: 0.60145, top1: 0.17813, throughput: 1310.42 | 2022-05-21 11:30:32.578 [rank:2] [train], epoch: 1/50, iter: 800/834, loss: 0.60332, top1: 0.17620, throughput: 1310.51 | 2022-05-21 11:30:32.577 [rank:6] [train], epoch: 1/50, iter: 800/834, loss: 0.60273, top1: 0.17578, throughput: 1310.37 | 2022-05-21 11:30:32.577 [rank:0] [train], epoch: 1/50, iter: 800/834, loss: 0.60384, top1: 0.17188, throughput: 1310.37 | 2022-05-21 11:30:32.578 [rank:5] [train], epoch: 1/50, iter: 800/834, loss: 0.60657, top1: 0.16828, throughput: 1309.94 | 2022-05-21 11:30:32.582 [rank:1] [train], epoch: 1/50, iter: 834/834, loss: 0.59519, top1: 0.18919, throughput: 1322.91 | 2022-05-21 11:30:37.512 [rank:2] [train], epoch: 1/50, iter: 834/834, loss: 0.59644, top1: 0.17708, throughput: 1322.71 | 2022-05-21 11:30:37.513 [rank:7] [train], epoch: 1/50, iter: 834/834, loss: 0.59520, top1: 0.17892, throughput: 1322.62 | 2022-05-21 11:30:37.513 [rank:4] [train], epoch: 1/50, iter: 834/834, loss: 0.59430, top1: 0.18551, throughput: 1322.62 | 2022-05-21 11:30:37.513 [rank:5] [train], epoch: 1/50, iter: 834/834, loss: 0.59920, top1: 0.17126, throughput: 1323.88 | 2022-05-21 11:30:37.512 [rank:6] [train], epoch: 1/50, iter: 834/834, loss: 0.59543, top1: 0.17693, throughput: 1322.48 | 2022-05-21 11:30:37.514 [rank:0] [train], epoch: 1/50, iter: 834/834, loss: 0.59310, top1: 0.18827, throughput: 1322.65 | 2022-05-21 11:30:37.514 [rank:3] [train], epoch: 1/50, iter: 834/834, loss: 0.59447, top1: 0.18045, throughput: 1322.08 | 2022-05-21 11:30:37.514 [rank:4] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.14848, throughput: 544.82 | 2022-05-21 11:30:48.984 [rank:0] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.16320, throughput: 544.63 | 2022-05-21 11:30:48.989 [rank:7] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.15312, throughput: 544.30 | 2022-05-21 11:30:48.995 [rank:6] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.15984, throughput: 541.49 | 2022-05-21 11:30:49.056 [rank:2] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.15984, throughput: 541.04 | 2022-05-21 11:30:49.064 [rank:1] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.16672, throughput: 535.98 | 2022-05-21 11:30:49.173 [rank:3] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.15312, throughput: 534.25 | 2022-05-21 11:30:49.213 [rank:5] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.14992, throughput: 525.81 | 2022-05-21 11:30:49.399 [rank:4] [train], epoch: 2/50, iter: 100/834, loss: 0.58854, top1: 0.19432, throughput: 1290.55 | 2022-05-21 11:31:03.862 [rank:6] [train], epoch: 2/50, iter: 100/834, loss: 0.58394, top1: 0.20021, throughput: 1296.79 | 2022-05-21 11:31:03.861 [rank:7] [train], epoch: 2/50, iter: 100/834, loss: 0.58544, top1: 0.19604, throughput: 1291.51 | 2022-05-21 11:31:03.862 [rank:3] [train], epoch: 2/50, iter: 100/834, loss: 0.58739, top1: 0.19667, throughput: 1310.60 | 2022-05-21 11:31:03.863 [rank:0] [train], epoch: 2/50, iter: 100/834, loss: 0.58532, top1: 0.19755, throughput: 1290.96 | 2022-05-21 11:31:03.862 [rank:5] [train], epoch: 2/50, iter: 100/834, loss: 0.58745, top1: 0.19354, throughput: 1327.52 | 2022-05-21 11:31:03.862 [rank:2] [train], epoch: 2/50, iter: 100/834, loss: 0.58684, top1: 0.19578, throughput: 1297.34 | 2022-05-21 11:31:03.864 [rank:1] [train], epoch: 2/50, iter: 100/834, loss: 0.58796, top1: 0.19031, throughput: 1307.05 | 2022-05-21 11:31:03.863 [rank:6] [train], epoch: 2/50, iter: 200/834, loss: 0.57454, top1: 0.20818, throughput: 1329.61 | 2022-05-21 11:31:18.302 [rank:5] [train], epoch: 2/50, iter: 200/834, loss: 0.57495, top1: 0.21151, throughput: 1329.68 | 2022-05-21 11:31:18.302 [rank:4] [train], epoch: 2/50, iter: 200/834, loss: 0.57807, top1: 0.20724, throughput: 1329.65 | 2022-05-21 11:31:18.302 [rank:0] [train], epoch: 2/50, iter: 200/834, loss: 0.57875, top1: 0.20531, throughput: 1329.59 | 2022-05-21 11:31:18.303 [rank:3] [train], epoch: 2/50, iter: 200/834, loss: 0.57605, top1: 0.20969, throughput: 1329.59 | 2022-05-21 11:31:18.303 [rank:2] [train], epoch: 2/50, iter: 200/834, loss: 0.57955, top1: 0.20453, throughput: 1329.51 | 2022-05-21 11:31:18.305 [rank:7] [train], epoch: 2/50, iter: 200/834, loss: 0.57842, top1: 0.20604, throughput: 1329.43 | 2022-05-21 11:31:18.304 [rank:1] [train], epoch: 2/50, iter: 200/834, loss: 0.57603, top1: 0.20938, throughput: 1329.51 | 2022-05-21 11:31:18.304 [rank:5] [train], epoch: 2/50, iter: 300/834, loss: 0.56677, top1: 0.22198, throughput: 1299.65 | 2022-05-21 11:31:33.075 [rank:4] [train], epoch: 2/50, iter: 300/834, loss: 0.56505, top1: 0.22229, throughput: 1299.53 | 2022-05-21 11:31:33.076 [rank:6] [train], epoch: 2/50, iter: 300/834, loss: 0.56619, top1: 0.21620, throughput: 1299.51 | 2022-05-21 11:31:33.077 [rank:7] [train], epoch: 2/50, iter: 300/834, loss: 0.56697, top1: 0.22109, throughput: 1299.67 | 2022-05-21 11:31:33.077 [rank:3] [train], epoch: 2/50, iter: 300/834, loss: 0.56509, top1: 0.21729, throughput: 1299.58 | 2022-05-21 11:31:33.077 [rank:2] [train], epoch: 2/50, iter: 300/834, loss: 0.56478, top1: 0.22365, throughput: 1299.71 | 2022-05-21 11:31:33.078 [rank:1] [train], epoch: 2/50, iter: 300/834, loss: 0.56364, top1: 0.22115, throughput: 1299.55 | 2022-05-21 11:31:33.079 [rank:0] [train], epoch: 2/50, iter: 300/834, loss: 0.56327, top1: 0.22286, throughput: 1299.33 | 2022-05-21 11:31:33.079 [rank:7] [train], epoch: 2/50, iter: 400/834, loss: 0.55748, top1: 0.22797, throughput: 1312.63 | 2022-05-21 11:31:47.704 [rank:2] [train], epoch: 2/50, iter: 400/834, loss: 0.55555, top1: 0.22906, throughput: 1312.52 | 2022-05-21 11:31:47.706 [rank:0] [train], epoch: 2/50, iter: 400/834, loss: 0.55586, top1: 0.23745, throughput: 1312.80 | 2022-05-21 11:31:47.705 [rank:6] [train], epoch: 2/50, iter: 400/834, loss: 0.55481, top1: 0.23500, throughput: 1312.57 | 2022-05-21 11:31:47.704 [rank:4] [train], epoch: 2/50, iter: 400/834, loss: 0.55517, top1: 0.23464, throughput: 1312.27 | 2022-05-21 11:31:47.707 [rank:5] [train], epoch: 2/50, iter: 400/834, loss: 0.55668, top1: 0.23036, throughput: 1312.44 | 2022-05-21 11:31:47.704 [rank:1] [train], epoch: 2/50, iter: 400/834, loss: 0.55629, top1: 0.23323, throughput: 1312.61 | 2022-05-21 11:31:47.706 [rank:3] [train], epoch: 2/50, iter: 400/834, loss: 0.55490, top1: 0.23755, throughput: 1312.45 | 2022-05-21 11:31:47.706 [rank:1] [train], epoch: 2/50, iter: 500/834, loss: 0.54636, top1: 0.24385, throughput: 1321.43 | 2022-05-21 11:32:02.236 [rank:0] [train], epoch: 2/50, iter: 500/834, loss: 0.54814, top1: 0.24547, throughput: 1321.16 | 2022-05-21 11:32:02.237 [rank:5] [train], epoch: 2/50, iter: 500/834, loss: 0.54581, top1: 0.24604, throughput: 1321.20 | 2022-05-21 11:32:02.236 [rank:7] [train], epoch: 2/50, iter: 500/834, loss: 0.54804, top1: 0.24260, throughput: 1321.24 | 2022-05-21 11:32:02.236 [rank:4] [train], epoch: 2/50, iter: 500/834, loss: 0.54714, top1: 0.24495, throughput: 1321.47 | 2022-05-21 11:32:02.237 [rank:6] [train], epoch: 2/50, iter: 500/834, loss: 0.54525, top1: 0.24391, throughput: 1321.04 | 2022-05-21 11:32:02.238 [rank:3] [train], epoch: 2/50, iter: 500/834, loss: 0.54769, top1: 0.24484, throughput: 1321.21 | 2022-05-21 11:32:02.239 [rank:2] [train], epoch: 2/50, iter: 500/834, loss: 0.54896, top1: 0.23891, throughput: 1321.19 | 2022-05-21 11:32:02.238 [rank:6] [train], epoch: 2/50, iter: 600/834, loss: 0.54132, top1: 0.25786, throughput: 1326.80 | 2022-05-21 11:32:16.709 [rank:4] [train], epoch: 2/50, iter: 600/834, loss: 0.53710, top1: 0.25719, throughput: 1326.67 | 2022-05-21 11:32:16.709 [rank:0] [train], epoch: 2/50, iter: 600/834, loss: 0.53946, top1: 0.25849, throughput: 1326.37 | 2022-05-21 11:32:16.713 [rank:1] [train], epoch: 2/50, iter: 600/834, loss: 0.53952, top1: 0.25484, throughput: 1326.44 | 2022-05-21 11:32:16.711 [rank:3] [train], epoch: 2/50, iter: 600/834, loss: 0.53425, top1: 0.26281, throughput: 1326.78 | 2022-05-21 11:32:16.710 [rank:5] [train], epoch: 2/50, iter: 600/834, loss: 0.53483, top1: 0.26266, throughput: 1326.62 | 2022-05-21 11:32:16.709 [rank:2] [train], epoch: 2/50, iter: 600/834, loss: 0.53859, top1: 0.25505, throughput: 1326.64 | 2022-05-21 11:32:16.711 [rank:7] [train], epoch: 2/50, iter: 600/834, loss: 0.53904, top1: 0.25563, throughput: 1325.87 | 2022-05-21 11:32:16.717 [rank:7] [train], epoch: 2/50, iter: 700/834, loss: 0.52996, top1: 0.26448, throughput: 1307.41 | 2022-05-21 11:32:31.402 [rank:4] [train], epoch: 2/50, iter: 700/834, loss: 0.52959, top1: 0.27120, throughput: 1306.68 | 2022-05-21 11:32:31.403 [rank:0] [train], epoch: 2/50, iter: 700/834, loss: 0.53115, top1: 0.26714, throughput: 1306.99 | 2022-05-21 11:32:31.403 [rank:5] [train], epoch: 2/50, iter: 700/834, loss: 0.52794, top1: 0.27187, throughput: 1306.76[rank:6] [train], epoch: 2/50, iter: 700/834, loss: 0.52911, top1: 0.27083, throughput: 1306.69 | 2022-05-21 11:32:31.402| 2022-05-21 11:32:31.403 [rank:1] [train], epoch: 2/50, iter: 700/834, loss: 0.52763, top1: 0.27182, throughput: 1306.41 | 2022-05-21 11:32:31.408 [rank:3] [train], epoch: 2/50, iter: 700/834, loss: 0.52671, top1: 0.27531, throughput: 1306.50 | 2022-05-21 11:32:31.405 [rank:2] [train], epoch: 2/50, iter: 700/834, loss: 0.52839, top1: 0.26990, throughput: 1306.70 | 2022-05-21 11:32:31.405 [rank:3] [train], epoch: 2/50, iter: 800/834, loss: 0.52107, top1: 0.28313, throughput: 1309.71 | 2022-05-21 11:32:46.065 [rank:4] [train], epoch: 2/50, iter: 800/834, loss: 0.51890, top1: 0.28297, throughput: 1309.44 | 2022-05-21 11:32:46.065 [rank:0] [train], epoch: 2/50, iter: 800/834, loss: 0.52016, top1: 0.29068, throughput: 1309.56 | 2022-05-21 11:32:46.065 [rank:7] [train], epoch: 2/50, iter: 800/834, loss: 0.52297, top1: 0.28005, throughput: 1309.48 | 2022-05-21 11:32:46.065 [rank:2] [train], epoch: 2/50, iter: 800/834, loss: 0.52213, top1: 0.28651, throughput: 1309.69[rank:1] [train], epoch: 2/50, iter: 800/834, loss: 0.51898, top1: 0.28401, throughput: 1309.95 | 2022-05-21 11:32:46.065 | 2022-05-21 11:32:46.065 [rank:6] [train], epoch: 2/50, iter: 800/834, loss: 0.52137, top1: 0.28047, throughput: 1309.44 | 2022-05-21 11:32:46.066 [rank:5] [train], epoch: 2/50, iter: 800/834, loss: 0.52148, top1: 0.28104, throughput: 1309.38 | 2022-05-21 11:32:46.065 [rank:6] [train], epoch: 2/50, iter: 834/834, loss: 0.51875, top1: 0.28600, throughput: 1323.85 | 2022-05-21 11:32:50.997 [rank:4] [train], epoch: 2/50, iter: 834/834, loss: 0.51574, top1: 0.28952, throughput: 1323.72 | 2022-05-21 11:32:50.997 [rank:0] [train], epoch: 2/50, iter: 834/834, loss: 0.51702, top1: 0.29228, throughput: 1323.39 | 2022-05-21 11:32:50.997 [rank:1] [train], epoch: 2/50, iter: 834/834, loss: 0.51940, top1: 0.28554, throughput: 1323.50 | 2022-05-21 11:32:50.997 [rank:5] [train], epoch: 2/50, iter: 834/834, loss: 0.52260, top1: 0.27589, throughput: 1323.51 | 2022-05-21 11:32:50.998 [rank:7] [train], epoch: 2/50, iter: 834/834, loss: 0.51573, top1: 0.29029, throughput: 1322.95 | 2022-05-21 11:32:50.999 [rank:3] [train], epoch: 2/50, iter: 834/834, loss: 0.52157, top1: 0.28523, throughput: 1323.07 | 2022-05-21 11:32:50.999 [rank:2] [train], epoch: 2/50, iter: 834/834, loss: 0.52024, top1: 0.27788, throughput: 1322.87 | 2022-05-21 11:32:50.999 [rank:7] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.26896, throughput: 555.49 | 2022-05-21 11:33:02.250 [rank:0] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.26912, throughput: 554.35 | 2022-05-21 11:33:02.272 [rank:2] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.27872, throughput: 551.33 | 2022-05-21 11:33:02.335 [rank:6] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.26736, throughput: 550.91 | 2022-05-21 11:33:02.341 [rank:4] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.26176, throughput: 548.50 | 2022-05-21 11:33:02.392 [rank:1] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.27184, throughput: 548.34 | 2022-05-21 11:33:02.395 [rank:3] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.25200, throughput: 548.43 | 2022-05-21 11:33:02.395 [rank:5] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.26640, throughput: 538.27 | 2022-05-21 11:33:02.609 [rank:1] [train], epoch: 3/50, iter: 100/834, loss: 0.50720, top1: 0.29672, throughput: 1308.34 | 2022-05-21 11:33:17.070 [rank:0] [train], epoch: 3/50, iter: 100/834, loss: 0.50785, top1: 0.29625, throughput: 1297.40 | 2022-05-21 11:33:17.071 [rank:7] [train], epoch: 3/50, iter: 100/834, loss: 0.50938, top1: 0.29667, throughput: 1295.46 | 2022-05-21 11:33:17.071 [rank:5] [train], epoch: 3/50, iter: 100/834, loss: 0.50889, top1: 0.30432, throughput: 1327.64 | 2022-05-21 11:33:17.071 [rank:4] [train], epoch: 3/50, iter: 100/834, loss: 0.51118, top1: 0.29865, throughput: 1307.89 | 2022-05-21 11:33:17.072 [rank:6] [train], epoch: 3/50, iter: 100/834, loss: 0.50935, top1: 0.30307, throughput: 1303.38 | 2022-05-21 11:33:17.072 [rank:2] [train], epoch: 3/50, iter: 100/834, loss: 0.51037, top1: 0.29552, throughput: 1302.89 | 2022-05-21 11:33:17.072 [rank:3] [train], epoch: 3/50, iter: 100/834, loss: 0.50595, top1: 0.30500, throughput: 1308.17 | 2022-05-21 11:33:17.072 [rank:4] [train], epoch: 3/50, iter: 200/834, loss: 0.50255, top1: 0.30615, throughput: 1322.70 | 2022-05-21 11:33:31.588 [rank:1] [train], epoch: 3/50, iter: 200/834, loss: 0.49999, top1: 0.31552, throughput: 1322.10 | 2022-05-21 11:33:31.592 [rank:3] [train], epoch: 3/50, iter: 200/834, loss: 0.49958, top1: 0.31068, throughput: 1322.44 | 2022-05-21 11:33:31.591 [rank:5] [train], epoch: 3/50, iter: 200/834, loss: 0.50101, top1: 0.31281, throughput: 1322.52 | 2022-05-21 11:33:31.588 [rank:2] [train], epoch: 3/50, iter: 200/834, loss: 0.50302, top1: 0.30964, throughput: 1322.69 | 2022-05-21 11:33:31.588 [rank:6] [train], epoch: 3/50, iter: 200/834, loss: 0.50009, top1: 0.31453, throughput: 1322.27 | 2022-05-21 11:33:31.593 [rank:0] [train], epoch: 3/50, iter: 200/834, loss: 0.49925, top1: 0.31719, throughput: 1322.13 | 2022-05-21 11:33:31.593 [rank:7] [train], epoch: 3/50, iter: 200/834, loss: 0.50089, top1: 0.31323, throughput: 1322.16 | 2022-05-21 11:33:31.593 [rank:5] [train], epoch: 3/50, iter: 300/834, loss: 0.49692, top1: 0.31979, throughput: 1320.62 | 2022-05-21 11:33:46.127 [rank:7] [train], epoch: 3/50, iter: 300/834, loss: 0.49749, top1: 0.31854, throughput: 1321.06 | 2022-05-21 11:33:46.127 [rank:4] [train], epoch: 3/50, iter: 300/834, loss: 0.49548, top1: 0.32182, throughput: 1320.54 | 2022-05-21 11:33:46.127 [rank:3] [train], epoch: 3/50, iter: 300/834, loss: 0.49616, top1: 0.32130, throughput: 1320.83 | 2022-05-21 11:33:46.127 [rank:0] [train], epoch: 3/50, iter: 300/834, loss: 0.49590, top1: 0.31984, throughput: 1320.88 | 2022-05-21 11:33:46.128 [rank:6] [train], epoch: 3/50, iter: 300/834, loss: 0.49650, top1: 0.31708, throughput: 1320.94 | 2022-05-21 11:33:46.128 [rank:1] [train], epoch: 3/50, iter: 300/834, loss: 0.49459, top1: 0.32036, throughput: 1320.82 | 2022-05-21 11:33:46.129 [rank:2] [train], epoch: 3/50, iter: 300/834, loss: 0.49383, top1: 0.32318, throughput: 1320.41 | 2022-05-21 11:33:46.129 [rank:4] [train], epoch: 3/50, iter: 400/834, loss: 0.49114, top1: 0.32656, throughput: 1311.34 | 2022-05-21 11:34:00.769 [rank:5] [train], epoch: 3/50, iter: 400/834, loss: 0.48939, top1: 0.33198, throughput: 1311.53 | 2022-05-21 11:34:00.766 [rank:6] [train], epoch: 3/50, iter: 400/834, loss: 0.49024, top1: 0.33021, throughput: 1311.39 | 2022-05-21 11:34:00.769 [rank:7] [train], epoch: 3/50, iter: 400/834, loss: 0.48721, top1: 0.33547, throughput: 1311.34 | 2022-05-21 11:34:00.768 [rank:2] [train], epoch: 3/50, iter: 400/834, loss: 0.49309, top1: 0.32536, throughput: 1311.63 | 2022-05-21 11:34:00.767 [rank:0] [train], epoch: 3/50, iter: 400/834, loss: 0.48737, top1: 0.33286, throughput: 1311.42 | 2022-05-21 11:34:00.769 [rank:1] [train], epoch: 3/50, iter: 400/834, loss: 0.48943, top1: 0.33005, throughput: 1311.46 | 2022-05-21 11:34:00.769 [rank:3] [train], epoch: 3/50, iter: 400/834, loss: 0.48993, top1: 0.32693, throughput: 1311.26 | 2022-05-21 11:34:00.769 [rank:7] [train], epoch: 3/50, iter: 500/834, loss: 0.48508, top1: 0.33573, throughput: 1319.82 | 2022-05-21 11:34:15.316 [rank:1] [train], epoch: 3/50, iter: 500/834, loss: 0.48616, top1: 0.33698, throughput: 1320.21 | 2022-05-21 11:34:15.312 [rank:4] [train], epoch: 3/50, iter: 500/834, loss: 0.48622, top1: 0.33271, throughput: 1320.15 | 2022-05-21 11:34:15.313 [rank:5] [train], epoch: 3/50, iter: 500/834, loss: 0.48279, top1: 0.33891, throughput: 1319.93 | 2022-05-21 11:34:15.313 [rank:0] [train], epoch: 3/50, iter: 500/834, loss: 0.48614, top1: 0.33510, throughput: 1319.99 | 2022-05-21 11:34:15.315 [rank:3] [train], epoch: 3/50, iter: 500/834, loss: 0.48659, top1: 0.33625, throughput: 1320.07 | 2022-05-21 11:34:15.314 [rank:6] [train], epoch: 3/50, iter: 500/834, loss: 0.48634, top1: 0.33583, throughput: 1319.61 | 2022-05-21 11:34:15.319 [rank:2] [train], epoch: 3/50, iter: 500/834, loss: 0.48452, top1: 0.33552, throughput: 1319.76 | 2022-05-21 11:34:15.315 [rank:4] [train], epoch: 3/50, iter: 600/834, loss: 0.47965, top1: 0.34187, throughput: 1310.51 | 2022-05-21 11:34:29.963 [rank:2] [train], epoch: 3/50, iter: 600/834, loss: 0.47930, top1: 0.34354, throughput: 1310.55 | 2022-05-21 11:34:29.966 [rank:6] [train], epoch: 3/50, iter: 600/834, loss: 0.47875, top1: 0.34719, throughput: 1310.79 | 2022-05-21 11:34:29.966 [rank:7] [train], epoch: 3/50, iter: 600/834, loss: 0.47671, top1: 0.34781, throughput: 1310.68 | 2022-05-21 11:34:29.965 [rank:5] [train], epoch: 3/50, iter: 600/834, loss: 0.48062, top1: 0.34609, throughput: 1310.49 | 2022-05-21 11:34:29.964 [rank:1] [train], epoch: 3/50, iter: 600/834, loss: 0.47934, top1: 0.34563, throughput: 1310.41 | 2022-05-21 11:34:29.964 [rank:0] [train], epoch: 3/50, iter: 600/834, loss: 0.47883, top1: 0.35000, throughput: 1310.49 | 2022-05-21 11:34:29.966 [rank:3] [train], epoch: 3/50, iter: 600/834, loss: 0.48325, top1: 0.33943, throughput: 1310.44 | 2022-05-21 11:34:29.966 [rank:4] [train], epoch: 3/50, iter: 700/834, loss: 0.47723, top1: 0.34943, throughput: 1306.95 | 2022-05-21 11:34:44.654 [rank:5] [train], epoch: 3/50, iter: 700/834, loss: 0.47532, top1: 0.35224, throughput: 1306.97 | 2022-05-21 11:34:44.654 [rank:3] [train], epoch: 3/50, iter: 700/834, loss: 0.47279, top1: 0.35516, throughput: 1307.16 | 2022-05-21 11:34:44.654 [rank:6] [train], epoch: 3/50, iter: 700/834, loss: 0.47609, top1: 0.34755, throughput: 1306.97 | 2022-05-21 11:34:44.657 [rank:7] [train], epoch: 3/50, iter: 700/834, loss: 0.47462, top1: 0.35141, throughput: 1306.91 | 2022-05-21 11:34:44.656 [rank:0] [train], epoch: 3/50, iter: 700/834, loss: 0.47455, top1: 0.35016, throughput: 1306.96 | 2022-05-21 11:34:44.656 [rank:1] [train], epoch: 3/50, iter: 700/834, loss: 0.47384, top1: 0.35083, throughput: 1306.86 | 2022-05-21 11:34:44.656 [rank:2] [train], epoch: 3/50, iter: 700/834, loss: 0.47383, top1: 0.34990, throughput: 1306.91 | 2022-05-21 11:34:44.657 [rank:2] [train], epoch: 3/50, iter: 800/834, loss: 0.47350, top1: 0.35307, throughput: 1326.72 | 2022-05-21 11:34:59.128 [rank:7] [train], epoch: 3/50, iter: 800/834, loss: 0.47331, top1: 0.35510, throughput: 1326.72 | 2022-05-21 11:34:59.127 [rank:3] [train], epoch: 3/50, iter: 800/834, loss: 0.47060, top1: 0.35917, throughput: 1326.53 | 2022-05-21 11:34:59.128 [rank:1] [train], epoch: 3/50, iter: 800/834, loss: 0.46942, top1: 0.36286, throughput: 1326.71 | 2022-05-21 11:34:59.128 [rank:0] [train], epoch: 3/50, iter: 800/834, loss: 0.47104, top1: 0.35995, throughput: 1326.38 | 2022-05-21 11:34:59.132 [rank:4] [train], epoch: 3/50, iter: 800/834, loss: 0.46958, top1: 0.35823, throughput: 1326.33 | 2022-05-21 11:34:59.130 [rank:6] [train], epoch: 3/50, iter: 800/834, loss: 0.47034, top1: 0.35208, throughput: 1326.57 | 2022-05-21 11:34:59.130 [rank:5] [train], epoch: 3/50, iter: 800/834, loss: 0.46971, top1: 0.36042, throughput: 1326.38 | 2022-05-21 11:34:59.130 [rank:0] [train], epoch: 3/50, iter: 834/834, loss: 0.46575, top1: 0.36320, throughput: 1325.18 | 2022-05-21 11:35:04.058 [rank:4] [train], epoch: 3/50, iter: 834/834, loss: 0.46735, top1: 0.36320, throughput: 1324.83 | 2022-05-21 11:35:04.058 [rank:7] [train], epoch: 3/50, iter: 834/834, loss: 0.46806, top1: 0.35371, throughput: 1323.97 | 2022-05-21 11:35:04.058 [rank:3] [train], epoch: 3/50, iter: 834/834, loss: 0.46856, top1: 0.35263, throughput: 1323.90 | 2022-05-21 11:35:04.059 [rank:6] [train], epoch: 3/50, iter: 834/834, loss: 0.46684, top1: 0.36872, throughput: 1324.36 | 2022-05-21 11:35:04.059 [rank:2] [train], epoch: 3/50, iter: 834/834, loss: 0.46830, top1: 0.36642, throughput: 1323.67 | 2022-05-21 11:35:04.060 [rank:1] [train], epoch: 3/50, iter: 834/834, loss: 0.46924, top1: 0.35938, throughput: 1323.57[rank:5] [train], epoch: 3/50, iter: 834/834, loss: 0.46596, top1: 0.36520, throughput: 1323.92 | 2022-05-21 11:35:04.060 | 2022-05-21 11:35:04.060 [rank:0] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.33920, throughput: 568.04 | 2022-05-21 11:35:15.061 [rank:7] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.33280, throughput: 566.87 | 2022-05-21 11:35:15.084 [rank:2] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.33344, throughput: 562.33 | 2022-05-21 11:35:15.175 [rank:6] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.33488, throughput: 559.78 | 2022-05-21 11:35:15.225 [rank:3] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.33040, throughput: 556.25 | 2022-05-21 11:35:15.295 [rank:4] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.33072, throughput: 555.38 | 2022-05-21 11:35:15.311 [rank:5] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.33344, throughput: 554.94 | 2022-05-21 11:35:15.323 [rank:1] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.33984, throughput: 554.48 | 2022-05-21 11:35:15.332 [rank:4] [train], epoch: 4/50, iter: 100/834, loss: 0.46179, top1: 0.37271, throughput: 1310.59 | 2022-05-21 11:35:29.961[rank:6] [train], epoch: 4/50, iter: 100/834, loss: 0.46467, top1: 0.36734, throughput: 1302.93 | 2022-05-21 11:35:29.961 [rank:3] [train], epoch: 4/50, iter: 100/834, loss: 0.45773, top1: 0.37531, throughput: 1309.13 | 2022-05-21 11:35:29.961 [rank:7] [train], epoch: 4/50, iter: 100/834, loss: 0.46050, top1: 0.37203, throughput: 1290.56 | 2022-05-21 11:35:29.961 [rank:5] [train], epoch: 4/50, iter: 100/834, loss: 0.46023, top1: 0.37682, throughput: 1311.52 | 2022-05-21 11:35:29.962 [rank:1] [train], epoch: 4/50, iter: 100/834, loss: 0.45942, top1: 0.37703, throughput: 1312.31 | 2022-05-21 11:35:29.962 [rank:0] [train], epoch: 4/50, iter: 100/834, loss: 0.45879, top1: 0.37776, throughput: 1288.01 | 2022-05-21 11:35:29.967 [rank:2] [train], epoch: 4/50, iter: 100/834, loss: 0.45489, top1: 0.38375, throughput: 1298.01 | 2022-05-21 11:35:29.966 [rank:0] [train], epoch: 4/50, iter: 200/834, loss: 0.45632, top1: 0.38339, throughput: 1318.42 | 2022-05-21 11:35:44.530 [rank:2] [train], epoch: 4/50, iter: 200/834, loss: 0.45743, top1: 0.37755, throughput: 1318.51 | 2022-05-21 11:35:44.528 [rank:1] [train], epoch: 4/50, iter: 200/834, loss: 0.45792, top1: 0.37964, throughput: 1318.03 | 2022-05-21 11:35:44.530 [rank:6] [train], epoch: 4/50, iter: 200/834, loss: 0.46036, top1: 0.37260, throughput: 1317.78 | 2022-05-21 11:35:44.531 [rank:3] [train], epoch: 4/50, iter: 200/834, loss: 0.45832, top1: 0.37859, throughput: 1317.89 | 2022-05-21 11:35:44.530 [rank:5] [train], epoch: 4/50, iter: 200/834, loss: 0.45833, top1: 0.37661, throughput: 1317.96 | 2022-05-21 11:35:44.530 [rank:4] [train], epoch: 4/50, iter: 200/834, loss: 0.45790, top1: 0.37703, throughput: 1317.36 | 2022-05-21 11:35:44.536 [rank:7] [train], epoch: 4/50, iter: 200/834, loss: 0.45897, top1: 0.37797, throughput: 1317.60 | 2022-05-21 11:35:44.533 [rank:7] [train], epoch: 4/50, iter: 300/834, loss: 0.45533, top1: 0.38620, throughput: 1316.07 | 2022-05-21 11:35:59.122 [rank:4] [train], epoch: 4/50, iter: 300/834, loss: 0.45366, top1: 0.38708, throughput: 1316.34 | 2022-05-21 11:35:59.122 [rank:6] [train], epoch: 4/50, iter: 300/834, loss: 0.45367, top1: 0.38401, throughput: 1315.80 | 2022-05-21 11:35:59.122 [rank:1] [train], epoch: 4/50, iter: 300/834, loss: 0.45160, top1: 0.38885, throughput: 1315.73 | 2022-05-21 11:35:59.122 [rank:0] [train], epoch: 4/50, iter: 300/834, loss: 0.45258, top1: 0.39036, throughput: 1315.77 | 2022-05-21 11:35:59.122 [rank:3] [train], epoch: 4/50, iter: 300/834, loss: 0.45781, top1: 0.38161, throughput: 1315.54[rank:5] [train], epoch: 4/50, iter: 300/834, loss: 0.45369, top1: 0.38161, throughput: 1315.66 | 2022-05-21 11:35:59.124 | 2022-05-21 11:35:59.124 [rank:2] [train], epoch: 4/50, iter: 300/834, loss: 0.45430, top1: 0.38703, throughput: 1315.40 | 2022-05-21 11:35:59.125 [rank:6] [train], epoch: 4/50, iter: 400/834, loss: 0.45034, top1: 0.39286, throughput: 1300.33 | 2022-05-21 11:36:13.888 [rank:7] [train], epoch: 4/50, iter: 400/834, loss: 0.44793, top1: 0.39203, throughput: 1300.39 | 2022-05-21 11:36:13.886 [rank:3] [train], epoch: 4/50, iter: 400/834, loss: 0.45048, top1: 0.39219, throughput: 1300.55 | 2022-05-21 11:36:13.887 [rank:2] [train], epoch: 4/50, iter: 400/834, loss: 0.45051, top1: 0.38417, throughput: 1300.65 | 2022-05-21 11:36:13.887 [rank:1] [train], epoch: 4/50, iter: 400/834, loss: 0.45258, top1: 0.38693, throughput: 1300.34 | 2022-05-21 11:36:13.888 [rank:0] [train], epoch: 4/50, iter: 400/834, loss: 0.45330, top1: 0.38526, throughput: 1300.27 | 2022-05-21 11:36:13.888 [rank:5] [train], epoch: 4/50, iter: 400/834, loss: 0.44944, top1: 0.39427, throughput: 1300.38 | 2022-05-21 11:36:13.889 [rank:4] [train], epoch: 4/50, iter: 400/834, loss: 0.44845, top1: 0.39031, throughput: 1299.99 | 2022-05-21 11:36:13.891 [rank:6] [train], epoch: 4/50, iter: 500/834, loss: 0.45012, top1: 0.39057, throughput: 1324.46 | 2022-05-21 11:36:28.384[rank:5] [train], epoch: 4/50, iter: 500/834, loss: 0.44929, top1: 0.39229, throughput: 1324.51 | 2022-05-21 11:36:28.385 [rank:0] [train], epoch: 4/50, iter: 500/834, loss: 0.44907, top1: 0.39318, throughput: 1324.47 | 2022-05-21 11:36:28.385 [rank:3] [train], epoch: 4/50, iter: 500/834, loss: 0.44670, top1: 0.39510, throughput: 1324.37 | 2022-05-21 11:36:28.385 [rank:7] [train], epoch: 4/50, iter: 500/834, loss: 0.44813, top1: 0.39458, throughput: 1324.03 | 2022-05-21 11:36:28.388 [rank:1] [train], epoch: 4/50, iter: 500/834, loss: 0.44663, top1: 0.39354, throughput: 1324.11 | 2022-05-21 11:36:28.388 [rank:4] [train], epoch: 4/50, iter: 500/834, loss: 0.44996, top1: 0.38813, throughput: 1324.40 | 2022-05-21 11:36:28.388 [rank:2] [train], epoch: 4/50, iter: 500/834, loss: 0.44823, top1: 0.39417, throughput: 1324.01 | 2022-05-21 11:36:28.388 [rank:4] [train], epoch: 4/50, iter: 600/834, loss: 0.44641, top1: 0.39422, throughput: 1328.07 | 2022-05-21 11:36:42.845 [rank:2] [train], epoch: 4/50, iter: 600/834, loss: 0.44671, top1: 0.39526, throughput: 1327.79 | 2022-05-21 11:36:42.848 [rank:3] [train], epoch: 4/50, iter: 600/834, loss: 0.44539, top1: 0.39635, throughput: 1327.64 | 2022-05-21 11:36:42.846 [rank:5] [train], epoch: 4/50, iter: 600/834, loss: 0.44665, top1: 0.39484, throughput: 1327.70 | 2022-05-21 11:36:42.846 [rank:1] [train], epoch: 4/50, iter: 600/834, loss: 0.44458, top1: 0.40000, throughput: 1327.98 | 2022-05-21 11:36:42.846 [rank:7] [train], epoch: 4/50, iter: 600/834, loss: 0.44687, top1: 0.39630, throughput: 1327.80 | 2022-05-21 11:36:42.848 [rank:0] [train], epoch: 4/50, iter: 600/834, loss: 0.44714, top1: 0.39599, throughput: 1327.15 | 2022-05-21 11:36:42.852 [rank:6] [train], epoch: 4/50, iter: 600/834, loss: 0.44508, top1: 0.39755, throughput: 1327.14 | 2022-05-21 11:36:42.852 [rank:3] [train], epoch: 4/50, iter: 700/834, loss: 0.44190, top1: 0.40552, throughput: 1316.98 | 2022-05-21 11:36:57.425 [rank:6] [train], epoch: 4/50, iter: 700/834, loss: 0.44381, top1: 0.39927, throughput: 1317.57 | 2022-05-21 11:36:57.424 [rank:0] [train], epoch: 4/50, iter: 700/834, loss: 0.43925, top1: 0.41146, throughput: 1317.55 | 2022-05-21 11:36:57.424 [rank:2] [train], epoch: 4/50, iter: 700/834, loss: 0.44090, top1: 0.40667, throughput: 1317.27 | 2022-05-21 11:36:57.424 [rank:1] [train], epoch: 4/50, iter: 700/834, loss: 0.44261, top1: 0.40167, throughput: 1316.97 | 2022-05-21 11:36:57.425 [rank:7] [train], epoch: 4/50, iter: 700/834, loss: 0.44431, top1: 0.39927, throughput: 1317.08 | 2022-05-21 11:36:57.425 [rank:5] [train], epoch: 4/50, iter: 700/834, loss: 0.44212, top1: 0.40224, throughput: 1316.86 | 2022-05-21 11:36:57.426 [rank:4] [train], epoch: 4/50, iter: 700/834, loss: 0.44058, top1: 0.40552, throughput: 1316.71 | 2022-05-21 11:36:57.427 [rank:7] [train], epoch: 4/50, iter: 800/834, loss: 0.44062, top1: 0.40865, throughput: 1327.18 | 2022-05-21 11:37:11.892 [rank:6] [train], epoch: 4/50, iter: 800/834, loss: 0.43851, top1: 0.41062, throughput: 1326.69 | 2022-05-21 11:37:11.896 [rank:0] [train], epoch: 4/50, iter: 800/834, loss: 0.44057, top1: 0.40542, throughput: 1326.79 | 2022-05-21 11:37:11.895 [rank:3] [train], epoch: 4/50, iter: 800/834, loss: 0.44059, top1: 0.40146, throughput: 1326.96 | 2022-05-21 11:37:11.894 [rank:2] [train], epoch: 4/50, iter: 800/834, loss: 0.43878, top1: 0.40557, throughput: 1326.46 | 2022-05-21 11:37:11.898 [rank:4] [train], epoch: 4/50, iter: 800/834, loss: 0.43900, top1: 0.40828, throughput: 1326.55 | 2022-05-21 11:37:11.901 [rank:5] [train], epoch: 4/50, iter: 800/834, loss: 0.43614, top1: 0.41109, throughput: 1326.78 | 2022-05-21 11:37:11.897 [rank:1] [train], epoch: 4/50, iter: 800/834, loss: 0.43854, top1: 0.40797, throughput: 1326.59 | 2022-05-21 11:37:11.898 [rank:4] [train], epoch: 4/50, iter: 834/834, loss: 0.43840, top1: 0.40564, throughput: 1327.74 | 2022-05-21 11:37:16.817 [rank:6] [train], epoch: 4/50, iter: 834/834, loss: 0.43643, top1: 0.40947, throughput: 1326.44 | 2022-05-21 11:37:16.817 [rank:0] [train], epoch: 4/50, iter: 834/834, loss: 0.43655, top1: 0.40947, throughput: 1326.12 | 2022-05-21 11:37:16.818 [rank:5] [train], epoch: 4/50, iter: 834/834, loss: 0.43948, top1: 0.40977, throughput: 1326.61 | 2022-05-21 11:37:16.818 [rank:2] [train], epoch: 4/50, iter: 834/834, loss: 0.43996, top1: 0.40319, throughput: 1327.11 | 2022-05-21 11:37:16.817 [rank:7] [train], epoch: 4/50, iter: 834/834, loss: 0.44134, top1: 0.40472, throughput: 1325.23[rank:3] [train], epoch: 4/50, iter: 834/834, loss: 0.43894, top1: 0.39966, throughput: 1325.85 | 2022-05-21 11:37:16.818| 2022-05-21 11:37:16.818 [rank:1] [train], epoch: 4/50, iter: 834/834, loss: 0.44121, top1: 0.40748, throughput: 1326.64 | 2022-05-21 11:37:16.819 [rank:0] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.37584, throughput: 553.02 | 2022-05-21 11:37:28.120 [rank:7] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.36592, throughput: 548.33 | 2022-05-21 11:37:28.216 [rank:1] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.36176, throughput: 543.81 | 2022-05-21 11:37:28.312 [rank:3] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.36928, throughput: 543.37 | 2022-05-21 11:37:28.320 [rank:6] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.36416, throughput: 542.75 | 2022-05-21 11:37:28.333 [rank:2] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.37312, throughput: 542.55 | 2022-05-21 11:37:28.337 [rank:4] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.36064, throughput: 540.00 | 2022-05-21 11:37:28.391 [rank:5] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.35376, throughput: 539.22 | 2022-05-21 11:37:28.409 [rank:5] [train], epoch: 5/50, iter: 100/834, loss: 0.43013, top1: 0.41870, throughput: 1330.62 | 2022-05-21 11:37:42.838 [rank:4] [train], epoch: 5/50, iter: 100/834, loss: 0.43219, top1: 0.41594, throughput: 1328.95 | 2022-05-21 11:37:42.839 [rank:3] [train], epoch: 5/50, iter: 100/834, loss: 0.43271, top1: 0.41786, throughput: 1322.54 | 2022-05-21 11:37:42.838 [rank:6] [train], epoch: 5/50, iter: 100/834, loss: 0.43018, top1: 0.41896, throughput: 1323.57 | 2022-05-21 11:37:42.839 [rank:0] [train], epoch: 5/50, iter: 100/834, loss: 0.42974, top1: 0.42172, throughput: 1304.45 | 2022-05-21 11:37:42.838 [rank:7] [train], epoch: 5/50, iter: 100/834, loss: 0.43308, top1: 0.41875, throughput: 1312.99 | 2022-05-21 11:37:42.840 [rank:2] [train], epoch: 5/50, iter: 100/834, loss: 0.43254, top1: 0.41823, throughput: 1323.46 | 2022-05-21 11:37:42.844 [rank:1] [train], epoch: 5/50, iter: 100/834, loss: 0.43223, top1: 0.41880, throughput: 1321.24 | 2022-05-21 11:37:42.844 [rank:6] [train], epoch: 5/50, iter: 200/834, loss: 0.42826, top1: 0.42771, throughput: 1323.03 | 2022-05-21 11:37:57.351 [rank:7] [train], epoch: 5/50, iter: 200/834, loss: 0.43351, top1: 0.41760, throughput: 1323.15 | 2022-05-21 11:37:57.350 [rank:2] [train], epoch: 5/50, iter: 200/834, loss: 0.42872, top1: 0.42771, throughput: 1323.38 | 2022-05-21 11:37:57.353 [rank:4] [train], epoch: 5/50, iter: 200/834, loss: 0.42647, top1: 0.42880, throughput: 1322.89 | 2022-05-21 11:37:57.352 [rank:5] [train], epoch: 5/50, iter: 200/834, loss: 0.43042, top1: 0.42240, throughput: 1322.88 | 2022-05-21 11:37:57.352 [rank:1] [train], epoch: 5/50, iter: 200/834, loss: 0.43046, top1: 0.42026, throughput: 1323.42 | 2022-05-21 11:37:57.352 [rank:0] [train], epoch: 5/50, iter: 200/834, loss: 0.42913, top1: 0.42443, throughput: 1322.41 | 2022-05-21 11:37:57.357 [rank:3] [train], epoch: 5/50, iter: 200/834, loss: 0.42911, top1: 0.42484, throughput: 1322.75 | 2022-05-21 11:37:57.353 [rank:7] [train], epoch: 5/50, iter: 300/834, loss: 0.42699, top1: 0.43047, throughput: 1324.35 | 2022-05-21 11:38:11.848 [rank:6] [train], epoch: 5/50, iter: 300/834, loss: 0.42623, top1: 0.42802, throughput: 1324.33 | 2022-05-21 11:38:11.849 [rank:2] [train], epoch: 5/50, iter: 300/834, loss: 0.42885, top1: 0.42276, throughput: 1324.55 | 2022-05-21 11:38:11.848 [rank:0] [train], epoch: 5/50, iter: 300/834, loss: 0.42980, top1: 0.42620, throughput: 1324.82 | 2022-05-21 11:38:11.850 [rank:3] [train], epoch: 5/50, iter: 300/834, loss: 0.42316, top1: 0.43380, throughput: 1324.42 | 2022-05-21 11:38:11.850 [rank:1] [train], epoch: 5/50, iter: 300/834, loss: 0.42743, top1: 0.42443, throughput: 1324.23 | 2022-05-21 11:38:11.851 [rank:4] [train], epoch: 5/50, iter: 300/834, loss: 0.42638, top1: 0.42901, throughput: 1323.92 | 2022-05-21 11:38:11.855 [rank:5] [train], epoch: 5/50, iter: 300/834, loss: 0.42737, top1: 0.42427, throughput: 1324.29 | 2022-05-21 11:38:11.850 [rank:5] [train], epoch: 5/50, iter: 400/834, loss: 0.42322, top1: 0.43312, throughput: 1329.54 | 2022-05-21 11:38:26.291 [rank:4] [train], epoch: 5/50, iter: 400/834, loss: 0.42258, top1: 0.43286, throughput: 1329.99 | 2022-05-21 11:38:26.291 [rank:6] [train], epoch: 5/50, iter: 400/834, loss: 0.42470, top1: 0.42641, throughput: 1329.11 | 2022-05-21 11:38:26.295 [rank:1] [train], epoch: 5/50, iter: 400/834, loss: 0.42309, top1: 0.43526, throughput: 1329.59 | 2022-05-21 11:38:26.291 [rank:0] [train], epoch: 5/50, iter: 400/834, loss: 0.42316, top1: 0.43057, throughput: 1329.49 | 2022-05-21 11:38:26.292 [rank:7] [train], epoch: 5/50, iter: 400/834, loss: 0.42552, top1: 0.43078, throughput: 1329.22 | 2022-05-21 11:38:26.293 [rank:3] [train], epoch: 5/50, iter: 400/834, loss: 0.42161, top1: 0.43422, throughput: 1329.32 | 2022-05-21 11:38:26.293 [rank:2] [train], epoch: 5/50, iter: 400/834, loss: 0.42232, top1: 0.43229, throughput: 1329.29 | 2022-05-21 11:38:26.292 [rank:7] [train], epoch: 5/50, iter: 500/834, loss: 0.41930, top1: 0.44036, throughput: 1320.37 | 2022-05-21 11:38:40.834 [rank:5] [train], epoch: 5/50, iter: 500/834, loss: 0.42074, top1: 0.43542, throughput: 1320.26 | 2022-05-21 11:38:40.834 [rank:4] [train], epoch: 5/50, iter: 500/834, loss: 0.42220, top1: 0.43349, throughput: 1319.84 | 2022-05-21 11:38:40.838 [rank:3] [train], epoch: 5/50, iter: 500/834, loss: 0.42008, top1: 0.44203, throughput: 1320.07[rank:6] [train], epoch: 5/50, iter: 500/834, loss: 0.42279, top1: 0.43167, throughput: 1320.34 | 2022-05-21 11:38:40.838 | 2022-05-21 11:38:40.837 [rank:0] [train], epoch: 5/50, iter: 500/834, loss: 0.42000, top1: 0.43297, throughput: 1319.76 | 2022-05-21 11:38:40.840 [rank:2] [train], epoch: 5/50, iter: 500/834, loss: 0.41923, top1: 0.43557, throughput: 1320.10 | 2022-05-21 11:38:40.836 [rank:1] [train], epoch: 5/50, iter: 500/834, loss: 0.41869, top1: 0.44234, throughput: 1319.91 | 2022-05-21 11:38:40.838 [rank:7] [train], epoch: 5/50, iter: 600/834, loss: 0.41911, top1: 0.43833, throughput: 1327.92 | 2022-05-21 11:38:55.293 [rank:4] [train], epoch: 5/50, iter: 600/834, loss: 0.42212, top1: 0.43583, throughput: 1328.28 | 2022-05-21 11:38:55.293 [rank:0] [train], epoch: 5/50, iter: 600/834, loss: 0.41989, top1: 0.43891, throughput: 1328.38 | 2022-05-21 11:38:55.293 [rank:6] [train], epoch: 5/50, iter: 600/834, loss: 0.41695, top1: 0.44417, throughput: 1327.79 | 2022-05-21 11:38:55.297 [rank:1] [train], epoch: 5/50, iter: 600/834, loss: 0.41556, top1: 0.44521, throughput: 1328.07 | 2022-05-21 11:38:55.295 [rank:5] [train], epoch: 5/50, iter: 600/834, loss: 0.41857, top1: 0.44229, throughput: 1327.75 | 2022-05-21 11:38:55.295 [rank:3] [train], epoch: 5/50, iter: 600/834, loss: 0.42145, top1: 0.43604, throughput: 1328.12 | 2022-05-21 11:38:55.295 [rank:2] [train], epoch: 5/50, iter: 600/834, loss: 0.42124, top1: 0.43880, throughput: 1327.43 | 2022-05-21 11:38:55.300 [rank:4] [train], epoch: 5/50, iter: 700/834, loss: 0.41554, top1: 0.44661, throughput: 1327.06 | 2022-05-21 11:39:09.761 [rank:6] [train], epoch: 5/50, iter: 700/834, loss: 0.41813, top1: 0.44620, throughput: 1327.40 | 2022-05-21 11:39:09.761 [rank:0] [train], epoch: 5/50, iter: 700/834, loss: 0.41843, top1: 0.44208, throughput: 1327.03 | 2022-05-21 11:39:09.762 [rank:7] [train], epoch: 5/50, iter: 700/834, loss: 0.41755, top1: 0.44063, throughput: 1326.94 | 2022-05-21 11:39:09.762 [rank:5] [train], epoch: 5/50, iter: 700/834, loss: 0.41743, top1: 0.44609, throughput: 1327.20 | 2022-05-21 11:39:09.761 [rank:1] [train], epoch: 5/50, iter: 700/834, loss: 0.41609, top1: 0.44354, throughput: 1327.15 | 2022-05-21 11:39:09.762 [rank:3] [train], epoch: 5/50, iter: 700/834, loss: 0.41329, top1: 0.44776, throughput: 1327.05 | 2022-05-21 11:39:09.763 [rank:2] [train], epoch: 5/50, iter: 700/834, loss: 0.41531, top1: 0.45286, throughput: 1327.69 | 2022-05-21 11:39:09.762 [rank:6] [train], epoch: 5/50, iter: 800/834, loss: 0.41470, top1: 0.44641, throughput: 1319.35 | 2022-05-21 11:39:24.314 [rank:4] [train], epoch: 5/50, iter: 800/834, loss: 0.41382, top1: 0.44964, throughput: 1319.32 | 2022-05-21 11:39:24.314 [rank:1] [train], epoch: 5/50, iter: 800/834, loss: 0.41730, top1: 0.44161, throughput: 1319.39 | 2022-05-21 11:39:24.314 [rank:5] [train], epoch: 5/50, iter: 800/834, loss: 0.41596, top1: 0.44484, throughput: 1319.32 | 2022-05-21 11:39:24.314 [rank:2] [train], epoch: 5/50, iter: 800/834, loss: 0.41195, top1: 0.44885, throughput: 1319.17 | 2022-05-21 11:39:24.316 [rank:0] [train], epoch: 5/50, iter: 800/834, loss: 0.41338, top1: 0.44323, throughput: 1319.23 | 2022-05-21 11:39:24.316 [rank:7] [train], epoch: 5/50, iter: 800/834, loss: 0.41328, top1: 0.44875, throughput: 1319.14 | 2022-05-21 11:39:24.317 [rank:3] [train], epoch: 5/50, iter: 800/834, loss: 0.41773, top1: 0.44089, throughput: 1319.28 | 2022-05-21 11:39:24.316 [rank:3] [train], epoch: 5/50, iter: 834/834, loss: 0.41294, top1: 0.44562, throughput: 1317.99 | 2022-05-21 11:39:29.269 [rank:7] [train], epoch: 5/50, iter: 834/834, loss: 0.41413, top1: 0.44301, throughput: 1317.88 | 2022-05-21 11:39:29.270 [rank:5] [train], epoch: 5/50, iter: 834/834, loss: 0.41658, top1: 0.44026, throughput: 1317.01 | 2022-05-21 11:39:29.271 [rank:6] [train], epoch: 5/50, iter: 834/834, loss: 0.41204, top1: 0.45159, throughput: 1316.77 | 2022-05-21 11:39:29.271 [rank:4] [train], epoch: 5/50, iter: 834/834, loss: 0.41403, top1: 0.44884, throughput: 1316.80[rank:0] [train], epoch: 5/50, iter: 834/834, loss: 0.40662, top1: 0.46140, throughput: 1317.31 | 2022-05-21 11:39:29.271 | 2022-05-21 11:39:29.271 [rank:1] [train], epoch: 5/50, iter: 834/834, loss: 0.40965, top1: 0.46170, throughput: 1316.80 | 2022-05-21 11:39:29.272 [rank:2] [train], epoch: 5/50, iter: 834/834, loss: 0.41003, top1: 0.44684, throughput: 1317.30 | 2022-05-21 11:39:29.272 [rank:4] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.41792, throughput: 534.60 | 2022-05-21 11:39:40.962 [rank:0] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.42944, throughput: 534.51 | 2022-05-21 11:39:40.964 [rank:7] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.43344, throughput: 534.29 | 2022-05-21 11:39:40.968 [rank:2] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.42768, throughput: 531.67 | 2022-05-21 11:39:41.027 [rank:1] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.43072, throughput: 531.04 | 2022-05-21 11:39:41.041 [rank:6] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.43120, throughput: 529.56 | 2022-05-21 11:39:41.074 [rank:3] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.42496, throughput: 527.17 | 2022-05-21 11:39:41.125 [rank:5] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.41472, throughput: 520.76 | 2022-05-21 11:39:41.273 [rank:7] [train], epoch: 6/50, iter: 100/834, loss: 0.40707, top1: 0.46245, throughput: 1302.36 | 2022-05-21 11:39:55.711 [rank:5] [train], epoch: 6/50, iter: 100/834, loss: 0.40715, top1: 0.45938, throughput: 1329.82 | 2022-05-21 11:39:55.711 [rank:2] [train], epoch: 6/50, iter: 100/834, loss: 0.40964, top1: 0.45880, throughput: 1307.65 | 2022-05-21 11:39:55.710 [rank:3] [train], epoch: 6/50, iter: 100/834, loss: 0.40471, top1: 0.46719, throughput: 1316.34 | 2022-05-21 11:39:55.711 [rank:4] [train], epoch: 6/50, iter: 100/834, loss: 0.40885, top1: 0.45490, throughput: 1301.69 | 2022-05-21 11:39:55.713 [rank:6] [train], epoch: 6/50, iter: 100/834, loss: 0.40882, top1: 0.45682, throughput: 1311.66 | 2022-05-21 11:39:55.712 [rank:1] [train], epoch: 6/50, iter: 100/834, loss: 0.40743, top1: 0.45542, throughput: 1308.41 | 2022-05-21 11:39:55.715 [rank:0] [train], epoch: 6/50, iter: 100/834, loss: 0.41075, top1: 0.45245, throughput: 1301.55 | 2022-05-21 11:39:55.716 [rank:1] [train], epoch: 6/50, iter: 200/834, loss: 0.40552, top1: 0.46333, throughput: 1312.38 | 2022-05-21 11:40:10.345 [rank:7] [train], epoch: 6/50, iter: 200/834, loss: 0.40545, top1: 0.46604, throughput: 1311.89 | 2022-05-21 11:40:10.346 [rank:5] [train], epoch: 6/50, iter: 200/834, loss: 0.40385, top1: 0.47000, throughput: 1311.85 | 2022-05-21 11:40:10.346 [rank:0] [train], epoch: 6/50, iter: 200/834, loss: 0.40440, top1: 0.46411, throughput: 1312.31 | 2022-05-21 11:40:10.347 [rank:3] [train], epoch: 6/50, iter: 200/834, loss: 0.40427, top1: 0.45953, throughput: 1311.74 | 2022-05-21 11:40:10.348 [rank:4] [train], epoch: 6/50, iter: 200/834, loss: 0.40636, top1: 0.46125, throughput: 1311.87 | 2022-05-21 11:40:10.348[rank:6] [train], epoch: 6/50, iter: 200/834, loss: 0.40465, top1: 0.45885, throughput: 1311.37 | 2022-05-21 11:40:10.353 [rank:2] [train], epoch: 6/50, iter: 200/834, loss: 0.40595, top1: 0.45969, throughput: 1311.60 | 2022-05-21 11:40:10.349 [rank:3] [train], epoch: 6/50, iter: 300/834, loss: 0.40780, top1: 0.45161, throughput: 1329.20 | 2022-05-21 11:40:24.793 [rank:7] [train], epoch: 6/50, iter: 300/834, loss: 0.40471, top1: 0.46802, throughput: 1328.97 | 2022-05-21 11:40:24.793 [rank:2] [train], epoch: 6/50, iter: 300/834, loss: 0.40485, top1: 0.46641, throughput: 1329.17 | 2022-05-21 11:40:24.794 [rank:6] [train], epoch: 6/50, iter: 300/834, loss: 0.40660, top1: 0.46167, throughput: 1329.58 | 2022-05-21 11:40:24.793 [rank:1] [train], epoch: 6/50, iter: 300/834, loss: 0.40680, top1: 0.45974, throughput: 1328.91 | 2022-05-21 11:40:24.793 [rank:5] [train], epoch: 6/50, iter: 300/834, loss: 0.40649, top1: 0.46130, throughput: 1328.98 | 2022-05-21 11:40:24.794 [rank:4] [train], epoch: 6/50, iter: 300/834, loss: 0.40553, top1: 0.46641, throughput: 1328.95 | 2022-05-21 11:40:24.796 [rank:0] [train], epoch: 6/50, iter: 300/834, loss: 0.40582, top1: 0.46297, throughput: 1328.76 | 2022-05-21 11:40:24.796 [rank:5] [train], epoch: 6/50, iter: 400/834, loss: 0.40190, top1: 0.46307, throughput: 1318.02 | 2022-05-21 11:40:39.361 [rank:4] [train], epoch: 6/50, iter: 400/834, loss: 0.40474, top1: 0.46828, throughput: 1317.89 | 2022-05-21 11:40:39.364 [rank:7] [train], epoch: 6/50, iter: 400/834, loss: 0.40177, top1: 0.46839, throughput: 1317.85 | 2022-05-21 11:40:39.362 [rank:3] [train], epoch: 6/50, iter: 400/834, loss: 0.40193, top1: 0.46448, throughput: 1317.67 | 2022-05-21 11:40:39.364 [rank:1] [train], epoch: 6/50, iter: 400/834, loss: 0.40329, top1: 0.45953, throughput: 1317.89 | 2022-05-21 11:40:39.362 [rank:6] [train], epoch: 6/50, iter: 400/834, loss: 0.40415, top1: 0.46500, throughput: 1317.55 | 2022-05-21 11:40:39.366 [rank:0] [train], epoch: 6/50, iter: 400/834, loss: 0.40177, top1: 0.46323, throughput: 1318.05 | 2022-05-21 11:40:39.363 [rank:2] [train], epoch: 6/50, iter: 400/834, loss: 0.40236, top1: 0.46802, throughput: 1317.80 | 2022-05-21 11:40:39.364 [rank:7] [train], epoch: 6/50, iter: 500/834, loss: 0.40690, top1: 0.45922, throughput: 1322.20 | 2022-05-21 11:40:53.884 [rank:2] [train], epoch: 6/50, iter: 500/834, loss: 0.40322, top1: 0.46458, throughput: 1322.38 | 2022-05-21 11:40:53.883 [rank:5] [train], epoch: 6/50, iter: 500/834, loss: 0.40010, top1: 0.47172, throughput: 1322.06 | 2022-05-21 11:40:53.884 [rank:6] [train], epoch: 6/50, iter: 500/834, loss: 0.40534, top1: 0.46453, throughput: 1322.51 | 2022-05-21 11:40:53.884 [rank:3] [train], epoch: 6/50, iter: 500/834, loss: 0.40303, top1: 0.45880, throughput: 1322.33 | 2022-05-21 11:40:53.884 [rank:4] [train], epoch: 6/50, iter: 500/834, loss: 0.40518, top1: 0.46229, throughput: 1322.35 | 2022-05-21 11:40:53.884 [rank:0] [train], epoch: 6/50, iter: 500/834, loss: 0.40253, top1: 0.46448, throughput: 1321.89 | 2022-05-21 11:40:53.888 [rank:1] [train], epoch: 6/50, iter: 500/834, loss: 0.40488, top1: 0.46411, throughput: 1321.82 | 2022-05-21 11:40:53.887 [rank:7] [train], epoch: 6/50, iter: 600/834, loss: 0.40411, top1: 0.45807, throughput: 1330.21 | 2022-05-21 11:41:08.318 [rank:5] [train], epoch: 6/50, iter: 600/834, loss: 0.39822, top1: 0.47245, throughput: 1330.23 | 2022-05-21 11:41:08.317 [rank:6] [train], epoch: 6/50, iter: 600/834, loss: 0.40360, top1: 0.46578, throughput: 1330.14 | 2022-05-21 11:41:08.318 [rank:1] [train], epoch: 6/50, iter: 600/834, loss: 0.39972, top1: 0.47526, throughput: 1330.49 | 2022-05-21 11:41:08.318 [rank:0] [train], epoch: 6/50, iter: 600/834, loss: 0.40131, top1: 0.47318, throughput: 1330.45 | 2022-05-21 11:41:08.319 [rank:4] [train], epoch: 6/50, iter: 600/834, loss: 0.40034, top1: 0.47042, throughput: 1329.99 | 2022-05-21 11:41:08.320 [rank:3] [train], epoch: 6/50, iter: 600/834, loss: 0.39906, top1: 0.47599, throughput: 1329.96 | 2022-05-21 11:41:08.320 [rank:2] [train], epoch: 6/50, iter: 600/834, loss: 0.40241, top1: 0.46729, throughput: 1329.86 | 2022-05-21 11:41:08.320 [rank:5] [train], epoch: 6/50, iter: 700/834, loss: 0.40311, top1: 0.46448, throughput: 1319.01 | 2022-05-21 11:41:22.874 [rank:3] [train], epoch: 6/50, iter: 700/834, loss: 0.40042, top1: 0.46969, throughput: 1319.29 | 2022-05-21 11:41:22.873 [rank:4] [train], epoch: 6/50, iter: 700/834, loss: 0.39990, top1: 0.47089, throughput: 1319.25 | 2022-05-21 11:41:22.874 [rank:2] [train], epoch: 6/50, iter: 700/834, loss: 0.39882, top1: 0.47146, throughput: 1319.35 | 2022-05-21 11:41:22.873 [rank:6] [train], epoch: 6/50, iter: 700/834, loss: 0.40044, top1: 0.47167, throughput: 1319.00 | 2022-05-21 11:41:22.875 [rank:1] [train], epoch: 6/50, iter: 700/834, loss: 0.39805, top1: 0.47052, throughput: 1318.98 | 2022-05-21 11:41:22.875 [rank:7] [train], epoch: 6/50, iter: 700/834, loss: 0.40167, top1: 0.46969, throughput: 1318.89 | 2022-05-21 11:41:22.875 [rank:0] [train], epoch: 6/50, iter: 700/834, loss: 0.39808, top1: 0.47161, throughput: 1318.96 | 2022-05-21 11:41:22.876 [rank:1] [train], epoch: 6/50, iter: 800/834, loss: 0.39862, top1: 0.47370, throughput: 1326.13 | 2022-05-21 11:41:37.353 [rank:3] [train], epoch: 6/50, iter: 800/834, loss: 0.39915, top1: 0.47286, throughput: 1325.96 | 2022-05-21 11:41:37.353 [rank:0] [train], epoch: 6/50, iter: 800/834, loss: 0.39747, top1: 0.47427, throughput: 1326.20[rank:6] [train], epoch: 6/50, iter: 800/834, loss: 0.39730, top1: 0.47516, throughput: 1326.04 | 2022-05-21 11:41:37.353| 2022-05-21 11:41:37.354 [rank:2] [train], epoch: 6/50, iter: 800/834, loss: 0.39816, top1: 0.47224, throughput: 1325.91 | 2022-05-21 11:41:37.354 [rank:7] [train], epoch: 6/50, iter: 800/834, loss: 0.40016, top1: 0.47010, throughput: 1325.94 | 2022-05-21 11:41:37.356 [rank:4] [train], epoch: 6/50, iter: 800/834, loss: 0.39660, top1: 0.47760, throughput: 1325.79 | 2022-05-21 11:41:37.356 [rank:5] [train], epoch: 6/50, iter: 800/834, loss: 0.39689, top1: 0.47740, throughput: 1325.75 | 2022-05-21 11:41:37.356 [rank:6] [train], epoch: 6/50, iter: 834/834, loss: 0.39678, top1: 0.47610, throughput: 1324.62 | 2022-05-21 11:41:42.282 [rank:0] [train], epoch: 6/50, iter: 834/834, loss: 0.39465, top1: 0.48300, throughput: 1324.41 | 2022-05-21 11:41:42.282 [rank:2] [train], epoch: 6/50, iter: 834/834, loss: 0.39926, top1: 0.47656, throughput: 1324.66 | 2022-05-21 11:41:42.282 [rank:4] [train], epoch: 6/50, iter: 834/834, loss: 0.39826, top1: 0.47212, throughput: 1325.11 | 2022-05-21 11:41:42.282 [rank:7] [train], epoch: 6/50, iter: 834/834, loss: 0.39358, top1: 0.47472, throughput: 1323.94 | 2022-05-21 11:41:42.286 [rank:5] [train], epoch: 6/50, iter: 834/834, loss: 0.40004, top1: 0.47181, throughput: 1323.93 | 2022-05-21 11:41:42.287 [rank:1] [train], epoch: 6/50, iter: 834/834, loss: 0.39866, top1: 0.47227, throughput: 1323.14 | 2022-05-21 11:41:42.287 [rank:3] [train], epoch: 6/50, iter: 834/834, loss: 0.39509, top1: 0.47687, throughput: 1323.20 | 2022-05-21 11:41:42.287 [rank:0] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.48448, throughput: 556.06 | 2022-05-21 11:41:53.522 [rank:4] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.48256, throughput: 554.24 | 2022-05-21 11:41:53.559 [rank:7] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.48000, throughput: 554.21 | 2022-05-21 11:41:53.564 [rank:2] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.48496, throughput: 552.23 | 2022-05-21 11:41:53.599 [rank:6] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.48672, throughput: 549.55 | 2022-05-21 11:41:53.655 [rank:3] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.47536, throughput: 547.65 | 2022-05-21 11:41:53.699 [rank:1] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.48208, throughput: 546.71 | 2022-05-21 11:41:53.719 [rank:5] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.47488, throughput: 543.51 | 2022-05-21 11:41:53.786 [rank:3] [train], epoch: 7/50, iter: 100/834, loss: 0.39543, top1: 0.48120, throughput: 1324.57 | 2022-05-21 11:42:08.195 [rank:5] [train], epoch: 7/50, iter: 100/834, loss: 0.38840, top1: 0.48953, throughput: 1332.51 | 2022-05-21 11:42:08.195 [rank:7] [train], epoch: 7/50, iter: 100/834, loss: 0.39295, top1: 0.48500, throughput: 1312.16 | 2022-05-21 11:42:08.196 [rank:6] [train], epoch: 7/50, iter: 100/834, loss: 0.39153, top1: 0.48620, throughput: 1320.42 | 2022-05-21 11:42:08.196 [rank:2] [train], epoch: 7/50, iter: 100/834, loss: 0.39171, top1: 0.48776, throughput: 1315.38 | 2022-05-21 11:42:08.196 [rank:4] [train], epoch: 7/50, iter: 100/834, loss: 0.39277, top1: 0.48146, throughput: 1311.61 | 2022-05-21 11:42:08.197 [rank:0] [train], epoch: 7/50, iter: 100/834, loss: 0.38880, top1: 0.48812, throughput: 1308.33 | 2022-05-21 11:42:08.197 [rank:1] [train], epoch: 7/50, iter: 100/834, loss: 0.39142, top1: 0.48380, throughput: 1326.04 | 2022-05-21 11:42:08.198 [rank:3] [train], epoch: 7/50, iter: 200/834, loss: 0.39216, top1: 0.48474, throughput: 1307.09[rank:5] [train], epoch: 7/50, iter: 200/834, loss: 0.39024, top1: 0.48708, throughput: 1307.17 | 2022-05-21 11:42:22.883 | 2022-05-21 11:42:22.884 [rank:2] [train], epoch: 7/50, iter: 200/834, loss: 0.39198, top1: 0.48750, throughput: 1307.32 | 2022-05-21 11:42:22.882 [rank:7] [train], epoch: 7/50, iter: 200/834, loss: 0.39279, top1: 0.48125, throughput: 1307.24 | 2022-05-21 11:42:22.883 [rank:6] [train], epoch: 7/50, iter: 200/834, loss: 0.39265, top1: 0.48495, throughput: 1307.02 | 2022-05-21 11:42:22.886 [rank:4] [train], epoch: 7/50, iter: 200/834, loss: 0.39034, top1: 0.47979, throughput: 1307.17 | 2022-05-21 11:42:22.886 [rank:1] [train], epoch: 7/50, iter: 200/834, loss: 0.38990, top1: 0.48932, throughput: 1307.33 | 2022-05-21 11:42:22.884 [rank:0] [train], epoch: 7/50, iter: 200/834, loss: 0.38944, top1: 0.49094, throughput: 1307.18 | 2022-05-21 11:42:22.886 [rank:0] [train], epoch: 7/50, iter: 300/834, loss: 0.39081, top1: 0.49047, throughput: 1329.04 | 2022-05-21 11:42:37.332 [rank:3] [train], epoch: 7/50, iter: 300/834, loss: 0.39205, top1: 0.48818, throughput: 1328.85 | 2022-05-21 11:42:37.332 [rank:6] [train], epoch: 7/50, iter: 300/834, loss: 0.38859, top1: 0.49208, throughput: 1328.98 | 2022-05-21 11:42:37.333 [rank:2] [train], epoch: 7/50, iter: 300/834, loss: 0.39492, top1: 0.48375, throughput: 1328.75 | 2022-05-21 11:42:37.332 [rank:7] [train], epoch: 7/50, iter: 300/834, loss: 0.38928, top1: 0.48844, throughput: 1328.71 | 2022-05-21 11:42:37.334 [rank:1] [train], epoch: 7/50, iter: 300/834, loss: 0.39470, top1: 0.47969, throughput: 1328.76 | 2022-05-21 11:42:37.334 [rank:4] [train], epoch: 7/50, iter: 300/834, loss: 0.38822, top1: 0.48531, throughput: 1328.86 | 2022-05-21 11:42:37.334 [rank:5] [train], epoch: 7/50, iter: 300/834, loss: 0.39147, top1: 0.48750, throughput: 1328.61 | 2022-05-21 11:42:37.334 [rank:3] [train], epoch: 7/50, iter: 400/834, loss: 0.39175, top1: 0.48896, throughput: 1325.91 | 2022-05-21 11:42:51.813 [rank:1] [train], epoch: 7/50, iter: 400/834, loss: 0.38804, top1: 0.49516, throughput: 1326.16 | 2022-05-21 11:42:51.812 [rank:7] [train], epoch: 7/50, iter: 400/834, loss: 0.39367, top1: 0.48573, throughput: 1326.09 | 2022-05-21 11:42:51.812 [rank:4] [train], epoch: 7/50, iter: 400/834, loss: 0.38763, top1: 0.49557, throughput: 1326.09 | 2022-05-21 11:42:51.813 [rank:5] [train], epoch: 7/50, iter: 400/834, loss: 0.39062, top1: 0.49245, throughput: 1326.14 | 2022-05-21 11:42:51.812 [rank:6] [train], epoch: 7/50, iter: 400/834, loss: 0.38826, top1: 0.49479, throughput: 1325.93 | 2022-05-21 11:42:51.813 [rank:2] [train], epoch: 7/50, iter: 400/834, loss: 0.38723, top1: 0.48964, throughput: 1325.86 | 2022-05-21 11:42:51.813 [rank:0] [train], epoch: 7/50, iter: 400/834, loss: 0.38864, top1: 0.49411, throughput: 1325.72 | 2022-05-21 11:42:51.815 [rank:4] [train], epoch: 7/50, iter: 500/834, loss: 0.39254, top1: 0.48286, throughput: 1327.32 | 2022-05-21 11:43:06.278 [rank:7] [train], epoch: 7/50, iter: 500/834, loss: 0.38900, top1: 0.49234, throughput: 1327.05 | 2022-05-21 11:43:06.280 [rank:5] [train], epoch: 7/50, iter: 500/834, loss: 0.39115, top1: 0.48526, throughput: 1327.26 | 2022-05-21 11:43:06.278 [rank:3] [train], epoch: 7/50, iter: 500/834, loss: 0.38948, top1: 0.48443, throughput: 1327.30 | 2022-05-21 11:43:06.278 [rank:0] [train], epoch: 7/50, iter: 500/834, loss: 0.38857, top1: 0.48849, throughput: 1327.43 | 2022-05-21 11:43:06.279 [rank:1] [train], epoch: 7/50, iter: 500/834, loss: 0.39025, top1: 0.48453, throughput: 1327.03 | 2022-05-21 11:43:06.280 [rank:6] [train], epoch: 7/50, iter: 500/834, loss: 0.39089, top1: 0.48719, throughput: 1327.16 | 2022-05-21 11:43:06.280 [rank:2] [train], epoch: 7/50, iter: 500/834, loss: 0.38871, top1: 0.48615, throughput: 1327.16 | 2022-05-21 11:43:06.280 [rank:1] [train], epoch: 7/50, iter: 600/834, loss: 0.38822, top1: 0.48688, throughput: 1329.06 | 2022-05-21 11:43:20.727 [rank:2] [train], epoch: 7/50, iter: 600/834, loss: 0.38859, top1: 0.49047, throughput: 1329.19 | 2022-05-21 11:43:20.725 [rank:4] [train], epoch: 7/50, iter: 600/834, loss: 0.39061, top1: 0.48443, throughput: 1328.77[rank:5] [train], epoch: 7/50, iter: 600/834, loss: 0.39052, top1: 0.48406, throughput: 1328.96 | 2022-05-21 11:43:20.726 | 2022-05-21 11:43:20.727 [rank:6] [train], epoch: 7/50, iter: 600/834, loss: 0.38890, top1: 0.48672, throughput: 1329.07 | 2022-05-21 11:43:20.727 [rank:0] [train], epoch: 7/50, iter: 600/834, loss: 0.38783, top1: 0.49745, throughput: 1328.90 | 2022-05-21 11:43:20.727 [rank:7] [train], epoch: 7/50, iter: 600/834, loss: 0.38694, top1: 0.49734, throughput: 1329.00 | 2022-05-21 11:43:20.727 [rank:3] [train], epoch: 7/50, iter: 600/834, loss: 0.39006, top1: 0.48542, throughput: 1328.73 | 2022-05-21 11:43:20.728 [rank:0] [train], epoch: 7/50, iter: 700/834, loss: 0.38858, top1: 0.49099, throughput: 1327.28 | 2022-05-21 11:43:35.192 [rank:4] [train], epoch: 7/50, iter: 700/834, loss: 0.38816, top1: 0.48937, throughput: 1327.57 | 2022-05-21 11:43:35.190 [rank:6] [train], epoch: 7/50, iter: 700/834, loss: 0.38948, top1: 0.48458, throughput: 1327.48 | 2022-05-21 11:43:35.190 [rank:2] [train], epoch: 7/50, iter: 700/834, loss: 0.38916, top1: 0.49057, throughput: 1327.46 | 2022-05-21 11:43:35.189 [rank:7] [train], epoch: 7/50, iter: 700/834, loss: 0.38941, top1: 0.48401, throughput: 1327.45 | 2022-05-21 11:43:35.191 [rank:3] [train], epoch: 7/50, iter: 700/834, loss: 0.38554, top1: 0.49417, throughput: 1327.61 | 2022-05-21 11:43:35.190 [rank:5] [train], epoch: 7/50, iter: 700/834, loss: 0.39108, top1: 0.48755, throughput: 1327.39 | 2022-05-21 11:43:35.190 [rank:1] [train], epoch: 7/50, iter: 700/834, loss: 0.38819, top1: 0.49187, throughput: 1327.31 | 2022-05-21 11:43:35.192 [rank:3] [train], epoch: 7/50, iter: 800/834, loss: 0.38907, top1: 0.48734, throughput: 1326.30 | 2022-05-21 11:43:49.667 [rank:7] [train], epoch: 7/50, iter: 800/834, loss: 0.38917, top1: 0.49063, throughput: 1326.33 | 2022-05-21 11:43:49.667 [rank:5] [train], epoch: 7/50, iter: 800/834, loss: 0.38962, top1: 0.49057, throughput: 1326.33 | 2022-05-21 11:43:49.666 [rank:1] [train], epoch: 7/50, iter: 800/834, loss: 0.38434, top1: 0.49745, throughput: 1326.29 | 2022-05-21 11:43:49.668 [rank:0] [train], epoch: 7/50, iter: 800/834, loss: 0.38708, top1: 0.49401, throughput: 1326.36 | 2022-05-21 11:43:49.668 [rank:6] [train], epoch: 7/50, iter: 800/834, loss: 0.38766, top1: 0.49302, throughput: 1326.13 | 2022-05-21 11:43:49.668 [rank:2] [train], epoch: 7/50, iter: 800/834, loss: 0.38265, top1: 0.50229, throughput: 1326.20 | 2022-05-21 11:43:49.666 [rank:4] [train], epoch: 7/50, iter: 800/834, loss: 0.39052, top1: 0.48432, throughput: 1325.91 | 2022-05-21 11:43:49.671 [rank:1] [train], epoch: 7/50, iter: 834/834, loss: 0.38865, top1: 0.49479, throughput: 1326.37 | 2022-05-21 11:43:54.590 [rank:7] [train], epoch: 7/50, iter: 834/834, loss: 0.38580, top1: 0.49219, throughput: 1326.04 | 2022-05-21 11:43:54.590 [rank:0] [train], epoch: 7/50, iter: 834/834, loss: 0.39296, top1: 0.47319, throughput: 1325.66 | 2022-05-21 11:43:54.592 [rank:5] [train], epoch: 7/50, iter: 834/834, loss: 0.38633, top1: 0.48897, throughput: 1325.14 | 2022-05-21 11:43:54.592 [rank:6] [train], epoch: 7/50, iter: 834/834, loss: 0.38507, top1: 0.49203, throughput: 1325.62 | 2022-05-21 11:43:54.593 [rank:4] [train], epoch: 7/50, iter: 834/834, loss: 0.38932, top1: 0.49678, throughput: 1326.19 | 2022-05-21 11:43:54.593 [rank:3] [train], epoch: 7/50, iter: 834/834, loss: 0.38930, top1: 0.48667, throughput: 1325.21 | 2022-05-21 11:43:54.593 [rank:2] [train], epoch: 7/50, iter: 834/834, loss: 0.38800, top1: 0.49510, throughput: 1325.18 | 2022-05-21 11:43:54.593 [rank:0] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.49776, throughput: 559.45 | 2022-05-21 11:44:05.764 [rank:7] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.49200, throughput: 558.41 | 2022-05-21 11:44:05.783 [rank:2] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.48992, throughput: 555.90 | 2022-05-21 11:44:05.836 [rank:4] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.48768, throughput: 554.88 | 2022-05-21 11:44:05.857 [rank:3] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.48016, throughput: 551.79 | 2022-05-21 11:44:05.919 [rank:6] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.49264, throughput: 550.15 | 2022-05-21 11:44:05.953 [rank:1] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.50240, throughput: 545.06 | 2022-05-21 11:44:06.057 [rank:5] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.48752, throughput: 542.14 | 2022-05-21 11:44:06.121 [rank:5] [train], epoch: 8/50, iter: 100/834, loss: 0.38296, top1: 0.49891, throughput: 1328.92 | 2022-05-21 11:44:20.569 [rank:3] [train], epoch: 8/50, iter: 100/834, loss: 0.38288, top1: 0.50214, throughput: 1310.58 | 2022-05-21 11:44:20.570 [rank:0] [train], epoch: 8/50, iter: 100/834, loss: 0.37686, top1: 0.50984, throughput: 1296.86 | 2022-05-21 11:44:20.569 [rank:2] [train], epoch: 8/50, iter: 100/834, loss: 0.37967, top1: 0.50703, throughput: 1303.05 | 2022-05-21 11:44:20.570 [rank:6] [train], epoch: 8/50, iter: 100/834, loss: 0.38040, top1: 0.50182, throughput: 1313.41 | 2022-05-21 11:44:20.572 [rank:4] [train], epoch: 8/50, iter: 100/834, loss: 0.38017, top1: 0.51146, throughput: 1304.78 | 2022-05-21 11:44:20.572 [rank:1] [train], epoch: 8/50, iter: 100/834, loss: 0.38194, top1: 0.50495, throughput: 1322.77 | 2022-05-21 11:44:20.572 [rank:7] [train], epoch: 8/50, iter: 100/834, loss: 0.38141, top1: 0.50141, throughput: 1298.22 | 2022-05-21 11:44:20.572 [rank:7] [train], epoch: 8/50, iter: 200/834, loss: 0.38026, top1: 0.50406, throughput: 1326.78 | 2022-05-21 11:44:35.043 [rank:1] [train], epoch: 8/50, iter: 200/834, loss: 0.38408, top1: 0.49875, throughput: 1326.76 | 2022-05-21 11:44:35.043 [rank:4] [train], epoch: 8/50, iter: 200/834, loss: 0.38260, top1: 0.49953, throughput: 1326.73 | 2022-05-21 11:44:35.043 [rank:3] [train], epoch: 8/50, iter: 200/834, loss: 0.38124, top1: 0.49969, throughput: 1326.44 | 2022-05-21 11:44:35.044 [rank:5] [train], epoch: 8/50, iter: 200/834, loss: 0.38038, top1: 0.50677, throughput: 1326.36 | 2022-05-21 11:44:35.044 [rank:6] [train], epoch: 8/50, iter: 200/834, loss: 0.38129, top1: 0.50609, throughput: 1326.56 | 2022-05-21 11:44:35.045 [rank:2] [train], epoch: 8/50, iter: 200/834, loss: 0.38044, top1: 0.50349, throughput: 1326.60 | 2022-05-21 11:44:35.043 [rank:0] [train], epoch: 8/50, iter: 200/834, loss: 0.38209, top1: 0.50083, throughput: 1326.34 | 2022-05-21 11:44:35.045 [rank:6] [train], epoch: 8/50, iter: 300/834, loss: 0.37950, top1: 0.50745, throughput: 1327.13 | 2022-05-21 11:44:49.513 [rank:5] [train], epoch: 8/50, iter: 300/834, loss: 0.38356, top1: 0.49849, throughput: 1327.06 | 2022-05-21 11:44:49.513 [rank:4] [train], epoch: 8/50, iter: 300/834, loss: 0.37845, top1: 0.51078, throughput: 1326.80[rank:7] [train], epoch: 8/50, iter: 300/834, loss: 0.37709, top1: 0.51000, throughput: 1326.81 | 2022-05-21 11:44:49.514| 2022-05-21 11:44:49.514 [rank:1] [train], epoch: 8/50, iter: 300/834, loss: 0.38479, top1: 0.49901, throughput: 1326.59 | 2022-05-21 11:44:49.516 [rank:0] [train], epoch: 8/50, iter: 300/834, loss: 0.38050, top1: 0.50552, throughput: 1326.93 | 2022-05-21 11:44:49.515 [rank:3] [train], epoch: 8/50, iter: 300/834, loss: 0.38227, top1: 0.50031, throughput: 1326.85 | 2022-05-21 11:44:49.515 [rank:2] [train], epoch: 8/50, iter: 300/834, loss: 0.38084, top1: 0.50167, throughput: 1326.59 | 2022-05-21 11:44:49.516 [rank:5] [train], epoch: 8/50, iter: 400/834, loss: 0.38353, top1: 0.50156, throughput: 1327.96 | 2022-05-21 11:45:03.971 [rank:4] [train], epoch: 8/50, iter: 400/834, loss: 0.38070, top1: 0.50104, throughput: 1328.02[rank:7] [train], epoch: 8/50, iter: 400/834, loss: 0.38186, top1: 0.50443, throughput: 1327.99 | 2022-05-21 11:45:03.972 | 2022-05-21 11:45:03.972 [rank:1] [train], epoch: 8/50, iter: 400/834, loss: 0.37902, top1: 0.50651, throughput: 1328.21 | 2022-05-21 11:45:03.972 [rank:6] [train], epoch: 8/50, iter: 400/834, loss: 0.38107, top1: 0.50583, throughput: 1327.67 | 2022-05-21 11:45:03.974 [rank:0] [train], epoch: 8/50, iter: 400/834, loss: 0.38327, top1: 0.49797, throughput: 1327.86 | 2022-05-21 11:45:03.974 [rank:2] [train], epoch: 8/50, iter: 400/834, loss: 0.38063, top1: 0.50526, throughput: 1328.03 | 2022-05-21 11:45:03.974 [rank:3] [train], epoch: 8/50, iter: 400/834, loss: 0.38330, top1: 0.50286, throughput: 1327.85 | 2022-05-21 11:45:03.974 [rank:7] [train], epoch: 8/50, iter: 500/834, loss: 0.38135, top1: 0.50474, throughput: 1320.90 | 2022-05-21 11:45:18.508 [rank:0] [train], epoch: 8/50, iter: 500/834, loss: 0.38296, top1: 0.49990, throughput: 1321.09 | 2022-05-21 11:45:18.507 [rank:5] [train], epoch: 8/50, iter: 500/834, loss: 0.38065, top1: 0.50578, throughput: 1320.87 | 2022-05-21 11:45:18.507 [rank:1] [train], epoch: 8/50, iter: 500/834, loss: 0.37903, top1: 0.50635, throughput: 1320.88 | 2022-05-21 11:45:18.508 [rank:4] [train], epoch: 8/50, iter: 500/834, loss: 0.38216, top1: 0.50281, throughput: 1320.87 | 2022-05-21 11:45:18.508 [rank:2] [train], epoch: 8/50, iter: 500/834, loss: 0.37884, top1: 0.50911, throughput: 1321.09 | 2022-05-21 11:45:18.507 [rank:6] [train], epoch: 8/50, iter: 500/834, loss: 0.38204, top1: 0.50365, throughput: 1320.88 | 2022-05-21 11:45:18.510 [rank:3] [train], epoch: 8/50, iter: 500/834, loss: 0.37968, top1: 0.50411, throughput: 1320.92 | 2022-05-21 11:45:18.510 [rank:4] [train], epoch: 8/50, iter: 600/834, loss: 0.38295, top1: 0.50516, throughput: 1327.42 | 2022-05-21 11:45:32.972 [rank:7] [train], epoch: 8/50, iter: 600/834, loss: 0.38047, top1: 0.50219, throughput: 1327.46 | 2022-05-21 11:45:32.971 [rank:5] [train], epoch: 8/50, iter: 600/834, loss: 0.38615, top1: 0.49245, throughput: 1327.28 | 2022-05-21 11:45:32.972 [rank:6] [train], epoch: 8/50, iter: 600/834, loss: 0.37958, top1: 0.50464, throughput: 1327.52 | 2022-05-21 11:45:32.973 [rank:3] [train], epoch: 8/50, iter: 600/834, loss: 0.38129, top1: 0.50115, throughput: 1327.50 | 2022-05-21 11:45:32.973 [rank:1] [train], epoch: 8/50, iter: 600/834, loss: 0.38132, top1: 0.50406, throughput: 1327.26 | 2022-05-21 11:45:32.974 [rank:2] [train], epoch: 8/50, iter: 600/834, loss: 0.38103, top1: 0.50250, throughput: 1327.37 | 2022-05-21 11:45:32.972 [rank:0] [train], epoch: 8/50, iter: 600/834, loss: 0.37995, top1: 0.50641, throughput: 1327.19 | 2022-05-21 11:45:32.974 [rank:1] [train], epoch: 8/50, iter: 700/834, loss: 0.37953, top1: 0.50484, throughput: 1315.53 | 2022-05-21 11:45:47.568 [rank:3] [train], epoch: 8/50, iter: 700/834, loss: 0.37549, top1: 0.51453, throughput: 1315.50 | 2022-05-21 11:45:47.568 [rank:7] [train], epoch: 8/50, iter: 700/834, loss: 0.38092, top1: 0.50188, throughput: 1315.39 | 2022-05-21 11:45:47.568 [rank:2] [train], epoch: 8/50, iter: 700/834, loss: 0.38134, top1: 0.50307, throughput: 1315.38 | 2022-05-21 11:45:47.569 [rank:4] [train], epoch: 8/50, iter: 700/834, loss: 0.37662, top1: 0.51219, throughput: 1315.38 | 2022-05-21 11:45:47.569 [rank:0] [train], epoch: 8/50, iter: 700/834, loss: 0.37873, top1: 0.50901, throughput: 1315.55 | 2022-05-21 11:45:47.569 [rank:6] [train], epoch: 8/50, iter: 700/834, loss: 0.37963, top1: 0.50901, throughput: 1315.30 | 2022-05-21 11:45:47.570 [rank:5] [train], epoch: 8/50, iter: 700/834, loss: 0.37989, top1: 0.50682, throughput: 1315.23 | 2022-05-21 11:45:47.571 [rank:1] [train], epoch: 8/50, iter: 800/834, loss: 0.37904, top1: 0.50724, throughput: 1322.50 | 2022-05-21 11:46:02.086 [rank:0] [train], epoch: 8/50, iter: 800/834, loss: 0.37778, top1: 0.50766, throughput: 1322.48 | 2022-05-21 11:46:02.087 [rank:3] [train], epoch: 8/50, iter: 800/834, loss: 0.37933, top1: 0.50927, throughput: 1322.40 | 2022-05-21 11:46:02.087 [rank:5] [train], epoch: 8/50, iter: 800/834, loss: 0.38058, top1: 0.50427, throughput: 1322.54 | 2022-05-21 11:46:02.088 [rank:2] [train], epoch: 8/50, iter: 800/834, loss: 0.37572, top1: 0.51224, throughput: 1322.43 | 2022-05-21 11:46:02.087 [rank:6] [train], epoch: 8/50, iter: 800/834, loss: 0.37894, top1: 0.50729, throughput: 1322.28 | 2022-05-21 11:46:02.091 [rank:7] [train], epoch: 8/50, iter: 800/834, loss: 0.38041, top1: 0.50682, throughput: 1322.27 | 2022-05-21 11:46:02.088 [rank:4] [train], epoch: 8/50, iter: 800/834, loss: 0.37873, top1: 0.50849, throughput: 1322.15 | 2022-05-21 11:46:02.090 [rank:7] [train], epoch: 8/50, iter: 834/834, loss: 0.37553, top1: 0.51547, throughput: 1325.58 | 2022-05-21 11:46:07.013 [rank:0] [train], epoch: 8/50, iter: 834/834, loss: 0.38058, top1: 0.50521, throughput: 1325.12 | 2022-05-21 11:46:07.013 [rank:5] [train], epoch: 8/50, iter: 834/834, loss: 0.37986, top1: 0.50551, throughput: 1325.16 | 2022-05-21 11:46:07.014 [rank:1] [train], epoch: 8/50, iter: 834/834, loss: 0.37551, top1: 0.50643, throughput: 1324.67 | 2022-05-21 11:46:07.014 [rank:2] [train], epoch: 8/50, iter: 834/834, loss: 0.37979, top1: 0.50214, throughput: 1324.78 | 2022-05-21 11:46:07.015 [rank:3] [train], epoch: 8/50, iter: 834/834, loss: 0.37763, top1: 0.50506, throughput: 1324.67 | 2022-05-21 11:46:07.015 [rank:4] [train], epoch: 8/50, iter: 834/834, loss: 0.38116, top1: 0.50061, throughput: 1325.38 | 2022-05-21 11:46:07.016 [rank:6] [train], epoch: 8/50, iter: 834/834, loss: 0.38046, top1: 0.50674, throughput: 1325.43 | 2022-05-21 11:46:07.016 [rank:7] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.51920, throughput: 563.12 | 2022-05-21 11:46:18.112 [rank:4] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.50896, throughput: 563.23 | 2022-05-21 11:46:18.112 [rank:0] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.51440, throughput: 563.03 | 2022-05-21 11:46:18.114 [rank:2] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.50464, throughput: 561.89 | 2022-05-21 11:46:18.138 [rank:6] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.51280, throughput: 557.59 | 2022-05-21 11:46:18.225 [rank:3] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.50752, throughput: 554.65 | 2022-05-21 11:46:18.284 [rank:1] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.51344, throughput: 549.03 | 2022-05-21 11:46:18.398 [rank:5] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.50256, throughput: 548.14 | 2022-05-21 11:46:18.416 [rank:7] [train], epoch: 9/50, iter: 100/834, loss: 0.37252, top1: 0.51500, throughput: 1305.48 | 2022-05-21 11:46:32.819 [rank:4] [train], epoch: 9/50, iter: 100/834, loss: 0.36907, top1: 0.52359, throughput: 1305.59 | 2022-05-21 11:46:32.818 [rank:1] [train], epoch: 9/50, iter: 100/834, loss: 0.37514, top1: 0.51417, throughput: 1331.41 | 2022-05-21 11:46:32.819 [rank:5] [train], epoch: 9/50, iter: 100/834, loss: 0.37413, top1: 0.51755, throughput: 1333.02 | 2022-05-21 11:46:32.820 [rank:2] [train], epoch: 9/50, iter: 100/834, loss: 0.37246, top1: 0.51818, throughput: 1307.83 | 2022-05-21 11:46:32.819 [rank:0] [train], epoch: 9/50, iter: 100/834, loss: 0.37396, top1: 0.51156, throughput: 1305.52 | 2022-05-21 11:46:32.821 [rank:3] [train], epoch: 9/50, iter: 100/834, loss: 0.37264, top1: 0.51766, throughput: 1320.75 | 2022-05-21 11:46:32.821 [rank:6] [train], epoch: 9/50, iter: 100/834, loss: 0.37558, top1: 0.51693, throughput: 1315.36 | 2022-05-21 11:46:32.822 [rank:4] [train], epoch: 9/50, iter: 200/834, loss: 0.37208, top1: 0.51740, throughput: 1317.56 | 2022-05-21 11:46:47.391 [rank:5] [train], epoch: 9/50, iter: 200/834, loss: 0.37286, top1: 0.51901, throughput: 1317.67 | 2022-05-21 11:46:47.391 [rank:3] [train], epoch: 9/50, iter: 200/834, loss: 0.37688, top1: 0.51016, throughput: 1317.71 | 2022-05-21 11:46:47.392 [rank:0] [train], epoch: 9/50, iter: 200/834, loss: 0.37099, top1: 0.52302, throughput: 1317.77 | 2022-05-21 11:46:47.391 [rank:2] [train], epoch: 9/50, iter: 200/834, loss: 0.37377, top1: 0.51536, throughput: 1317.45 | 2022-05-21 11:46:47.392 [rank:1] [train], epoch: 9/50, iter: 200/834, loss: 0.37151, top1: 0.51958, throughput: 1317.40 | 2022-05-21 11:46:47.393 [rank:6] [train], epoch: 9/50, iter: 200/834, loss: 0.37312, top1: 0.52286, throughput: 1317.31 | 2022-05-21 11:46:47.397 [rank:7] [train], epoch: 9/50, iter: 200/834, loss: 0.37148, top1: 0.51833, throughput: 1317.19 | 2022-05-21 11:46:47.395 [rank:4] [train], epoch: 9/50, iter: 300/834, loss: 0.37324, top1: 0.51115, throughput: 1325.72 | 2022-05-21 11:47:01.873 [rank:5] [train], epoch: 9/50, iter: 300/834, loss: 0.37620, top1: 0.51120, throughput: 1325.73 | 2022-05-21 11:47:01.874 [rank:0] [train], epoch: 9/50, iter: 300/834, loss: 0.37288, top1: 0.52083, throughput: 1325.62 | 2022-05-21 11:47:01.875 [rank:7] [train], epoch: 9/50, iter: 300/834, loss: 0.37471, top1: 0.51458, throughput: 1326.17 | 2022-05-21 11:47:01.873 [rank:2] [train], epoch: 9/50, iter: 300/834, loss: 0.37510, top1: 0.51068, throughput: 1325.77 | 2022-05-21 11:47:01.875 [rank:6] [train], epoch: 9/50, iter: 300/834, loss: 0.37372, top1: 0.52094, throughput: 1326.03 | 2022-05-21 11:47:01.876 [rank:3] [train], epoch: 9/50, iter: 300/834, loss: 0.37267, top1: 0.51141, throughput: 1325.64 | 2022-05-21 11:47:01.875 [rank:1] [train], epoch: 9/50, iter: 300/834, loss: 0.37395, top1: 0.51490, throughput: 1325.74 | 2022-05-21 11:47:01.875 [rank:5] [train], epoch: 9/50, iter: 400/834, loss: 0.37573, top1: 0.51281, throughput: 1329.76 | 2022-05-21 11:47:16.312 [rank:3] [train], epoch: 9/50, iter: 400/834, loss: 0.37411, top1: 0.51260, throughput: 1329.93 | 2022-05-21 11:47:16.312 [rank:7] [train], epoch: 9/50, iter: 400/834, loss: 0.37609, top1: 0.51354, throughput: 1329.74 | 2022-05-21 11:47:16.312 [rank:1] [train], epoch: 9/50, iter: 400/834, loss: 0.37305, top1: 0.51990, throughput: 1329.95 | 2022-05-21 11:47:16.312 [rank:6] [train], epoch: 9/50, iter: 400/834, loss: 0.37303, top1: 0.51813, throughput: 1329.96 | 2022-05-21 11:47:16.313 [rank:4] [train], epoch: 9/50, iter: 400/834, loss: 0.37127, top1: 0.51849, throughput: 1329.58 | 2022-05-21 11:47:16.314 [rank:0] [train], epoch: 9/50, iter: 400/834, loss: 0.37322, top1: 0.51500, throughput: 1329.70 | 2022-05-21 11:47:16.314 [rank:2] [train], epoch: 9/50, iter: 400/834, loss: 0.37421, top1: 0.51880, throughput: 1329.66 | 2022-05-21 11:47:16.314 [rank:1] [train], epoch: 9/50, iter: 500/834, loss: 0.37431, top1: 0.51240, throughput: 1322.65 | 2022-05-21 11:47:30.828 [rank:5] [train], epoch: 9/50, iter: 500/834, loss: 0.37386, top1: 0.51979, throughput: 1322.61 | 2022-05-21 11:47:30.829 [rank:6] [train], epoch: 9/50, iter: 500/834, loss: 0.37113, top1: 0.51880, throughput: 1322.62 | 2022-05-21 11:47:30.829 [rank:4] [train], epoch: 9/50, iter: 500/834, loss: 0.37113, top1: 0.52312, throughput: 1322.68 | 2022-05-21 11:47:30.830 [rank:0] [train], epoch: 9/50, iter: 500/834, loss: 0.37026, top1: 0.52229, throughput: 1322.69 | 2022-05-21 11:47:30.830 [rank:3] [train], epoch: 9/50, iter: 500/834, loss: 0.37358, top1: 0.51870, throughput: 1322.40 | 2022-05-21 11:47:30.831 [rank:7] [train], epoch: 9/50, iter: 500/834, loss: 0.37306, top1: 0.51682, throughput: 1322.57 | 2022-05-21 11:47:30.829 [rank:2] [train], epoch: 9/50, iter: 500/834, loss: 0.37456, top1: 0.51609, throughput: 1322.71 | 2022-05-21 11:47:30.830 [rank:0] [train], epoch: 9/50, iter: 600/834, loss: 0.37130, top1: 0.52130, throughput: 1329.19 | 2022-05-21 11:47:45.275 [rank:1] [train], epoch: 9/50, iter: 600/834, loss: 0.37097, top1: 0.52109, throughput: 1329.07 | 2022-05-21 11:47:45.275 [rank:6] [train], epoch: 9/50, iter: 600/834, loss: 0.37293, top1: 0.51948, throughput: 1329.19 | 2022-05-21 11:47:45.274 [rank:7] [train], epoch: 9/50, iter: 600/834, loss: 0.36747, top1: 0.52703, throughput: 1329.07 | 2022-05-21 11:47:45.275 [rank:4] [train], epoch: 9/50, iter: 600/834, loss: 0.37246, top1: 0.51688, throughput: 1329.21 | 2022-05-21 11:47:45.275 [rank:2] [train], epoch: 9/50, iter: 600/834, loss: 0.37400, top1: 0.51531, throughput: 1329.04 | 2022-05-21 11:47:45.276 [rank:5] [train], epoch: 9/50, iter: 600/834, loss: 0.37262, top1: 0.52109, throughput: 1328.93 | 2022-05-21 11:47:45.277 [rank:3] [train], epoch: 9/50, iter: 600/834, loss: 0.37116, top1: 0.51833, throughput: 1329.10 | 2022-05-21 11:47:45.277 [rank:6] [train], epoch: 9/50, iter: 700/834, loss: 0.37303, top1: 0.51568, throughput: 1322.54 | 2022-05-21 11:47:59.792 [rank:7] [train], epoch: 9/50, iter: 700/834, loss: 0.37330, top1: 0.52104, throughput: 1322.68 | 2022-05-21 11:47:59.792 [rank:0] [train], epoch: 9/50, iter: 700/834, loss: 0.37513, top1: 0.51516, throughput: 1322.51 | 2022-05-21 11:47:59.793 [rank:5] [train], epoch: 9/50, iter: 700/834, loss: 0.37147, top1: 0.52417, throughput: 1322.72 | 2022-05-21 11:47:59.792 [rank:1] [train], epoch: 9/50, iter: 700/834, loss: 0.37127, top1: 0.51953, throughput: 1322.51 | 2022-05-21 11:47:59.792 [rank:4] [train], epoch: 9/50, iter: 700/834, loss: 0.37398, top1: 0.51542, throughput: 1322.42 | 2022-05-21 11:47:59.794 [rank:2] [train], epoch: 9/50, iter: 700/834, loss: 0.37099, top1: 0.52443, throughput: 1322.44 | 2022-05-21 11:47:59.795 [rank:3] [train], epoch: 9/50, iter: 700/834, loss: 0.37139, top1: 0.51943, throughput: 1322.50 | 2022-05-21 11:47:59.795 [rank:3] [train], epoch: 9/50, iter: 800/834, loss: 0.37274, top1: 0.51974, throughput: 1328.86 | 2022-05-21 11:48:14.243 [rank:5] [train], epoch: 9/50, iter: 800/834, loss: 0.36966, top1: 0.52198, throughput: 1328.62 | 2022-05-21 11:48:14.243 [rank:2] [train], epoch: 9/50, iter: 800/834, loss: 0.37400, top1: 0.51609, throughput: 1328.88 | 2022-05-21 11:48:14.243 [rank:6] [train], epoch: 9/50, iter: 800/834, loss: 0.37092, top1: 0.52234, throughput: 1328.51 | 2022-05-21 11:48:14.244 [rank:0] [train], epoch: 9/50, iter: 800/834, loss: 0.37328, top1: 0.51724, throughput: 1328.53 | 2022-05-21 11:48:14.245 [rank:1] [train], epoch: 9/50, iter: 800/834, loss: 0.37359, top1: 0.51495, throughput: 1328.50 | 2022-05-21 11:48:14.245 [rank:7] [train], epoch: 9/50, iter: 800/834, loss: 0.37312, top1: 0.51823, throughput: 1328.24 | 2022-05-21 11:48:14.247 [rank:4] [train], epoch: 9/50, iter: 800/834, loss: 0.37387, top1: 0.51667, throughput: 1328.43 | 2022-05-21 11:48:14.247 [rank:5] [train], epoch: 9/50, iter: 834/834, loss: 0.37359, top1: 0.52160, throughput: 1325.42 | 2022-05-21 11:48:19.169 [rank:0] [train], epoch: 9/50, iter: 834/834, loss: 0.37550, top1: 0.50735, throughput: 1325.74 | 2022-05-21 11:48:19.169 [rank:6] [train], epoch: 9/50, iter: 834/834, loss: 0.37011, top1: 0.53002, throughput: 1325.49 | 2022-05-21 11:48:19.169 [rank:1] [train], epoch: 9/50, iter: 834/834, loss: 0.37195, top1: 0.52528, throughput: 1325.75 | 2022-05-21 11:48:19.169 [rank:3] [train], epoch: 9/50, iter: 834/834, loss: 0.37095, top1: 0.52007, throughput: 1325.18 | 2022-05-21 11:48:19.169 [rank:4] [train], epoch: 9/50, iter: 834/834, loss: 0.37344, top1: 0.51976, throughput: 1326.02 | 2022-05-21 11:48:19.170 [rank:7] [train], epoch: 9/50, iter: 834/834, loss: 0.37227, top1: 0.52252, throughput: 1325.96 | 2022-05-21 11:48:19.170 [rank:2] [train], epoch: 9/50, iter: 834/834, loss: 0.36664, top1: 0.52941, throughput: 1325.01 | 2022-05-21 11:48:19.170 [rank:0] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.52992, throughput: 563.27 | 2022-05-21 11:48:30.265 [rank:7] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.51328, throughput: 562.77 | 2022-05-21 11:48:30.276 [rank:2] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.51632, throughput: 558.63 | 2022-05-21 11:48:30.358 [rank:6] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.52016, throughput: 556.29 | 2022-05-21 11:48:30.404 [rank:4] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.52048, throughput: 556.12 | 2022-05-21 11:48:30.408 [rank:1] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.51872, throughput: 554.19 | 2022-05-21 11:48:30.447 [rank:3] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.51184, throughput: 551.15 | 2022-05-21 11:48:30.509 [rank:5] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.50576, throughput: 549.48 | 2022-05-21 11:48:30.543 [rank:6] [train], epoch: 10/50, iter: 100/834, loss: 0.36424, top1: 0.52938, throughput: 1310.01 | 2022-05-21 11:48:45.060 [rank:5] [train], epoch: 10/50, iter: 100/834, loss: 0.36552, top1: 0.53349, throughput: 1322.55 | 2022-05-21 11:48:45.060 [rank:2] [train], epoch: 10/50, iter: 100/834, loss: 0.36073, top1: 0.54151, throughput: 1305.93 | 2022-05-21 11:48:45.060 [rank:4] [train], epoch: 10/50, iter: 100/834, loss: 0.36470, top1: 0.53370, throughput: 1310.26 | 2022-05-21 11:48:45.062 [rank:0] [train], epoch: 10/50, iter: 100/834, loss: 0.36388, top1: 0.53464, throughput: 1297.54 | 2022-05-21 11:48:45.062 [rank:7] [train], epoch: 10/50, iter: 100/834, loss: 0.36669, top1: 0.52839, throughput: 1298.50 | 2022-05-21 11:48:45.062 [rank:1] [train], epoch: 10/50, iter: 100/834, loss: 0.36767, top1: 0.52719, throughput: 1313.68 | 2022-05-21 11:48:45.062 [rank:3] [train], epoch: 10/50, iter: 100/834, loss: 0.36609, top1: 0.52313, throughput: 1319.36 | 2022-05-21 11:48:45.062 [rank:2] [train], epoch: 10/50, iter: 200/834, loss: 0.36541, top1: 0.53359, throughput: 1330.11 | 2022-05-21 11:48:59.495 [rank:5] [train], epoch: 10/50, iter: 200/834, loss: 0.36679, top1: 0.53042, throughput: 1330.19 | 2022-05-21 11:48:59.494 [rank:4] [train], epoch: 10/50, iter: 200/834, loss: 0.36521, top1: 0.52828, throughput: 1330.28 | 2022-05-21 11:48:59.495 [rank:7] [train], epoch: 10/50, iter: 200/834, loss: 0.36679, top1: 0.52979, throughput: 1330.33 | 2022-05-21 11:48:59.495 [rank:0] [train], epoch: 10/50, iter: 200/834, loss: 0.36571, top1: 0.53052, throughput: 1330.05 | 2022-05-21 11:48:59.498 [rank:3] [train], epoch: 10/50, iter: 200/834, loss: 0.36865, top1: 0.52651, throughput: 1330.08 | 2022-05-21 11:48:59.497 [rank:1] [train], epoch: 10/50, iter: 200/834, loss: 0.36769, top1: 0.52437, throughput: 1330.09 | 2022-05-21 11:48:59.497 [rank:6] [train], epoch: 10/50, iter: 200/834, loss: 0.36774, top1: 0.52760, throughput: 1329.81 | 2022-05-21 11:48:59.498 [rank:7] [train], epoch: 10/50, iter: 300/834, loss: 0.36593, top1: 0.52859, throughput: 1329.86 | 2022-05-21 11:49:13.932 [rank:5] [train], epoch: 10/50, iter: 300/834, loss: 0.36694, top1: 0.52516, throughput: 1329.90 | 2022-05-21 11:49:13.932 [rank:6] [train], epoch: 10/50, iter: 300/834, loss: 0.36547, top1: 0.53312, throughput: 1330.18 | 2022-05-21 11:49:13.933 [rank:4] [train], epoch: 10/50, iter: 300/834, loss: 0.36734, top1: 0.53089, throughput: 1329.77 | 2022-05-21 11:49:13.934 [rank:3] [train], epoch: 10/50, iter: 300/834, loss: 0.36665, top1: 0.53250, throughput: 1329.88 | 2022-05-21 11:49:13.935 [rank:1] [train], epoch: 10/50, iter: 300/834, loss: 0.36778, top1: 0.52781, throughput: 1329.91 | 2022-05-21 11:49:13.934 [rank:0] [train], epoch: 10/50, iter: 300/834, loss: 0.36583, top1: 0.53057, throughput: 1329.88 | 2022-05-21 11:49:13.935 [rank:2] [train], epoch: 10/50, iter: 300/834, loss: 0.36771, top1: 0.52844, throughput: 1329.61 | 2022-05-21 11:49:13.936 [rank:6] [train], epoch: 10/50, iter: 400/834, loss: 0.36656, top1: 0.53266, throughput: 1327.82 | 2022-05-21 11:49:28.392 [rank:7] [train], epoch: 10/50, iter: 400/834, loss: 0.36475, top1: 0.52833, throughput: 1327.80 | 2022-05-21 11:49:28.392 [rank:0] [train], epoch: 10/50, iter: 400/834, loss: 0.36967, top1: 0.52240, throughput: 1327.91 | 2022-05-21 11:49:28.394 [rank:2] [train], epoch: 10/50, iter: 400/834, loss: 0.36802, top1: 0.52474, throughput: 1328.09 | 2022-05-21 11:49:28.392 [rank:3] [train], epoch: 10/50, iter: 400/834, loss: 0.36445, top1: 0.52714, throughput: 1327.82 | 2022-05-21 11:49:28.394 [rank:1] [train], epoch: 10/50, iter: 400/834, loss: 0.36534, top1: 0.52880, throughput: 1327.85 | 2022-05-21 11:49:28.394 [rank:4] [train], epoch: 10/50, iter: 400/834, loss: 0.36853, top1: 0.52578, throughput: 1327.56 | 2022-05-21 11:49:28.396 [rank:5] [train], epoch: 10/50, iter: 400/834, loss: 0.36860, top1: 0.52734, throughput: 1327.37 | 2022-05-21 11:49:28.396 [rank:5] [train], epoch: 10/50, iter: 500/834, loss: 0.36932, top1: 0.52380, throughput: 1329.53 | 2022-05-21 11:49:42.838 [rank:7] [train], epoch: 10/50, iter: 500/834, loss: 0.36869, top1: 0.52297, throughput: 1329.07 | 2022-05-21 11:49:42.838 [rank:4] [train], epoch: 10/50, iter: 500/834, loss: 0.37047, top1: 0.52370, throughput: 1329.42 | 2022-05-21 11:49:42.839 [rank:6] [train], epoch: 10/50, iter: 500/834, loss: 0.36715, top1: 0.52964, throughput: 1329.01 | 2022-05-21 11:49:42.839 [rank:0] [train], epoch: 10/50, iter: 500/834, loss: 0.36910, top1: 0.52151, throughput: 1329.08[rank:2] [train], epoch: 10/50, iter: 500/834, loss: 0.36770, top1: 0.52635, throughput: 1329.13 | 2022-05-21 11:49:42.838| 2022-05-21 11:49:42.840 [rank:3] [train], epoch: 10/50, iter: 500/834, loss: 0.36834, top1: 0.53125, throughput: 1328.81 | 2022-05-21 11:49:42.843 [rank:1] [train], epoch: 10/50, iter: 500/834, loss: 0.36538, top1: 0.52958, throughput: 1328.82 | 2022-05-21 11:49:42.843 [rank:1] [train], epoch: 10/50, iter: 600/834, loss: 0.36574, top1: 0.53135, throughput: 1320.17 | 2022-05-21 11:49:57.386 [rank:6] [train], epoch: 10/50, iter: 600/834, loss: 0.36712, top1: 0.53182, throughput: 1319.78 | 2022-05-21 11:49:57.387 [rank:5] [train], epoch: 10/50, iter: 600/834, loss: 0.36743, top1: 0.52687, throughput: 1319.71 | 2022-05-21 11:49:57.386 [rank:0] [train], epoch: 10/50, iter: 600/834, loss: 0.36795, top1: 0.52891, throughput: 1319.90 | 2022-05-21 11:49:57.386 [rank:7] [train], epoch: 10/50, iter: 600/834, loss: 0.36704, top1: 0.52948, throughput: 1319.78 | 2022-05-21 11:49:57.386 [rank:3] [train], epoch: 10/50, iter: 600/834, loss: 0.36398, top1: 0.53641, throughput: 1320.10 | 2022-05-21 11:49:57.388 [rank:4] [train], epoch: 10/50, iter: 600/834, loss: 0.36880, top1: 0.52151, throughput: 1319.50 | 2022-05-21 11:49:57.390 [rank:2] [train], epoch: 10/50, iter: 600/834, loss: 0.36893, top1: 0.52266, throughput: 1319.57 | 2022-05-21 11:49:57.388 [rank:4] [train], epoch: 10/50, iter: 700/834, loss: 0.36785, top1: 0.52760, throughput: 1327.06 | 2022-05-21 11:50:11.858 [rank:1] [train], epoch: 10/50, iter: 700/834, loss: 0.36674, top1: 0.52740, throughput: 1326.74 | 2022-05-21 11:50:11.858 [rank:7] [train], epoch: 10/50, iter: 700/834, loss: 0.36576, top1: 0.53151, throughput: 1326.68 | 2022-05-21 11:50:11.858 [rank:5] [train], epoch: 10/50, iter: 700/834, loss: 0.36884, top1: 0.52677, throughput: 1326.67 | 2022-05-21 11:50:11.858 [rank:2] [train], epoch: 10/50, iter: 700/834, loss: 0.36465, top1: 0.53359, throughput: 1326.88 | 2022-05-21 11:50:11.858 [rank:3] [train], epoch: 10/50, iter: 700/834, loss: 0.36794, top1: 0.52891, throughput: 1326.75 | 2022-05-21 11:50:11.859 [rank:0] [train], epoch: 10/50, iter: 700/834, loss: 0.36793, top1: 0.52677, throughput: 1326.45 | 2022-05-21 11:50:11.861 [rank:6] [train], epoch: 10/50, iter: 700/834, loss: 0.36991, top1: 0.52563, throughput: 1326.55 | 2022-05-21 11:50:11.861 [rank:5] [train], epoch: 10/50, iter: 800/834, loss: 0.36372, top1: 0.53161, throughput: 1327.25 | 2022-05-21 11:50:26.324 [rank:6] [train], epoch: 10/50, iter: 800/834, loss: 0.36435, top1: 0.53365, throughput: 1327.51 | 2022-05-21 11:50:26.324 [rank:2] [train], epoch: 10/50, iter: 800/834, loss: 0.36895, top1: 0.52688, throughput: 1327.33 | 2022-05-21 11:50:26.323 [rank:1] [train], epoch: 10/50, iter: 800/834, loss: 0.36502, top1: 0.53328, throughput: 1327.07 | 2022-05-21 11:50:26.326 [rank:0] [train], epoch: 10/50, iter: 800/834, loss: 0.36918, top1: 0.52323, throughput: 1327.37 | 2022-05-21 11:50:26.326 [rank:3] [train], epoch: 10/50, iter: 800/834, loss: 0.36509, top1: 0.53130, throughput: 1327.35 | 2022-05-21 11:50:26.324 [rank:7] [train], epoch: 10/50, iter: 800/834, loss: 0.36728, top1: 0.52969, throughput: 1327.14 | 2022-05-21 11:50:26.326 [rank:4] [train], epoch: 10/50, iter: 800/834, loss: 0.36811, top1: 0.52594, throughput: 1327.01 | 2022-05-21 11:50:26.326 [rank:5] [train], epoch: 10/50, iter: 834/834, loss: 0.36700, top1: 0.52987, throughput: 1324.14 | 2022-05-21 11:50:31.254 [rank:0] [train], epoch: 10/50, iter: 834/834, loss: 0.36657, top1: 0.52987, throughput: 1324.06 | 2022-05-21 11:50:31.256 [rank:6] [train], epoch: 10/50, iter: 834/834, loss: 0.37017, top1: 0.52727, throughput: 1323.40 | 2022-05-21 11:50:31.257 [rank:7] [train], epoch: 10/50, iter: 834/834, loss: 0.36338, top1: 0.53661, throughput: 1323.69 | 2022-05-21 11:50:31.257 [rank:2] [train], epoch: 10/50, iter: 834/834, loss: 0.36860, top1: 0.52390, throughput: 1323.17 | 2022-05-21 11:50:31.257 [rank:4] [train], epoch: 10/50, iter: 834/834, loss: 0.36781, top1: 0.52773, throughput: 1323.77 | 2022-05-21 11:50:31.258 [rank:1] [train], epoch: 10/50, iter: 834/834, loss: 0.36470, top1: 0.53784, throughput: 1323.11 | 2022-05-21 11:50:31.260 [rank:3] [train], epoch: 10/50, iter: 834/834, loss: 0.36613, top1: 0.52788, throughput: 1322.48 | 2022-05-21 11:50:31.260 [rank:0] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.50064, throughput: 579.50 | 2022-05-21 11:50:42.041 [rank:7] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.49632, throughput: 575.88 | 2022-05-21 11:50:42.110 [rank:2] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.49824, throughput: 574.65 | 2022-05-21 11:50:42.133 [rank:4] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.49712, throughput: 572.61 | 2022-05-21 11:50:42.173 [rank:6] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.50656, throughput: 572.21 | 2022-05-21 11:50:42.179 [rank:3] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.49104, throughput: 568.93 | 2022-05-21 11:50:42.246 [rank:5] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.49520, throughput: 566.17 | 2022-05-21 11:50:42.293 [rank:1] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.49760, throughput: 563.90 | 2022-05-21 11:50:42.343 [rank:1] [train], epoch: 11/50, iter: 100/834, loss: 0.36458, top1: 0.52875, throughput: 1331.52 | 2022-05-21 11:50:56.763 [rank:2] [train], epoch: 11/50, iter: 100/834, loss: 0.36249, top1: 0.53552, throughput: 1312.42 | 2022-05-21 11:50:56.763 [rank:0] [train], epoch: 11/50, iter: 100/834, loss: 0.36022, top1: 0.53859, throughput: 1304.18 | 2022-05-21 11:50:56.763 [rank:5] [train], epoch: 11/50, iter: 100/834, loss: 0.36349, top1: 0.52958, throughput: 1326.87 | 2022-05-21 11:50:56.764 [rank:4] [train], epoch: 11/50, iter: 100/834, loss: 0.36250, top1: 0.53672, throughput: 1315.76[rank:3] [train], epoch: 11/50, iter: 100/834, loss: 0.36055, top1: 0.53818, throughput: 1322.43 | 2022-05-21 11:50:56.764 | 2022-05-21 11:50:56.765 [rank:6] [train], epoch: 11/50, iter: 100/834, loss: 0.35874, top1: 0.54578, throughput: 1316.30 | 2022-05-21 11:50:56.766 [rank:7] [train], epoch: 11/50, iter: 100/834, loss: 0.35795, top1: 0.54594, throughput: 1310.03 | 2022-05-21 11:50:56.766 [rank:0] [train], epoch: 11/50, iter: 200/834, loss: 0.35952, top1: 0.54109, throughput: 1327.86 | 2022-05-21 11:51:11.222 [rank:5] [train], epoch: 11/50, iter: 200/834, loss: 0.36075, top1: 0.53953, throughput: 1327.90 | 2022-05-21 11:51:11.223 [rank:1] [train], epoch: 11/50, iter: 200/834, loss: 0.35998, top1: 0.53875, throughput: 1327.77[rank:7] [train], epoch: 11/50, iter: 200/834, loss: 0.36236, top1: 0.53766, throughput: 1328.15 | 2022-05-21 11:51:11.223 | 2022-05-21 11:51:11.222 [rank:3] [train], epoch: 11/50, iter: 200/834, loss: 0.36231, top1: 0.53729, throughput: 1327.79 | 2022-05-21 11:51:11.224 [rank:2] [train], epoch: 11/50, iter: 200/834, loss: 0.35738, top1: 0.54406, throughput: 1327.59 | 2022-05-21 11:51:11.225 [rank:4] [train], epoch: 11/50, iter: 200/834, loss: 0.36294, top1: 0.53120, throughput: 1327.55 | 2022-05-21 11:51:11.228 [rank:6] [train], epoch: 11/50, iter: 200/834, loss: 0.36180, top1: 0.53719, throughput: 1327.56 | 2022-05-21 11:51:11.228 [rank:7] [train], epoch: 11/50, iter: 300/834, loss: 0.36261, top1: 0.53792, throughput: 1318.16 | 2022-05-21 11:51:25.788 [rank:5] [train], epoch: 11/50, iter: 300/834, loss: 0.36050, top1: 0.54021, throughput: 1318.11[rank:2] [train], epoch: 11/50, iter: 300/834, loss: 0.36131, top1: 0.53974, throughput: 1318.38 | 2022-05-21 11:51:25.789| 2022-05-21 11:51:25.788 [rank:6] [train], epoch: 11/50, iter: 300/834, loss: 0.36268, top1: 0.53667, throughput: 1318.57 | 2022-05-21 11:51:25.789 [rank:4] [train], epoch: 11/50, iter: 300/834, loss: 0.36316, top1: 0.52937, throughput: 1318.42 | 2022-05-21 11:51:25.790 [rank:1] [train], epoch: 11/50, iter: 300/834, loss: 0.36204, top1: 0.53609, throughput: 1317.97 | 2022-05-21 11:51:25.791 [rank:0] [train], epoch: 11/50, iter: 300/834, loss: 0.36318, top1: 0.53464, throughput: 1317.88 | 2022-05-21 11:51:25.791 [rank:3] [train], epoch: 11/50, iter: 300/834, loss: 0.36216, top1: 0.53422, throughput: 1318.17 | 2022-05-21 11:51:25.790 [rank:2] [train], epoch: 11/50, iter: 400/834, loss: 0.36547, top1: 0.52995, throughput: 1328.38 | 2022-05-21 11:51:40.242 [rank:5] [train], epoch: 11/50, iter: 400/834, loss: 0.36086, top1: 0.54182, throughput: 1328.40 | 2022-05-21 11:51:40.242 [rank:7] [train], epoch: 11/50, iter: 400/834, loss: 0.36235, top1: 0.53755, throughput: 1328.32 | 2022-05-21 11:51:40.243 [rank:1] [train], epoch: 11/50, iter: 400/834, loss: 0.36370, top1: 0.53479, throughput: 1328.54 | 2022-05-21 11:51:40.243 [rank:3] [train], epoch: 11/50, iter: 400/834, loss: 0.36398, top1: 0.53938, throughput: 1328.32 | 2022-05-21 11:51:40.244 [rank:0] [train], epoch: 11/50, iter: 400/834, loss: 0.36156, top1: 0.53885, throughput: 1328.39 | 2022-05-21 11:51:40.245 [rank:4] [train], epoch: 11/50, iter: 400/834, loss: 0.36098, top1: 0.53677, throughput: 1328.32 | 2022-05-21 11:51:40.245 [rank:6] [train], epoch: 11/50, iter: 400/834, loss: 0.36402, top1: 0.53594, throughput: 1328.19 | 2022-05-21 11:51:40.245 [rank:5] [train], epoch: 11/50, iter: 500/834, loss: 0.36287, top1: 0.53484, throughput: 1328.76 | 2022-05-21 11:51:54.692 [rank:3] [train], epoch: 11/50, iter: 500/834, loss: 0.36082, top1: 0.53807, throughput: 1328.97 | 2022-05-21 11:51:54.692 [rank:1] [train], epoch: 11/50, iter: 500/834, loss: 0.36298, top1: 0.53698, throughput: 1328.80 | 2022-05-21 11:51:54.692 [rank:6] [train], epoch: 11/50, iter: 500/834, loss: 0.36038, top1: 0.53750, throughput: 1328.93 | 2022-05-21 11:51:54.693 [rank:4] [train], epoch: 11/50, iter: 500/834, loss: 0.36440, top1: 0.53656, throughput: 1328.96 | 2022-05-21 11:51:54.692 [rank:7] [train], epoch: 11/50, iter: 500/834, loss: 0.36227, top1: 0.53229, throughput: 1328.81 | 2022-05-21 11:51:54.692 [rank:0] [train], epoch: 11/50, iter: 500/834, loss: 0.36170, top1: 0.54240, throughput: 1328.94 | 2022-05-21 11:51:54.693 [rank:2] [train], epoch: 11/50, iter: 500/834, loss: 0.36189, top1: 0.52917, throughput: 1328.67 | 2022-05-21 11:51:54.692 [rank:0] [train], epoch: 11/50, iter: 600/834, loss: 0.36071, top1: 0.54068, throughput: 1325.72 | 2022-05-21 11:52:09.175 [rank:7] [train], epoch: 11/50, iter: 600/834, loss: 0.36263, top1: 0.53568, throughput: 1325.75 | 2022-05-21 11:52:09.174 [rank:6] [train], epoch: 11/50, iter: 600/834, loss: 0.36149, top1: 0.54125, throughput: 1325.77 | 2022-05-21 11:52:09.175 [rank:4] [train], epoch: 11/50, iter: 600/834, loss: 0.35969, top1: 0.54151, throughput: 1325.55 | 2022-05-21 11:52:09.177 [rank:5] [train], epoch: 11/50, iter: 600/834, loss: 0.36173, top1: 0.53859, throughput: 1325.52 | 2022-05-21 11:52:09.177 [rank:2] [train], epoch: 11/50, iter: 600/834, loss: 0.35797, top1: 0.54583, throughput: 1325.57 | 2022-05-21 11:52:09.177 [rank:3] [train], epoch: 11/50, iter: 600/834, loss: 0.35931, top1: 0.53734, throughput: 1325.25 | 2022-05-21 11:52:09.180 [rank:1] [train], epoch: 11/50, iter: 600/834, loss: 0.36189, top1: 0.53724, throughput: 1325.29 | 2022-05-21 11:52:09.180 [rank:5] [train], epoch: 11/50, iter: 700/834, loss: 0.36186, top1: 0.53964, throughput: 1315.96 | 2022-05-21 11:52:23.767 [rank:3] [train], epoch: 11/50, iter: 700/834, loss: 0.36034, top1: 0.53911, throughput: 1316.24 | 2022-05-21 11:52:23.767 [rank:1] [train], epoch: 11/50, iter: 700/834, loss: 0.36067, top1: 0.53854, throughput: 1316.14 | 2022-05-21 11:52:23.768 [rank:4] [train], epoch: 11/50, iter: 700/834, loss: 0.36427, top1: 0.53266, throughput: 1315.77 | 2022-05-21 11:52:23.769 [rank:6] [train], epoch: 11/50, iter: 700/834, loss: 0.35866, top1: 0.53948, throughput: 1315.54 | 2022-05-21 11:52:23.770 [rank:0] [train], epoch: 11/50, iter: 700/834, loss: 0.36302, top1: 0.53740, throughput: 1315.66 | 2022-05-21 11:52:23.769 [rank:7] [train], epoch: 11/50, iter: 700/834, loss: 0.36246, top1: 0.53740, throughput: 1315.51 | 2022-05-21 11:52:23.769 [rank:2] [train], epoch: 11/50, iter: 700/834, loss: 0.36442, top1: 0.53615, throughput: 1315.83 | 2022-05-21 11:52:23.768 [rank:7] [train], epoch: 11/50, iter: 800/834, loss: 0.36048, top1: 0.53953, throughput: 1325.56 | 2022-05-21 11:52:38.254 [rank:4] [train], epoch: 11/50, iter: 800/834, loss: 0.36162, top1: 0.53708, throughput: 1325.53 | 2022-05-21 11:52:38.254 [rank:5] [train], epoch: 11/50, iter: 800/834, loss: 0.36151, top1: 0.53568, throughput: 1325.32 | 2022-05-21 11:52:38.254 [rank:2] [train], epoch: 11/50, iter: 800/834, loss: 0.36486, top1: 0.53276, throughput: 1325.49 | 2022-05-21 11:52:38.254 [rank:6] [train], epoch: 11/50, iter: 800/834, loss: 0.36180, top1: 0.53646, throughput: 1325.53 | 2022-05-21 11:52:38.255 [rank:3] [train], epoch: 11/50, iter: 800/834, loss: 0.36107, top1: 0.53849, throughput: 1325.29 | 2022-05-21 11:52:38.254 [rank:1] [train], epoch: 11/50, iter: 800/834, loss: 0.36190, top1: 0.53953, throughput: 1325.19 | 2022-05-21 11:52:38.256 [rank:0] [train], epoch: 11/50, iter: 800/834, loss: 0.36086, top1: 0.53703, throughput: 1325.30 | 2022-05-21 11:52:38.256 [rank:6] [train], epoch: 11/50, iter: 834/834, loss: 0.36291, top1: 0.53707, throughput: 1327.75 | 2022-05-21 11:52:43.171 [rank:5] [train], epoch: 11/50, iter: 834/834, loss: 0.36386, top1: 0.53171, throughput: 1327.59 | 2022-05-21 11:52:43.171 [rank:3] [train], epoch: 11/50, iter: 834/834, loss: 0.36219, top1: 0.54182, throughput: 1327.47 | 2022-05-21 11:52:43.172 [rank:1] [train], epoch: 11/50, iter: 834/834, loss: 0.35803, top1: 0.54305, throughput: 1327.86 | 2022-05-21 11:52:43.172 [rank:7] [train], epoch: 11/50, iter: 834/834, loss: 0.36288, top1: 0.52727, throughput: 1326.70 | 2022-05-21 11:52:43.174 [rank:4] [train], epoch: 11/50, iter: 834/834, loss: 0.35889, top1: 0.54335, throughput: 1326.68 | 2022-05-21 11:52:43.174 [rank:0] [train], epoch: 11/50, iter: 834/834, loss: 0.36153, top1: 0.53094, throughput: 1326.91 | 2022-05-21 11:52:43.176 [rank:2] [train], epoch: 11/50, iter: 834/834, loss: 0.35958, top1: 0.54427, throughput: 1326.14 | 2022-05-21 11:52:43.176 [rank:7] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.53248, throughput: 557.88 | 2022-05-21 11:52:54.377 [rank:4] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.52960, throughput: 557.72 | 2022-05-21 11:52:54.381 [rank:0] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.53392, throughput: 557.73 | 2022-05-21 11:52:54.382 [rank:2] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.51632, throughput: 553.56 | 2022-05-21 11:52:54.467 [rank:1] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.52528, throughput: 552.09 | 2022-05-21 11:52:54.493 [rank:3] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.51616, throughput: 551.88 | 2022-05-21 11:52:54.496 [rank:6] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.52640, throughput: 551.17 | 2022-05-21 11:52:54.511 [rank:5] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.52256, throughput: 545.55 | 2022-05-21 11:52:54.627 [rank:7] [train], epoch: 12/50, iter: 100/834, loss: 0.35784, top1: 0.54661, throughput: 1310.19 | 2022-05-21 11:53:09.031 [rank:0] [train], epoch: 12/50, iter: 100/834, loss: 0.35810, top1: 0.54245, throughput: 1310.52 | 2022-05-21 11:53:09.033 [rank:3] [train], epoch: 12/50, iter: 100/834, loss: 0.35545, top1: 0.55141, throughput: 1320.91 | 2022-05-21 11:53:09.032 [rank:5] [train], epoch: 12/50, iter: 100/834, loss: 0.35427, top1: 0.55411, throughput: 1332.93 | 2022-05-21 11:53:09.032 [rank:2] [train], epoch: 12/50, iter: 100/834, loss: 0.35394, top1: 0.55078, throughput: 1318.24 | 2022-05-21 11:53:09.032 [rank:1] [train], epoch: 12/50, iter: 100/834, loss: 0.35540, top1: 0.54974, throughput: 1320.52 | 2022-05-21 11:53:09.033 [rank:6] [train], epoch: 12/50, iter: 100/834, loss: 0.35772, top1: 0.53943, throughput: 1321.98 | 2022-05-21 11:53:09.034 [rank:4] [train], epoch: 12/50, iter: 100/834, loss: 0.35941, top1: 0.54161, throughput: 1310.32 | 2022-05-21 11:53:09.034 [rank:5] [train], epoch: 12/50, iter: 200/834, loss: 0.35778, top1: 0.54036, throughput: 1328.62[rank:2] [train], epoch: 12/50, iter: 200/834, loss: 0.35793, top1: 0.54010, throughput: 1328.64 | 2022-05-21 11:53:23.483 | 2022-05-21 11:53:23.483 [rank:0] [train], epoch: 12/50, iter: 200/834, loss: 0.35291, top1: 0.55208, throughput: 1328.72 | 2022-05-21 11:53:23.483 [rank:1] [train], epoch: 12/50, iter: 200/834, loss: 0.35770, top1: 0.54266, throughput: 1328.61 | 2022-05-21 11:53:23.484 [rank:7] [train], epoch: 12/50, iter: 200/834, loss: 0.35693, top1: 0.54760, throughput: 1328.57[rank:4] [train], epoch: 12/50, iter: 200/834, loss: 0.35595, top1: 0.54714, throughput: 1328.67 | 2022-05-21 11:53:23.483 | 2022-05-21 11:53:23.484 [rank:3] [train], epoch: 12/50, iter: 200/834, loss: 0.35867, top1: 0.54198, throughput: 1328.51 | 2022-05-21 11:53:23.484 [rank:6] [train], epoch: 12/50, iter: 200/834, loss: 0.35835, top1: 0.54286, throughput: 1328.63 | 2022-05-21 11:53:23.485 [rank:5] [train], epoch: 12/50, iter: 300/834, loss: 0.35625, top1: 0.54964, throughput: 1330.42 | 2022-05-21 11:53:37.914 [rank:2] [train], epoch: 12/50, iter: 300/834, loss: 0.36105, top1: 0.53990, throughput: 1330.37 | 2022-05-21 11:53:37.915 [rank:6] [train], epoch: 12/50, iter: 300/834, loss: 0.35820, top1: 0.54531, throughput: 1330.62 | 2022-05-21 11:53:37.915 [rank:4] [train], epoch: 12/50, iter: 300/834, loss: 0.35558, top1: 0.54922, throughput: 1330.51 | 2022-05-21 11:53:37.915 [rank:0] [train], epoch: 12/50, iter: 300/834, loss: 0.35401, top1: 0.54724, throughput: 1330.37 | 2022-05-21 11:53:37.915 [rank:7] [train], epoch: 12/50, iter: 300/834, loss: 0.35676, top1: 0.54464, throughput: 1330.41 | 2022-05-21 11:53:37.915 [rank:3] [train], epoch: 12/50, iter: 300/834, loss: 0.35943, top1: 0.54135, throughput: 1330.36 | 2022-05-21 11:53:37.916 [rank:1] [train], epoch: 12/50, iter: 300/834, loss: 0.36013, top1: 0.53943, throughput: 1330.33 | 2022-05-21 11:53:37.916 [rank:5] [train], epoch: 12/50, iter: 400/834, loss: 0.35567, top1: 0.54620, throughput: 1326.88 | 2022-05-21 11:53:52.384 [rank:7] [train], epoch: 12/50, iter: 400/834, loss: 0.35751, top1: 0.54219, throughput: 1326.87 | 2022-05-21 11:53:52.385 [rank:1] [train], epoch: 12/50, iter: 400/834, loss: 0.35612, top1: 0.54484, throughput: 1326.95 | 2022-05-21 11:53:52.386 [rank:6] [train], epoch: 12/50, iter: 400/834, loss: 0.35774, top1: 0.54245, throughput: 1326.80 | 2022-05-21 11:53:52.385 [rank:4] [train], epoch: 12/50, iter: 400/834, loss: 0.35848, top1: 0.54146, throughput: 1326.85 | 2022-05-21 11:53:52.385 [rank:0] [train], epoch: 12/50, iter: 400/834, loss: 0.35626, top1: 0.54776, throughput: 1326.64 | 2022-05-21 11:53:52.387 [rank:3] [train], epoch: 12/50, iter: 400/834, loss: 0.35663, top1: 0.54734, throughput: 1326.86 | 2022-05-21 11:53:52.387 [rank:2] [train], epoch: 12/50, iter: 400/834, loss: 0.35861, top1: 0.54302, throughput: 1326.79 | 2022-05-21 11:53:52.386 [rank:7] [train], epoch: 12/50, iter: 500/834, loss: 0.35789, top1: 0.54719, throughput: 1327.51 | 2022-05-21 11:54:06.848 [rank:1] [train], epoch: 12/50, iter: 500/834, loss: 0.35748, top1: 0.54526, throughput: 1327.53 | 2022-05-21 11:54:06.849 [rank:3] [train], epoch: 12/50, iter: 500/834, loss: 0.35822, top1: 0.54161, throughput: 1327.69 | 2022-05-21 11:54:06.848 [rank:6] [train], epoch: 12/50, iter: 500/834, loss: 0.35954, top1: 0.54427, throughput: 1327.60 | 2022-05-21 11:54:06.848 [rank:4] [train], epoch: 12/50, iter: 500/834, loss: 0.35667, top1: 0.54630, throughput: 1327.42 | 2022-05-21 11:54:06.849 [rank:5] [train], epoch: 12/50, iter: 500/834, loss: 0.35555, top1: 0.54990, throughput: 1327.33 | 2022-05-21 11:54:06.849 [rank:0] [train], epoch: 12/50, iter: 500/834, loss: 0.35802, top1: 0.54516, throughput: 1327.55 | 2022-05-21 11:54:06.850 [rank:2] [train], epoch: 12/50, iter: 500/834, loss: 0.35663, top1: 0.55099, throughput: 1327.45 | 2022-05-21 11:54:06.849 [rank:7] [train], epoch: 12/50, iter: 600/834, loss: 0.35664, top1: 0.54844, throughput: 1325.97 | 2022-05-21 11:54:21.328 [rank:0] [train], epoch: 12/50, iter: 600/834, loss: 0.35723, top1: 0.54526, throughput: 1326.10 | 2022-05-21 11:54:21.328 [rank:3] [train], epoch: 12/50, iter: 600/834, loss: 0.35759, top1: 0.54203, throughput: 1325.75 | 2022-05-21 11:54:21.330 [rank:2] [train], epoch: 12/50, iter: 600/834, loss: 0.35881, top1: 0.54219, throughput: 1326.10 | 2022-05-21 11:54:21.328 [rank:5] [train], epoch: 12/50, iter: 600/834, loss: 0.35892, top1: 0.53865, throughput: 1325.76 | 2022-05-21 11:54:21.332 [rank:1] [train], epoch: 12/50, iter: 600/834, loss: 0.35881, top1: 0.54615, throughput: 1325.80 | 2022-05-21 11:54:21.330 [rank:4] [train], epoch: 12/50, iter: 600/834, loss: 0.35970, top1: 0.53807, throughput: 1325.84 | 2022-05-21 11:54:21.331 [rank:6] [train], epoch: 12/50, iter: 600/834, loss: 0.36018, top1: 0.54172, throughput: 1325.51 | 2022-05-21 11:54:21.333 [rank:3] [train], epoch: 12/50, iter: 700/834, loss: 0.35690, top1: 0.54469, throughput: 1325.55 | 2022-05-21 11:54:35.815 [rank:0] [train], epoch: 12/50, iter: 700/834, loss: 0.35705, top1: 0.54469, throughput: 1325.30 [rank:4] [train], epoch: 12/50, iter: 700/834, loss: 0.36035, top1: 0.53865, throughput: 1325.53| 2022-05-21 11:54:35.816 | 2022-05-21 11:54:35.815 [rank:2] [train], epoch: 12/50, iter: 700/834, loss: 0.35911, top1: 0.54229, throughput: 1325.32 | 2022-05-21 11:54:35.815 [rank:6] [train], epoch: 12/50, iter: 700/834, loss: 0.35908, top1: 0.53896, throughput: 1325.50 | 2022-05-21 11:54:35.818 [rank:7] [train], epoch: 12/50, iter: 700/834, loss: 0.35739, top1: 0.54198, throughput: 1324.94 | 2022-05-21 11:54:35.819 [rank:5] [train], epoch: 12/50, iter: 700/834, loss: 0.35637, top1: 0.54839, throughput: 1325.46 | 2022-05-21 11:54:35.817 [rank:1] [train], epoch: 12/50, iter: 700/834, loss: 0.35496, top1: 0.54964, throughput: 1325.31 | 2022-05-21 11:54:35.818 [rank:4] [train], epoch: 12/50, iter: 800/834, loss: 0.35371, top1: 0.54792, throughput: 1328.57 | 2022-05-21 11:54:50.267 [rank:6] [train], epoch: 12/50, iter: 800/834, loss: 0.35854, top1: 0.54370, throughput: 1328.76 | 2022-05-21 11:54:50.267 [rank:3] [train], epoch: 12/50, iter: 800/834, loss: 0.35924, top1: 0.54000, throughput: 1328.45 | 2022-05-21 11:54:50.268[rank:2] [train], epoch: 12/50, iter: 800/834, loss: 0.35877, top1: 0.54323, throughput: 1328.47 | 2022-05-21 11:54:50.268 [rank:0] [train], epoch: 12/50, iter: 800/834, loss: 0.35938, top1: 0.54047, throughput: 1328.42 | 2022-05-21 11:54:50.269 [rank:1] [train], epoch: 12/50, iter: 800/834, loss: 0.35462, top1: 0.54917, throughput: 1328.60 | 2022-05-21 11:54:50.269 [rank:5] [train], epoch: 12/50, iter: 800/834, loss: 0.35492, top1: 0.54703, throughput: 1328.41 | 2022-05-21 11:54:50.271 [rank:7] [train], epoch: 12/50, iter: 800/834, loss: 0.35796, top1: 0.54146, throughput: 1328.58 | 2022-05-21 11:54:50.271 [rank:4] [train], epoch: 12/50, iter: 834/834, loss: 0.35631, top1: 0.55331, throughput: 1325.82 | 2022-05-21 11:54:55.191 [rank:2] [train], epoch: 12/50, iter: 834/834, loss: 0.35961, top1: 0.53814, throughput: 1326.10 | 2022-05-21 11:54:55.190 [rank:3] [train], epoch: 12/50, iter: 834/834, loss: 0.36197, top1: 0.53278, throughput: 1325.93 | 2022-05-21 11:54:55.191 [rank:6] [train], epoch: 12/50, iter: 834/834, loss: 0.36120, top1: 0.53646, throughput: 1325.71 | 2022-05-21 11:54:55.191 [rank:0] [train], epoch: 12/50, iter: 834/834, loss: 0.35761, top1: 0.55300, throughput: 1325.71 | 2022-05-21 11:54:55.193 [rank:5] [train], epoch: 12/50, iter: 834/834, loss: 0.35797, top1: 0.54442, throughput: 1326.03 | 2022-05-21 11:54:55.193 [rank:1] [train], epoch: 12/50, iter: 834/834, loss: 0.35400, top1: 0.55499, throughput: 1325.60 | 2022-05-21 11:54:55.193 [rank:7] [train], epoch: 12/50, iter: 834/834, loss: 0.35431, top1: 0.55254, throughput: 1324.45 | 2022-05-21 11:54:55.200 [rank:4] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.54560, throughput: 561.15 | 2022-05-21 11:55:06.328 [rank:7] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.54736, throughput: 561.53 | 2022-05-21 11:55:06.330 [rank:0] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.55632, throughput: 561.16 | 2022-05-21 11:55:06.331 [rank:2] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.54176, throughput: 560.80 | 2022-05-21 11:55:06.335 [rank:3] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.54240, throughput: 558.12 | 2022-05-21 11:55:06.389 [rank:6] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.54384, throughput: 557.64 | 2022-05-21 11:55:06.399 [rank:1] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.54336, throughput: 549.20 | 2022-05-21 11:55:06.574 [rank:5] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.53584, throughput: 547.83 | 2022-05-21 11:55:06.602 [rank:4] [train], epoch: 13/50, iter: 100/834, loss: 0.34903, top1: 0.56208, throughput: 1306.97 | 2022-05-21 11:55:21.019 [rank:2] [train], epoch: 13/50, iter: 100/834, loss: 0.35162, top1: 0.55724, throughput: 1307.56 | 2022-05-21 11:55:21.019 [rank:7] [train], epoch: 13/50, iter: 100/834, loss: 0.34691, top1: 0.56302, throughput: 1307.08 | 2022-05-21 11:55:21.019 [rank:1] [train], epoch: 13/50, iter: 100/834, loss: 0.35064, top1: 0.56177, throughput: 1328.97 | 2022-05-21 11:55:21.021 [rank:6] [train], epoch: 13/50, iter: 100/834, loss: 0.35252, top1: 0.55297, throughput: 1313.07 | 2022-05-21 11:55:21.022 [rank:3] [train], epoch: 13/50, iter: 100/834, loss: 0.35012, top1: 0.55854, throughput: 1312.15 | 2022-05-21 11:55:21.022 [rank:5] [train], epoch: 13/50, iter: 100/834, loss: 0.35400, top1: 0.54766, throughput: 1331.39 | 2022-05-21 11:55:21.023 [rank:0] [train], epoch: 13/50, iter: 100/834, loss: 0.35437, top1: 0.55089, throughput: 1306.93 | 2022-05-21 11:55:21.022 [rank:4] [train], epoch: 13/50, iter: 200/834, loss: 0.34952, top1: 0.55885, throughput: 1325.83 | 2022-05-21 11:55:35.500 [rank:5] [train], epoch: 13/50, iter: 200/834, loss: 0.35495, top1: 0.55167, throughput: 1326.23 | 2022-05-21 11:55:35.500 [rank:1] [train], epoch: 13/50, iter: 200/834, loss: 0.35104, top1: 0.55958, throughput: 1325.79 | 2022-05-21 11:55:35.503 [rank:3] [train], epoch: 13/50, iter: 200/834, loss: 0.35262, top1: 0.55365, throughput: 1325.94 | 2022-05-21 11:55:35.502 [rank:2] [train], epoch: 13/50, iter: 200/834, loss: 0.35261, top1: 0.55276, throughput: 1325.61 | 2022-05-21 11:55:35.503 [rank:6] [train], epoch: 13/50, iter: 200/834, loss: 0.35043, top1: 0.55766, throughput: 1325.70 | 2022-05-21 11:55:35.504 [rank:0] [train], epoch: 13/50, iter: 200/834, loss: 0.35662, top1: 0.54526, throughput: 1325.77 | 2022-05-21 11:55:35.504 [rank:7] [train], epoch: 13/50, iter: 200/834, loss: 0.35119, top1: 0.55625, throughput: 1325.49 | 2022-05-21 11:55:35.504 [rank:4] [train], epoch: 13/50, iter: 300/834, loss: 0.35098, top1: 0.55161, throughput: 1329.59 | 2022-05-21 11:55:49.941 [rank:7] [train], epoch: 13/50, iter: 300/834, loss: 0.35252, top1: 0.55375, throughput: 1329.91 | 2022-05-21 11:55:49.941 [rank:6] [train], epoch: 13/50, iter: 300/834, loss: 0.35488, top1: 0.55365, throughput: 1329.89 | 2022-05-21 11:55:49.942 [rank:2] [train], epoch: 13/50, iter: 300/834, loss: 0.35545, top1: 0.54521, throughput: 1329.74 | 2022-05-21 11:55:49.942 [rank:0] [train], epoch: 13/50, iter: 300/834, loss: 0.35561, top1: 0.54604, throughput: 1329.62 | 2022-05-21 11:55:49.944 [rank:5] [train], epoch: 13/50, iter: 300/834, loss: 0.35268, top1: 0.55318, throughput: 1329.33 | 2022-05-21 11:55:49.944 [rank:1] [train], epoch: 13/50, iter: 300/834, loss: 0.35332, top1: 0.55313, throughput: 1329.58 | 2022-05-21 11:55:49.943 [rank:3] [train], epoch: 13/50, iter: 300/834, loss: 0.35620, top1: 0.55198, throughput: 1329.47 | 2022-05-21 11:55:49.944 [rank:0] [train], epoch: 13/50, iter: 400/834, loss: 0.35327, top1: 0.55531, throughput: 1327.81 | 2022-05-21 11:56:04.404 [rank:4] [train], epoch: 13/50, iter: 400/834, loss: 0.35541, top1: 0.55052, throughput: 1327.50 | 2022-05-21 11:56:04.404 [rank:5] [train], epoch: 13/50, iter: 400/834, loss: 0.35389, top1: 0.55031, throughput: 1327.77 | 2022-05-21 11:56:04.404 [rank:7] [train], epoch: 13/50, iter: 400/834, loss: 0.35143, top1: 0.55604, throughput: 1327.56 | 2022-05-21 11:56:04.404 [rank:6] [train], epoch: 13/50, iter: 400/834, loss: 0.35467, top1: 0.55250, throughput: 1327.57 | 2022-05-21 11:56:04.404 [rank:3] [train], epoch: 13/50, iter: 400/834, loss: 0.35297, top1: 0.55417, throughput: 1327.75 | 2022-05-21 11:56:04.404 [rank:2] [train], epoch: 13/50, iter: 400/834, loss: 0.35566, top1: 0.54698, throughput: 1327.61 | 2022-05-21 11:56:04.404 [rank:1] [train], epoch: 13/50, iter: 400/834, loss: 0.35535, top1: 0.55161, throughput: 1327.72 | 2022-05-21 11:56:04.404 [rank:2] [train], epoch: 13/50, iter: 500/834, loss: 0.35173, top1: 0.55375, throughput: 1325.25 | 2022-05-21 11:56:18.892 [rank:4] [train], epoch: 13/50, iter: 500/834, loss: 0.35403, top1: 0.54557, throughput: 1325.30 | 2022-05-21 11:56:18.892 [rank:5] [train], epoch: 13/50, iter: 500/834, loss: 0.35332, top1: 0.55172, throughput: 1325.30 | 2022-05-21 11:56:18.891 [rank:0] [train], epoch: 13/50, iter: 500/834, loss: 0.35357, top1: 0.55078, throughput: 1325.20 | 2022-05-21 11:56:18.892 [rank:3] [train], epoch: 13/50, iter: 500/834, loss: 0.35332, top1: 0.55198, throughput: 1325.21 | 2022-05-21 11:56:18.893 [rank:6] [train], epoch: 13/50, iter: 500/834, loss: 0.35087, top1: 0.55948, throughput: 1325.13 | 2022-05-21 11:56:18.893 [rank:7] [train], epoch: 13/50, iter: 500/834, loss: 0.35474, top1: 0.54573, throughput: 1325.14 | 2022-05-21 11:56:18.893 [rank:1] [train], epoch: 13/50, iter: 500/834, loss: 0.35574, top1: 0.54781, throughput: 1325.14 | 2022-05-21 11:56:18.893 [rank:7] [train], epoch: 13/50, iter: 600/834, loss: 0.35459, top1: 0.54771, throughput: 1326.36 | 2022-05-21 11:56:33.369 [rank:3] [train], epoch: 13/50, iter: 600/834, loss: 0.35321, top1: 0.54901, throughput: 1326.36 | 2022-05-21 11:56:33.368 [rank:4] [train], epoch: 13/50, iter: 600/834, loss: 0.35234, top1: 0.55224, throughput: 1326.20 | 2022-05-21 11:56:33.369 [rank:5] [train], epoch: 13/50, iter: 600/834, loss: 0.35267, top1: 0.55167, throughput: 1326.02 | 2022-05-21 11:56:33.371 [rank:2] [train], epoch: 13/50, iter: 600/834, loss: 0.35180, top1: 0.55786, throughput: 1326.16 | 2022-05-21 11:56:33.370 [rank:1] [train], epoch: 13/50, iter: 600/834, loss: 0.35357, top1: 0.55234, throughput: 1326.20 | 2022-05-21 11:56:33.371 [rank:0] [train], epoch: 13/50, iter: 600/834, loss: 0.35407, top1: 0.54953, throughput: 1326.07 | 2022-05-21 11:56:33.371 [rank:6] [train], epoch: 13/50, iter: 600/834, loss: 0.35316, top1: 0.54724, throughput: 1326.12 | 2022-05-21 11:56:33.372 [rank:6] [train], epoch: 13/50, iter: 700/834, loss: 0.35255, top1: 0.55260, throughput: 1321.15 | 2022-05-21 11:56:47.905 [rank:3] [train], epoch: 13/50, iter: 700/834, loss: 0.35387, top1: 0.55365, throughput: 1321.02 | 2022-05-21 11:56:47.903 [rank:2] [train], epoch: 13/50, iter: 700/834, loss: 0.35624, top1: 0.54693, throughput: 1321.03 | 2022-05-21 11:56:47.904[rank:0] [train], epoch: 13/50, iter: 700/834, loss: 0.35181, top1: 0.55698, throughput: 1321.14 | 2022-05-21 11:56:47.904 [rank:5] [train], epoch: 13/50, iter: 700/834, loss: 0.35429, top1: 0.55750, throughput: 1321.19 | 2022-05-21 11:56:47.903 [rank:7] [train], epoch: 13/50, iter: 700/834, loss: 0.35371, top1: 0.55187, throughput: 1320.94 | 2022-05-21 11:56:47.904 [rank:1] [train], epoch: 13/50, iter: 700/834, loss: 0.35440, top1: 0.54906, throughput: 1321.14 | 2022-05-21 11:56:47.904 [rank:4] [train], epoch: 13/50, iter: 700/834, loss: 0.35415, top1: 0.55281, throughput: 1320.90 | 2022-05-21 11:56:47.905 [rank:7] [train], epoch: 13/50, iter: 800/834, loss: 0.35138, top1: 0.55510, throughput: 1327.72 | 2022-05-21 11:57:02.365 [rank:6] [train], epoch: 13/50, iter: 800/834, loss: 0.35258, top1: 0.55266, throughput: 1327.82 | 2022-05-21 11:57:02.364 [rank:5] [train], epoch: 13/50, iter: 800/834, loss: 0.35229, top1: 0.55375, throughput: 1327.68 | 2022-05-21 11:57:02.365 [rank:1] [train], epoch: 13/50, iter: 800/834, loss: 0.35501, top1: 0.54833, throughput: 1327.67 | 2022-05-21 11:57:02.365 [rank:3] [train], epoch: 13/50, iter: 800/834, loss: 0.35492, top1: 0.55130, throughput: 1327.45 | 2022-05-21 11:57:02.366 [rank:2] [train], epoch: 13/50, iter: 800/834, loss: 0.35429, top1: 0.54812, throughput: 1327.55 | 2022-05-21 11:57:02.366 [rank:0] [train], epoch: 13/50, iter: 800/834, loss: 0.35631, top1: 0.54781, throughput: 1327.54 | 2022-05-21 11:57:02.367 [rank:4] [train], epoch: 13/50, iter: 800/834, loss: 0.35314, top1: 0.54979, throughput: 1327.61 | 2022-05-21 11:57:02.367 [rank:4] [train], epoch: 13/50, iter: 834/834, loss: 0.35278, top1: 0.55423, throughput: 1324.90 | 2022-05-21 11:57:07.294 [rank:5] [train], epoch: 13/50, iter: 834/834, loss: 0.35383, top1: 0.54703, throughput: 1324.34 | 2022-05-21 11:57:07.294 [rank:0] [train], epoch: 13/50, iter: 834/834, loss: 0.35343, top1: 0.55392, throughput: 1324.92 | 2022-05-21 11:57:07.294 [rank:7] [train], epoch: 13/50, iter: 834/834, loss: 0.35041, top1: 0.55790, throughput: 1324.26 | 2022-05-21 11:57:07.294 [rank:6] [train], epoch: 13/50, iter: 834/834, loss: 0.34706, top1: 0.56342, throughput: 1323.95 | 2022-05-21 11:57:07.295 [rank:3] [train], epoch: 13/50, iter: 834/834, loss: 0.35184, top1: 0.56235, throughput: 1324.57 | 2022-05-21 11:57:07.295 [rank:2] [train], epoch: 13/50, iter: 834/834, loss: 0.35400, top1: 0.54488, throughput: 1324.20 | 2022-05-21 11:57:07.296 [rank:1] [train], epoch: 13/50, iter: 834/834, loss: 0.35423, top1: 0.55331, throughput: 1323.36 | 2022-05-21 11:57:07.298 [rank:7] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.56480, throughput: 559.71 | 2022-05-21 11:57:18.461 [rank:0] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.56672, throughput: 559.50 | 2022-05-21 11:57:18.465 [rank:4] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.55520, throughput: 559.44 | 2022-05-21 11:57:18.466 [rank:6] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.56848, throughput: 559.45 | 2022-05-21 11:57:18.467 [rank:2] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.55248, throughput: 559.01 | 2022-05-21 11:57:18.477 [rank:1] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.56464, throughput: 556.22 | 2022-05-21 11:57:18.534 [rank:3] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.55760, throughput: 556.04 | 2022-05-21 11:57:18.535 [rank:5] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.55424, throughput: 545.43 | 2022-05-21 11:57:18.753 [rank:6] [train], epoch: 14/50, iter: 100/834, loss: 0.34985, top1: 0.55766, throughput: 1306.53 | 2022-05-21 11:57:33.162 [rank:4] [train], epoch: 14/50, iter: 100/834, loss: 0.34542, top1: 0.57115, throughput: 1306.43 | 2022-05-21 11:57:33.162 [rank:2] [train], epoch: 14/50, iter: 100/834, loss: 0.34829, top1: 0.56208, throughput: 1307.42 | 2022-05-21 11:57:33.162 [rank:5] [train], epoch: 14/50, iter: 100/834, loss: 0.34732, top1: 0.56073, throughput: 1332.34 | 2022-05-21 11:57:33.163 [rank:3] [train], epoch: 14/50, iter: 100/834, loss: 0.35154, top1: 0.55630, throughput: 1312.42 | 2022-05-21 11:57:33.164 [rank:1] [train], epoch: 14/50, iter: 100/834, loss: 0.34875, top1: 0.56193, throughput: 1312.35 | 2022-05-21 11:57:33.165 [rank:0] [train], epoch: 14/50, iter: 100/834, loss: 0.34575, top1: 0.56375, throughput: 1306.08 | 2022-05-21 11:57:33.165 [rank:7] [train], epoch: 14/50, iter: 100/834, loss: 0.34718, top1: 0.56302, throughput: 1305.81 | 2022-05-21 11:57:33.164 [rank:2] [train], epoch: 14/50, iter: 200/834, loss: 0.35109, top1: 0.55656, throughput: 1324.87 | 2022-05-21 11:57:47.654 [rank:5] [train], epoch: 14/50, iter: 200/834, loss: 0.34762, top1: 0.56292, throughput: 1325.06 | 2022-05-21 11:57:47.653 [rank:4] [train], epoch: 14/50, iter: 200/834, loss: 0.34748, top1: 0.56354, throughput: 1324.84 | 2022-05-21 11:57:47.655 [rank:3] [train], epoch: 14/50, iter: 200/834, loss: 0.34724, top1: 0.56302, throughput: 1324.83 | 2022-05-21 11:57:47.657 [rank:0] [train], epoch: 14/50, iter: 200/834, loss: 0.34907, top1: 0.56271, throughput: 1324.97[rank:6] [train], epoch: 14/50, iter: 200/834, loss: 0.34865, top1: 0.55729, throughput: 1324.72 | 2022-05-21 11:57:47.656 | 2022-05-21 11:57:47.656 [rank:7] [train], epoch: 14/50, iter: 200/834, loss: 0.35140, top1: 0.55536, throughput: 1324.94 | 2022-05-21 11:57:47.656 [rank:1] [train], epoch: 14/50, iter: 200/834, loss: 0.34971, top1: 0.56042, throughput: 1324.89 | 2022-05-21 11:57:47.656 [rank:6] [train], epoch: 14/50, iter: 300/834, loss: 0.34810, top1: 0.56406, throughput: 1330.23 | 2022-05-21 11:58:02.089 [rank:5] [train], epoch: 14/50, iter: 300/834, loss: 0.34976, top1: 0.55885, throughput: 1329.97 | 2022-05-21 11:58:02.090 [rank:3] [train], epoch: 14/50, iter: 300/834, loss: 0.35011, top1: 0.55422, throughput: 1330.33 | 2022-05-21 11:58:02.089 [rank:4] [train], epoch: 14/50, iter: 300/834, loss: 0.34990, top1: 0.55995, throughput: 1330.01 | 2022-05-21 11:58:02.091 [rank:0] [train], epoch: 14/50, iter: 300/834, loss: 0.34863, top1: 0.56026, throughput: 1330.05 | 2022-05-21 11:58:02.092 [rank:7] [train], epoch: 14/50, iter: 300/834, loss: 0.35057, top1: 0.55625, throughput: 1330.18 | 2022-05-21 11:58:02.090 [rank:1] [train], epoch: 14/50, iter: 300/834, loss: 0.34999, top1: 0.55937, throughput: 1330.06[rank:2] [train], epoch: 14/50, iter: 300/834, loss: 0.34937, top1: 0.55969, throughput: 1329.85 | 2022-05-21 11:58:02.092 | 2022-05-21 11:58:02.092 [rank:4] [train], epoch: 14/50, iter: 400/834, loss: 0.34686, top1: 0.56187, throughput: 1329.33 | 2022-05-21 11:58:16.534 [rank:7] [train], epoch: 14/50, iter: 400/834, loss: 0.35204, top1: 0.55370, throughput: 1329.23 | 2022-05-21 11:58:16.534 [rank:1] [train], epoch: 14/50, iter: 400/834, loss: 0.34935, top1: 0.56234, throughput: 1329.42 | 2022-05-21 11:58:16.534 [rank:5] [train], epoch: 14/50, iter: 400/834, loss: 0.35045, top1: 0.56057, throughput: 1329.18 | 2022-05-21 11:58:16.535 [rank:3] [train], epoch: 14/50, iter: 400/834, loss: 0.34674, top1: 0.55786, throughput: 1329.21 | 2022-05-21 11:58:16.534 [rank:6] [train], epoch: 14/50, iter: 400/834, loss: 0.35020, top1: 0.55964, throughput: 1328.99 | 2022-05-21 11:58:16.536 [rank:2] [train], epoch: 14/50, iter: 400/834, loss: 0.35085, top1: 0.55943, throughput: 1329.23 | 2022-05-21 11:58:16.536 [rank:0] [train], epoch: 14/50, iter: 400/834, loss: 0.34961, top1: 0.55573, throughput: 1329.15 | 2022-05-21 11:58:16.537 [rank:3] [train], epoch: 14/50, iter: 500/834, loss: 0.34957, top1: 0.56318, throughput: 1329.55 | 2022-05-21 11:58:30.975 [rank:7] [train], epoch: 14/50, iter: 500/834, loss: 0.35085, top1: 0.55667, throughput: 1329.61 | 2022-05-21 11:58:30.975 [rank:0] [train], epoch: 14/50, iter: 500/834, loss: 0.34768, top1: 0.56188, throughput: 1329.75 | 2022-05-21 11:58:30.976 [rank:1] [train], epoch: 14/50, iter: 500/834, loss: 0.34860, top1: 0.56036, throughput: 1329.49 | 2022-05-21 11:58:30.976 [rank:5] [train], epoch: 14/50, iter: 500/834, loss: 0.35254, top1: 0.55401, throughput: 1329.60 | 2022-05-21 11:58:30.975[rank:6] [train], epoch: 14/50, iter: 500/834, loss: 0.35191, top1: 0.55307, throughput: 1329.73 | 2022-05-21 11:58:30.976 [rank:4] [train], epoch: 14/50, iter: 500/834, loss: 0.34900, top1: 0.55615, throughput: 1329.44 | 2022-05-21 11:58:30.976 [rank:2] [train], epoch: 14/50, iter: 500/834, loss: 0.35167, top1: 0.55620, throughput: 1329.72 | 2022-05-21 11:58:30.975 [rank:7] [train], epoch: 14/50, iter: 600/834, loss: 0.35142, top1: 0.55365, throughput: 1322.72 | 2022-05-21 11:58:45.490 [rank:1] [train], epoch: 14/50, iter: 600/834, loss: 0.34954, top1: 0.55729, throughput: 1322.71 | 2022-05-21 11:58:45.491 [rank:3] [train], epoch: 14/50, iter: 600/834, loss: 0.34929, top1: 0.56005, throughput: 1322.68 | 2022-05-21 11:58:45.491 [rank:6] [train], epoch: 14/50, iter: 600/834, loss: 0.35140, top1: 0.55297, throughput: 1322.57[rank:5] [train], epoch: 14/50, iter: 600/834, loss: 0.35058, top1: 0.55807, throughput: 1322.52 | 2022-05-21 11:58:45.493 | 2022-05-21 11:58:45.493 [rank:4] [train], epoch: 14/50, iter: 600/834, loss: 0.35186, top1: 0.56198, throughput: 1322.58 | 2022-05-21 11:58:45.493 [rank:2] [train], epoch: 14/50, iter: 600/834, loss: 0.35225, top1: 0.55437, throughput: 1322.63 | 2022-05-21 11:58:45.492 [rank:0] [train], epoch: 14/50, iter: 600/834, loss: 0.35272, top1: 0.55271, throughput: 1322.56 | 2022-05-21 11:58:45.493 [rank:6] [train], epoch: 14/50, iter: 700/834, loss: 0.35085, top1: 0.55526, throughput: 1315.25 | 2022-05-21 11:59:00.091 [rank:5] [train], epoch: 14/50, iter: 700/834, loss: 0.35307, top1: 0.54891, throughput: 1315.25 | 2022-05-21 11:59:00.091 [rank:1] [train], epoch: 14/50, iter: 700/834, loss: 0.35277, top1: 0.55573, throughput: 1315.05 | 2022-05-21 11:59:00.092 [rank:7] [train], epoch: 14/50, iter: 700/834, loss: 0.35008, top1: 0.55672, throughput: 1315.02 | 2022-05-21 11:59:00.091 [rank:2] [train], epoch: 14/50, iter: 700/834, loss: 0.35073, top1: 0.55932, throughput: 1315.11 | 2022-05-21 11:59:00.091[rank:3] [train], epoch: 14/50, iter: 700/834, loss: 0.35092, top1: 0.55813, throughput: 1314.93 | 2022-05-21 11:59:00.092 [rank:4] [train], epoch: 14/50, iter: 700/834, loss: 0.35176, top1: 0.55521, throughput: 1315.08 | 2022-05-21 11:59:00.093 [rank:0] [train], epoch: 14/50, iter: 700/834, loss: 0.35105, top1: 0.55854, throughput: 1315.05 | 2022-05-21 11:59:00.093 [rank:5] [train], epoch: 14/50, iter: 800/834, loss: 0.34958, top1: 0.56057, throughput: 1328.31 | 2022-05-21 11:59:14.545 [rank:4] [train], epoch: 14/50, iter: 800/834, loss: 0.35022, top1: 0.55875, throughput: 1328.52 | 2022-05-21 11:59:14.545 [rank:3] [train], epoch: 14/50, iter: 800/834, loss: 0.35049, top1: 0.55969, throughput: 1328.39 | 2022-05-21 11:59:14.546 [rank:2] [train], epoch: 14/50, iter: 800/834, loss: 0.35166, top1: 0.56120, throughput: 1328.22 | 2022-05-21 11:59:14.547 [rank:7] [train], epoch: 14/50, iter: 800/834, loss: 0.35016, top1: 0.55724, throughput: 1328.22 | 2022-05-21 11:59:14.546 [rank:1] [train], epoch: 14/50, iter: 800/834, loss: 0.34941, top1: 0.55766, throughput: 1328.23 | 2022-05-21 11:59:14.547 [rank:6] [train], epoch: 14/50, iter: 800/834, loss: 0.34545, top1: 0.56734, throughput: 1328.14 | 2022-05-21 11:59:14.547 [rank:0] [train], epoch: 14/50, iter: 800/834, loss: 0.35031, top1: 0.55599, throughput: 1328.48 | 2022-05-21 11:59:14.546 [rank:5] [train], epoch: 14/50, iter: 834/834, loss: 0.35064, top1: 0.56158, throughput: 1312.91 | 2022-05-21 11:59:19.517 [rank:3] [train], epoch: 14/50, iter: 834/834, loss: 0.34876, top1: 0.55423, throughput: 1313.14 | 2022-05-21 11:59:19.517 [rank:1] [train], epoch: 14/50, iter: 834/834, loss: 0.34896, top1: 0.56173, throughput: 1313.10 | 2022-05-21 11:59:19.518 [rank:4] [train], epoch: 14/50, iter: 834/834, loss: 0.34857, top1: 0.55744, throughput: 1312.61 | 2022-05-21 11:59:19.519 [rank:7] [train], epoch: 14/50, iter: 834/834, loss: 0.35114, top1: 0.54779, throughput: 1312.70 | 2022-05-21 11:59:19.519 [rank:6] [train], epoch: 14/50, iter: 834/834, loss: 0.35183, top1: 0.55362, throughput: 1312.58[rank:2] [train], epoch: 14/50, iter: 834/834, loss: 0.35095, top1: 0.55162, throughput: 1312.67 | 2022-05-21 11:59:19.520| 2022-05-21 11:59:19.520 [rank:0] [train], epoch: 14/50, iter: 834/834, loss: 0.35401, top1: 0.54764, throughput: 1312.11 | 2022-05-21 11:59:19.521 [rank:0] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.57712, throughput: 556.87 | 2022-05-21 11:59:30.744 [rank:7] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.56880, throughput: 556.77 | 2022-05-21 11:59:30.744 [rank:4] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.56656, throughput: 556.56 | 2022-05-21 11:59:30.748 [rank:2] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.56368, throughput: 553.88 | 2022-05-21 11:59:30.804 [rank:6] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.56624, throughput: 552.64 | 2022-05-21 11:59:30.830 [rank:3] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.56032, throughput: 548.82 | 2022-05-21 11:59:30.905 [rank:1] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.57552, throughput: 543.92 | 2022-05-21 11:59:31.009 [rank:5] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.55744, throughput: 539.27 | 2022-05-21 11:59:31.107 [rank:2] [train], epoch: 15/50, iter: 100/834, loss: 0.34245, top1: 0.57016, throughput: 1304.25 | 2022-05-21 11:59:45.525 [rank:5] [train], epoch: 15/50, iter: 100/834, loss: 0.34080, top1: 0.57422, throughput: 1331.59 | 2022-05-21 11:59:45.526 [rank:7] [train], epoch: 15/50, iter: 100/834, loss: 0.34199, top1: 0.57214, throughput: 1298.90 | 2022-05-21 11:59:45.526 [rank:6] [train], epoch: 15/50, iter: 100/834, loss: 0.34224, top1: 0.57318, throughput: 1306.46 | 2022-05-21 11:59:45.526 [rank:3] [train], epoch: 15/50, iter: 100/834, loss: 0.34139, top1: 0.57370, throughput: 1313.24 | 2022-05-21 11:59:45.526 [rank:4] [train], epoch: 15/50, iter: 100/834, loss: 0.34398, top1: 0.57182, throughput: 1299.27 | 2022-05-21 11:59:45.526 [rank:1] [train], epoch: 15/50, iter: 100/834, loss: 0.34378, top1: 0.56609, throughput: 1322.63 | 2022-05-21 11:59:45.526 [rank:0] [train], epoch: 15/50, iter: 100/834, loss: 0.34123, top1: 0.56589, throughput: 1298.92 | 2022-05-21 11:59:45.526 [rank:5] [train], epoch: 15/50, iter: 200/834, loss: 0.34686, top1: 0.56469, throughput: 1328.30 | 2022-05-21 11:59:59.980 [rank:0] [train], epoch: 15/50, iter: 200/834, loss: 0.34514, top1: 0.56411, throughput: 1328.07 | 2022-05-21 11:59:59.983 [rank:3] [train], epoch: 15/50, iter: 200/834, loss: 0.34638, top1: 0.56469, throughput: 1328.21 | 2022-05-21 11:59:59.981 [rank:2] [train], epoch: 15/50, iter: 200/834, loss: 0.34619, top1: 0.56474, throughput: 1328.11 | 2022-05-21 11:59:59.982 [rank:6] [train], epoch: 15/50, iter: 200/834, loss: 0.34573, top1: 0.56750, throughput: 1328.12 | 2022-05-21 11:59:59.983 [rank:7] [train], epoch: 15/50, iter: 200/834, loss: 0.34976, top1: 0.56234, throughput: 1328.07 | 2022-05-21 11:59:59.983 [rank:4] [train], epoch: 15/50, iter: 200/834, loss: 0.34794, top1: 0.56234, throughput: 1327.98 | 2022-05-21 11:59:59.984 [rank:1] [train], epoch: 15/50, iter: 200/834, loss: 0.34501, top1: 0.56479, throughput: 1327.97 | 2022-05-21 11:59:59.984 [rank:7] [train], epoch: 15/50, iter: 300/834, loss: 0.34526, top1: 0.57151, throughput: 1329.13 | 2022-05-21 12:00:14.429 [rank:4] [train], epoch: 15/50, iter: 300/834, loss: 0.34675, top1: 0.56578, throughput: 1329.21 | 2022-05-21 12:00:14.429 [rank:1] [train], epoch: 15/50, iter: 300/834, loss: 0.34731, top1: 0.56698, throughput: 1329.21 | 2022-05-21 12:00:14.428 [rank:5] [train], epoch: 15/50, iter: 300/834, loss: 0.34724, top1: 0.56172, throughput: 1328.88 | 2022-05-21 12:00:14.429 [rank:6] [train], epoch: 15/50, iter: 300/834, loss: 0.34680, top1: 0.56625, throughput: 1329.01 | 2022-05-21 12:00:14.429 [rank:2] [train], epoch: 15/50, iter: 300/834, loss: 0.34611, top1: 0.56396, throughput: 1328.97 | 2022-05-21 12:00:14.429 [rank:0] [train], epoch: 15/50, iter: 300/834, loss: 0.34570, top1: 0.56714, throughput: 1328.90 | 2022-05-21 12:00:14.431 [rank:3] [train], epoch: 15/50, iter: 300/834, loss: 0.34565, top1: 0.56464, throughput: 1328.74 | 2022-05-21 12:00:14.431 [rank:6] [train], epoch: 15/50, iter: 400/834, loss: 0.34497, top1: 0.56750, throughput: 1323.54 | 2022-05-21 12:00:28.936 [rank:2] [train], epoch: 15/50, iter: 400/834, loss: 0.34376, top1: 0.56990, throughput: 1323.56 | 2022-05-21 12:00:28.935 [rank:4] [train], epoch: 15/50, iter: 400/834, loss: 0.34915, top1: 0.55604, throughput: 1323.48 | 2022-05-21 12:00:28.936 [rank:1] [train], epoch: 15/50, iter: 400/834, loss: 0.34497, top1: 0.56542, throughput: 1323.43 | 2022-05-21 12:00:28.936 [rank:5] [train], epoch: 15/50, iter: 400/834, loss: 0.34643, top1: 0.56625, throughput: 1323.32 | 2022-05-21 12:00:28.938 [rank:7] [train], epoch: 15/50, iter: 400/834, loss: 0.34531, top1: 0.56687, throughput: 1323.36 | 2022-05-21 12:00:28.937 [rank:0] [train], epoch: 15/50, iter: 400/834, loss: 0.34647, top1: 0.56620, throughput: 1323.55 | 2022-05-21 12:00:28.937 [rank:3] [train], epoch: 15/50, iter: 400/834, loss: 0.34922, top1: 0.55958, throughput: 1323.57 | 2022-05-21 12:00:28.937 [rank:5] [train], epoch: 15/50, iter: 500/834, loss: 0.34575, top1: 0.56130, throughput: 1327.79 | 2022-05-21 12:00:43.398 [rank:2] [train], epoch: 15/50, iter: 500/834, loss: 0.34686, top1: 0.56313, throughput: 1327.55 | 2022-05-21 12:00:43.398 [rank:6] [train], epoch: 15/50, iter: 500/834, loss: 0.34862, top1: 0.55974, throughput: 1327.41 | 2022-05-21 12:00:43.400 [rank:4] [train], epoch: 15/50, iter: 500/834, loss: 0.34840, top1: 0.56646, throughput: 1327.44 | 2022-05-21 12:00:43.400 [rank:7] [train], epoch: 15/50, iter: 500/834, loss: 0.34822, top1: 0.56542, throughput: 1327.53 | 2022-05-21 12:00:43.400 [rank:3] [train], epoch: 15/50, iter: 500/834, loss: 0.34715, top1: 0.56130, throughput: 1327.54 | 2022-05-21 12:00:43.400 [rank:1] [train], epoch: 15/50, iter: 500/834, loss: 0.34637, top1: 0.56042, throughput: 1327.41 | 2022-05-21 12:00:43.401 [rank:0] [train], epoch: 15/50, iter: 500/834, loss: 0.34735, top1: 0.55995, throughput: 1327.51 | 2022-05-21 12:00:43.401 [rank:7] [train], epoch: 15/50, iter: 600/834, loss: 0.34844, top1: 0.56182, throughput: 1325.24 | 2022-05-21 12:00:57.888 [rank:6] [train], epoch: 15/50, iter: 600/834, loss: 0.35113, top1: 0.55807, throughput: 1325.15 | 2022-05-21 12:00:57.889 [rank:4] [train], epoch: 15/50, iter: 600/834, loss: 0.34603, top1: 0.56094, throughput: 1325.17 | 2022-05-21 12:00:57.888 [rank:5] [train], epoch: 15/50, iter: 600/834, loss: 0.34992, top1: 0.55880, throughput: 1324.98 | 2022-05-21 12:00:57.889 [rank:0] [train], epoch: 15/50, iter: 600/834, loss: 0.34631, top1: 0.56531, throughput: 1325.02 | 2022-05-21 12:00:57.891 [rank:1] [train], epoch: 15/50, iter: 600/834, loss: 0.34682, top1: 0.56516, throughput: 1325.10 | 2022-05-21 12:00:57.890 [rank:2] [train], epoch: 15/50, iter: 600/834, loss: 0.34633, top1: 0.56734, throughput: 1324.92 | 2022-05-21 12:00:57.889 [rank:3] [train], epoch: 15/50, iter: 600/834, loss: 0.34741, top1: 0.56604, throughput: 1325.03 | 2022-05-21 12:00:57.891 [rank:6] [train], epoch: 15/50, iter: 700/834, loss: 0.34934, top1: 0.55995, throughput: 1322.16 | 2022-05-21 12:01:12.411 [rank:5] [train], epoch: 15/50, iter: 700/834, loss: 0.34619, top1: 0.56401, throughput: 1322.12 | 2022-05-21 12:01:12.411 [rank:7] [train], epoch: 15/50, iter: 700/834, loss: 0.34587, top1: 0.56708, throughput: 1322.04 | 2022-05-21 12:01:12.411 [rank:2] [train], epoch: 15/50, iter: 700/834, loss: 0.34373, top1: 0.56917, throughput: 1322.16 | 2022-05-21 12:01:12.411 [rank:4] [train], epoch: 15/50, iter: 700/834, loss: 0.34607, top1: 0.56687, throughput: 1322.16 | 2022-05-21 12:01:12.410 [rank:3] [train], epoch: 15/50, iter: 700/834, loss: 0.34771, top1: 0.56354, throughput: 1322.14[rank:0] [train], epoch: 15/50, iter: 700/834, loss: 0.34619, top1: 0.56307, throughput: 1322.12 | 2022-05-21 12:01:12.412 | 2022-05-21 12:01:12.413 [rank:1] [train], epoch: 15/50, iter: 700/834, loss: 0.34854, top1: 0.56099, throughput: 1322.11 | 2022-05-21 12:01:12.412 [rank:5] [train], epoch: 15/50, iter: 800/834, loss: 0.34406, top1: 0.56854, throughput: 1327.00 | 2022-05-21 12:01:26.879 [rank:0] [train], epoch: 15/50, iter: 800/834, loss: 0.34902, top1: 0.55630, throughput: 1327.21 | 2022-05-21 12:01:26.880 [rank:4] [train], epoch: 15/50, iter: 800/834, loss: 0.34593, top1: 0.56078, throughput: 1326.94 | 2022-05-21 12:01:26.880 [rank:6] [train], epoch: 15/50, iter: 800/834, loss: 0.34874, top1: 0.55698, throughput: 1326.77 | 2022-05-21 12:01:26.882 [rank:7] [train], epoch: 15/50, iter: 800/834, loss: 0.34854, top1: 0.55833, throughput: 1326.89 | 2022-05-21 12:01:26.881 [rank:1] [train], epoch: 15/50, iter: 800/834, loss: 0.34622, top1: 0.56625, throughput: 1327.07 | 2022-05-21 12:01:26.880 [rank:3] [train], epoch: 15/50, iter: 800/834, loss: 0.34457, top1: 0.56682, throughput: 1326.94 | 2022-05-21 12:01:26.882 [rank:2] [train], epoch: 15/50, iter: 800/834, loss: 0.34668, top1: 0.56318, throughput: 1326.79 | 2022-05-21 12:01:26.882 [rank:5] [train], epoch: 15/50, iter: 834/834, loss: 0.34297, top1: 0.57246, throughput: 1324.56 | 2022-05-21 12:01:31.808 [rank:1] [train], epoch: 15/50, iter: 834/834, loss: 0.34789, top1: 0.56388, throughput: 1324.71 | 2022-05-21 12:01:31.808 [rank:4] [train], epoch: 15/50, iter: 834/834, loss: 0.34416, top1: 0.56725, throughput: 1324.33 | 2022-05-21 12:01:31.809 [rank:2] [train], epoch: 15/50, iter: 834/834, loss: 0.34610, top1: 0.55775, throughput: 1325.10 | 2022-05-21 12:01:31.809 [rank:6] [train], epoch: 15/50, iter: 834/834, loss: 0.34644, top1: 0.56556, throughput: 1324.77 | 2022-05-21 12:01:31.810 [rank:0] [train], epoch: 15/50, iter: 834/834, loss: 0.34521, top1: 0.56572, throughput: 1323.78 | 2022-05-21 12:01:31.811 [rank:3] [train], epoch: 15/50, iter: 834/834, loss: 0.34286, top1: 0.57230, throughput: 1324.39 | 2022-05-21 12:01:31.811 [rank:7] [train], epoch: 15/50, iter: 834/834, loss: 0.34503, top1: 0.56694, throughput: 1323.83 | 2022-05-21 12:01:31.812 [rank:0] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.56560, throughput: 554.73 | 2022-05-21 12:01:43.078 [rank:4] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.56560, throughput: 554.50 | 2022-05-21 12:01:43.080 [rank:2] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.55424, throughput: 554.37 | 2022-05-21 12:01:43.082 [rank:7] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.56704, throughput: 554.53 | 2022-05-21 12:01:43.083 [rank:6] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.55568, throughput: 551.39 | 2022-05-21 12:01:43.145 [rank:1] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.57168, throughput: 551.13 | 2022-05-21 12:01:43.149 [rank:3] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.55312, throughput: 550.97 | 2022-05-21 12:01:43.154 [rank:5] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.54496, throughput: 538.58 | 2022-05-21 12:01:43.412 [rank:6] [train], epoch: 16/50, iter: 100/834, loss: 0.33850, top1: 0.57833, throughput: 1306.93 | 2022-05-21 12:01:57.836 [rank:3] [train], epoch: 16/50, iter: 100/834, loss: 0.34299, top1: 0.57047, throughput: 1307.78 | 2022-05-21 12:01:57.836 [rank:7] [train], epoch: 16/50, iter: 100/834, loss: 0.34240, top1: 0.57057, throughput: 1301.42 | 2022-05-21 12:01:57.836 [rank:4] [train], epoch: 16/50, iter: 100/834, loss: 0.33884, top1: 0.58104, throughput: 1301.16 | 2022-05-21 12:01:57.836 [rank:2] [train], epoch: 16/50, iter: 100/834, loss: 0.34001, top1: 0.57297, throughput: 1301.40 | 2022-05-21 12:01:57.836 [rank:5] [train], epoch: 16/50, iter: 100/834, loss: 0.34136, top1: 0.57120, throughput: 1330.96 | 2022-05-21 12:01:57.838 [rank:1] [train], epoch: 16/50, iter: 100/834, loss: 0.34248, top1: 0.57250, throughput: 1307.06 | 2022-05-21 12:01:57.838 [rank:0] [train], epoch: 16/50, iter: 100/834, loss: 0.34059, top1: 0.57349, throughput: 1300.78 | 2022-05-21 12:01:57.838 [rank:5] [train], epoch: 16/50, iter: 200/834, loss: 0.34309, top1: 0.57161, throughput: 1326.93 | 2022-05-21 12:02:12.307 [rank:6] [train], epoch: 16/50, iter: 200/834, loss: 0.34068, top1: 0.57719, throughput: 1326.68 [rank:4] [train], epoch: 16/50, iter: 200/834, loss: 0.34338, top1: 0.56979, throughput: 1326.76| 2022-05-21 12:02:12.308 | 2022-05-21 12:02:12.308 [rank:7] [train], epoch: 16/50, iter: 200/834, loss: 0.34064, top1: 0.57438, throughput: 1326.66 | 2022-05-21 12:02:12.309 [rank:3] [train], epoch: 16/50, iter: 200/834, loss: 0.34101, top1: 0.57656, throughput: 1326.62 | 2022-05-21 12:02:12.309 [rank:0] [train], epoch: 16/50, iter: 200/834, loss: 0.34354, top1: 0.56943, throughput: 1326.88 | 2022-05-21 12:02:12.308 [rank:2] [train], epoch: 16/50, iter: 200/834, loss: 0.34160, top1: 0.57266, throughput: 1326.72 | 2022-05-21 12:02:12.308 [rank:1] [train], epoch: 16/50, iter: 200/834, loss: 0.34157, top1: 0.57609, throughput: 1326.78 | 2022-05-21 12:02:12.309 [rank:3] [train], epoch: 16/50, iter: 300/834, loss: 0.34404, top1: 0.57005, throughput: 1327.97 [rank:2] [train], epoch: 16/50, iter: 300/834, loss: 0.34234, top1: 0.57365, throughput: 1327.83| 2022-05-21 12:02:26.767 | 2022-05-21 12:02:26.767 [rank:6] [train], epoch: 16/50, iter: 300/834, loss: 0.34043, top1: 0.57833, throughput: 1327.88 | 2022-05-21 12:02:26.767 [rank:7] [train], epoch: 16/50, iter: 300/834, loss: 0.34185, top1: 0.57229, throughput: 1328.00 | 2022-05-21 12:02:26.766 [rank:0] [train], epoch: 16/50, iter: 300/834, loss: 0.34272, top1: 0.57042, throughput: 1327.87 | 2022-05-21 12:02:26.767 [rank:5] [train], epoch: 16/50, iter: 300/834, loss: 0.34057, top1: 0.57865, throughput: 1327.71 | 2022-05-21 12:02:26.768 [rank:1] [train], epoch: 16/50, iter: 300/834, loss: 0.34231, top1: 0.57542, throughput: 1327.93 | 2022-05-21 12:02:26.768 [rank:4] [train], epoch: 16/50, iter: 300/834, loss: 0.34219, top1: 0.57370, throughput: 1327.67 | 2022-05-21 12:02:26.769 [rank:7] [train], epoch: 16/50, iter: 400/834, loss: 0.34011, top1: 0.57771, throughput: 1326.75 | 2022-05-21 12:02:41.238 [rank:6] [train], epoch: 16/50, iter: 400/834, loss: 0.34096, top1: 0.57953, throughput: 1326.77 | 2022-05-21 12:02:41.238 [rank:0] [train], epoch: 16/50, iter: 400/834, loss: 0.34302, top1: 0.57010, throughput: 1326.93 | 2022-05-21 12:02:41.237 [rank:5] [train], epoch: 16/50, iter: 400/834, loss: 0.34192, top1: 0.57568, throughput: 1326.99 | 2022-05-21 12:02:41.237 [rank:4] [train], epoch: 16/50, iter: 400/834, loss: 0.34306, top1: 0.57031, throughput: 1326.98 | 2022-05-21 12:02:41.238 [rank:3] [train], epoch: 16/50, iter: 400/834, loss: 0.34299, top1: 0.56802, throughput: 1326.79 | 2022-05-21 12:02:41.238 [rank:1] [train], epoch: 16/50, iter: 400/834, loss: 0.34315, top1: 0.56786, throughput: 1326.75 | 2022-05-21 12:02:41.239 [rank:2] [train], epoch: 16/50, iter: 400/834, loss: 0.34450, top1: 0.57047, throughput: 1326.66 | 2022-05-21 12:02:41.240 [rank:7] [train], epoch: 16/50, iter: 500/834, loss: 0.34275, top1: 0.57167, throughput: 1327.97 | 2022-05-21 12:02:55.696 [rank:3] [train], epoch: 16/50, iter: 500/834, loss: 0.34537, top1: 0.56333, throughput: 1327.84 | 2022-05-21 12:02:55.697 [rank:1] [train], epoch: 16/50, iter: 500/834, loss: 0.34274, top1: 0.56708, throughput: 1327.93 | 2022-05-21 12:02:55.698 [rank:2] [train], epoch: 16/50, iter: 500/834, loss: 0.34333, top1: 0.57063, throughput: 1327.96 | 2022-05-21 12:02:55.698 [rank:6] [train], epoch: 16/50, iter: 500/834, loss: 0.34710, top1: 0.56432, throughput: 1327.62 | 2022-05-21 12:02:55.700 [rank:4] [train], epoch: 16/50, iter: 500/834, loss: 0.34187, top1: 0.57036, throughput: 1327.50 | 2022-05-21 12:02:55.701 [rank:5] [train], epoch: 16/50, iter: 500/834, loss: 0.34683, top1: 0.56266, throughput: 1327.55 | 2022-05-21 12:02:55.700 [rank:0] [train], epoch: 16/50, iter: 500/834, loss: 0.34420, top1: 0.56937, throughput: 1327.53 | 2022-05-21 12:02:55.700 [rank:7] [train], epoch: 16/50, iter: 600/834, loss: 0.34410, top1: 0.56578, throughput: 1328.60 | 2022-05-21 12:03:10.147 [rank:3] [train], epoch: 16/50, iter: 600/834, loss: 0.34496, top1: 0.56620, throughput: 1328.76 | 2022-05-21 12:03:10.147 [rank:5] [train], epoch: 16/50, iter: 600/834, loss: 0.34294, top1: 0.57000, throughput: 1328.94 | 2022-05-21 12:03:10.148 [rank:2] [train], epoch: 16/50, iter: 600/834, loss: 0.34457, top1: 0.57016, throughput: 1328.78 | 2022-05-21 12:03:10.147 [rank:6] [train], epoch: 16/50, iter: 600/834, loss: 0.34274, top1: 0.57151, throughput: 1328.89 | 2022-05-21 12:03:10.148 [rank:1] [train], epoch: 16/50, iter: 600/834, loss: 0.34797, top1: 0.56083, throughput: 1328.56 | 2022-05-21 12:03:10.149 [rank:4] [train], epoch: 16/50, iter: 600/834, loss: 0.34360, top1: 0.56828, throughput: 1328.81 | 2022-05-21 12:03:10.150 [rank:0] [train], epoch: 16/50, iter: 600/834, loss: 0.34586, top1: 0.56510, throughput: 1328.66 | 2022-05-21 12:03:10.150 [rank:0] [train], epoch: 16/50, iter: 700/834, loss: 0.34743, top1: 0.56182, throughput: 1329.34 | 2022-05-21 12:03:24.594 [rank:7] [train], epoch: 16/50, iter: 700/834, loss: 0.34333, top1: 0.57354, throughput: 1329.24 | 2022-05-21 12:03:24.592 [rank:5] [train], epoch: 16/50, iter: 700/834, loss: 0.34628, top1: 0.56406, throughput: 1329.23 | 2022-05-21 12:03:24.592 [rank:2] [train], epoch: 16/50, iter: 700/834, loss: 0.34193, top1: 0.57188, throughput: 1329.24 | 2022-05-21 12:03:24.592 [rank:3] [train], epoch: 16/50, iter: 700/834, loss: 0.34022, top1: 0.57682, throughput: 1329.05 | 2022-05-21 12:03:24.593 [rank:6] [train], epoch: 16/50, iter: 700/834, loss: 0.34290, top1: 0.56589, throughput: 1329.16 | 2022-05-21 12:03:24.594 [rank:4] [train], epoch: 16/50, iter: 700/834, loss: 0.34548, top1: 0.56255, throughput: 1329.28 | 2022-05-21 12:03:24.594 [rank:1] [train], epoch: 16/50, iter: 700/834, loss: 0.34584, top1: 0.56661, throughput: 1329.20 | 2022-05-21 12:03:24.594 [rank:5] [train], epoch: 16/50, iter: 800/834, loss: 0.34475, top1: 0.56714, throughput: 1329.85 | 2022-05-21 12:03:39.030 [rank:7] [train], epoch: 16/50, iter: 800/834, loss: 0.34278, top1: 0.57177, throughput: 1329.83 | 2022-05-21 12:03:39.030 [rank:2] [train], epoch: 16/50, iter: 800/834, loss: 0.34114, top1: 0.56896, throughput: 1329.75 [rank:1] [train], epoch: 16/50, iter: 800/834, loss: 0.34365, top1: 0.57089, throughput: 1330.04| 2022-05-21 12:03:39.031 | 2022-05-21 12:03:39.030 [rank:3] [train], epoch: 16/50, iter: 800/834, loss: 0.34261, top1: 0.56938, throughput: 1329.95 | 2022-05-21 12:03:39.030 [rank:0] [train], epoch: 16/50, iter: 800/834, loss: 0.34318, top1: 0.57156, throughput: 1329.94 | 2022-05-21 12:03:39.030 [rank:4] [train], epoch: 16/50, iter: 800/834, loss: 0.34373, top1: 0.56490, throughput: 1330.00 | 2022-05-21 12:03:39.030 [rank:6] [train], epoch: 16/50, iter: 800/834, loss: 0.34376, top1: 0.56833, throughput: 1329.92 | 2022-05-21 12:03:39.030 [rank:4] [train], epoch: 16/50, iter: 834/834, loss: 0.34497, top1: 0.56235, throughput: 1319.10 | 2022-05-21 12:03:43.979 [rank:7] [train], epoch: 16/50, iter: 834/834, loss: 0.34520, top1: 0.57031, throughput: 1318.77 | 2022-05-21 12:03:43.980[rank:1] [train], epoch: 16/50, iter: 834/834, loss: 0.34893, top1: 0.56020, throughput: 1318.80 | 2022-05-21 12:03:43.980 [rank:6] [train], epoch: 16/50, iter: 834/834, loss: 0.34451, top1: 0.57368, throughput: 1319.01 | 2022-05-21 12:03:43.980 [rank:3] [train], epoch: 16/50, iter: 834/834, loss: 0.33837, top1: 0.57858, throughput: 1318.86 | 2022-05-21 12:03:43.980 [rank:0] [train], epoch: 16/50, iter: 834/834, loss: 0.34504, top1: 0.56357, throughput: 1318.70 | 2022-05-21 12:03:43.981 [rank:5] [train], epoch: 16/50, iter: 834/834, loss: 0.34498, top1: 0.56587, throughput: 1318.46[rank:2] [train], epoch: 16/50, iter: 834/834, loss: 0.34499, top1: 0.56694, throughput: 1318.78 | 2022-05-21 12:03:43.981| 2022-05-21 12:03:43.981 [rank:7] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.56704, throughput: 574.84 | 2022-05-21 12:03:54.852 [rank:0] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.57008, throughput: 574.38 | 2022-05-21 12:03:54.862 [rank:4] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.56832, throughput: 573.60 | 2022-05-21 12:03:54.875 [rank:2] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.57008, throughput: 568.61 | 2022-05-21 12:03:54.972 [rank:6] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.57040, throughput: 568.34 | 2022-05-21 12:03:54.976 [rank:3] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.55952, throughput: 565.72 | 2022-05-21 12:03:55.028 [rank:5] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.56384, throughput: 560.22 | 2022-05-21 12:03:55.137 [rank:1] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.57728, throughput: 558.01 | 2022-05-21 12:03:55.180 [rank:4] [train], epoch: 17/50, iter: 100/834, loss: 0.33909, top1: 0.57979, throughput: 1303.70 | 2022-05-21 12:04:09.602 [rank:6] [train], epoch: 17/50, iter: 100/834, loss: 0.33949, top1: 0.57693, throughput: 1312.68 | 2022-05-21 12:04:09.603 [rank:7] [train], epoch: 17/50, iter: 100/834, loss: 0.34053, top1: 0.57750, throughput: 1301.65 | 2022-05-21 12:04:09.603 [rank:1] [train], epoch: 17/50, iter: 100/834, loss: 0.33937, top1: 0.57760, throughput: 1330.98 | 2022-05-21 12:04:09.606 [rank:5] [train], epoch: 17/50, iter: 100/834, loss: 0.33732, top1: 0.58177, throughput: 1327.09 | 2022-05-21 12:04:09.605 [rank:3] [train], epoch: 17/50, iter: 100/834, loss: 0.33827, top1: 0.57948, throughput: 1317.06 | 2022-05-21 12:04:09.606 [rank:0] [train], epoch: 17/50, iter: 100/834, loss: 0.33508, top1: 0.58516, throughput: 1302.26 | 2022-05-21 12:04:09.606 [rank:2] [train], epoch: 17/50, iter: 100/834, loss: 0.33629, top1: 0.58260, throughput: 1312.13 | 2022-05-21 12:04:09.605 [rank:7] [train], epoch: 17/50, iter: 200/834, loss: 0.33808, top1: 0.58031, throughput: 1329.88 | 2022-05-21 12:04:24.040 [rank:5] [train], epoch: 17/50, iter: 200/834, loss: 0.34104, top1: 0.57151, throughput: 1330.11 | 2022-05-21 12:04:24.040 [rank:3] [train], epoch: 17/50, iter: 200/834, loss: 0.33981, top1: 0.57677, throughput: 1329.93 | 2022-05-21 12:04:24.042 [rank:4] [train], epoch: 17/50, iter: 200/834, loss: 0.33916, top1: 0.57786, throughput: 1329.64 | 2022-05-21 12:04:24.042 [rank:2] [train], epoch: 17/50, iter: 200/834, loss: 0.34008, top1: 0.57745, throughput: 1329.83 | 2022-05-21 12:04:24.043 [rank:6] [train], epoch: 17/50, iter: 200/834, loss: 0.33977, top1: 0.57766, throughput: 1329.62 | 2022-05-21 12:04:24.043 [rank:1] [train], epoch: 17/50, iter: 200/834, loss: 0.33790, top1: 0.58042, throughput: 1329.86 | 2022-05-21 12:04:24.043 [rank:0] [train], epoch: 17/50, iter: 200/834, loss: 0.33982, top1: 0.57802, throughput: 1329.91 | 2022-05-21 12:04:24.043 [rank:5] [train], epoch: 17/50, iter: 300/834, loss: 0.33888, top1: 0.57875, throughput: 1328.69 | 2022-05-21 12:04:38.490 [rank:2] [train], epoch: 17/50, iter: 300/834, loss: 0.34096, top1: 0.57490, throughput: 1328.94 | 2022-05-21 12:04:38.490 [rank:4] [train], epoch: 17/50, iter: 300/834, loss: 0.33677, top1: 0.58021, throughput: 1328.91 | 2022-05-21 12:04:38.490 [rank:1] [train], epoch: 17/50, iter: 300/834, loss: 0.34085, top1: 0.57151, throughput: 1328.80 | 2022-05-21 12:04:38.493 [rank:7] [train], epoch: 17/50, iter: 300/834, loss: 0.34128, top1: 0.57609, throughput: 1328.53 | 2022-05-21 12:04:38.492 [rank:3] [train], epoch: 17/50, iter: 300/834, loss: 0.33909, top1: 0.57443, throughput: 1328.79 | 2022-05-21 12:04:38.492 [rank:6] [train], epoch: 17/50, iter: 300/834, loss: 0.33983, top1: 0.57240, throughput: 1328.84 | 2022-05-21 12:04:38.492 [rank:0] [train], epoch: 17/50, iter: 300/834, loss: 0.34200, top1: 0.57203, throughput: 1328.81 | 2022-05-21 12:04:38.492 [rank:4] [train], epoch: 17/50, iter: 400/834, loss: 0.34280, top1: 0.57437, throughput: 1327.98 | 2022-05-21 12:04:52.948 [rank:7] [train], epoch: 17/50, iter: 400/834, loss: 0.34071, top1: 0.57651, throughput: 1328.24 | 2022-05-21 12:04:52.947 [rank:2] [train], epoch: 17/50, iter: 400/834, loss: 0.34028, top1: 0.57786, throughput: 1327.97 | 2022-05-21 12:04:52.949 [rank:5] [train], epoch: 17/50, iter: 400/834, loss: 0.33869, top1: 0.57510, throughput: 1327.97 | 2022-05-21 12:04:52.948 [rank:6] [train], epoch: 17/50, iter: 400/834, loss: 0.33971, top1: 0.57708, throughput: 1328.01 | 2022-05-21 12:04:52.950 [rank:3] [train], epoch: 17/50, iter: 400/834, loss: 0.33492, top1: 0.58354, throughput: 1327.66 | 2022-05-21 12:04:52.953 [rank:0] [train], epoch: 17/50, iter: 400/834, loss: 0.34279, top1: 0.57089, throughput: 1327.82 | 2022-05-21 12:04:52.951 [rank:1] [train], epoch: 17/50, iter: 400/834, loss: 0.34249, top1: 0.57010, throughput: 1327.67 | 2022-05-21 12:04:52.954 [rank:5] [train], epoch: 17/50, iter: 500/834, loss: 0.34369, top1: 0.56802, throughput: 1321.38 | 2022-05-21 12:05:07.479 [rank:4] [train], epoch: 17/50, iter: 500/834, loss: 0.33854, top1: 0.58063, throughput: 1321.35 | 2022-05-21 12:05:07.479 [rank:7] [train], epoch: 17/50, iter: 500/834, loss: 0.34235, top1: 0.57422, throughput: 1321.28 | 2022-05-21 12:05:07.479 [rank:6] [train], epoch: 17/50, iter: 500/834, loss: 0.34047, top1: 0.57813, throughput: 1321.41 | 2022-05-21 12:05:07.480 [rank:3] [train], epoch: 17/50, iter: 500/834, loss: 0.33920, top1: 0.57740, throughput: 1321.72 | 2022-05-21 12:05:07.480 [rank:2] [train], epoch: 17/50, iter: 500/834, loss: 0.33988, top1: 0.57724, throughput: 1321.15 | 2022-05-21 12:05:07.481 [rank:0] [train], epoch: 17/50, iter: 500/834, loss: 0.34155, top1: 0.57583, throughput: 1321.32 | 2022-05-21 12:05:07.482 [rank:1] [train], epoch: 17/50, iter: 500/834, loss: 0.33938, top1: 0.57547, throughput: 1321.58 | 2022-05-21 12:05:07.482 [rank:6] [train], epoch: 17/50, iter: 600/834, loss: 0.34116, top1: 0.57214, throughput: 1328.63 | 2022-05-21 12:05:21.931 [rank:3] [train], epoch: 17/50, iter: 600/834, loss: 0.34066, top1: 0.57458, throughput: 1328.68 | 2022-05-21 12:05:21.930 [rank:5] [train], epoch: 17/50, iter: 600/834, loss: 0.34348, top1: 0.56849, throughput: 1328.55 | 2022-05-21 12:05:21.931 [rank:2] [train], epoch: 17/50, iter: 600/834, loss: 0.33754, top1: 0.58260, throughput: 1328.76 | 2022-05-21 12:05:21.931 [rank:0] [train], epoch: 17/50, iter: 600/834, loss: 0.34282, top1: 0.56703, throughput: 1328.84 | 2022-05-21 12:05:21.931 [rank:1] [train], epoch: 17/50, iter: 600/834, loss: 0.33898, top1: 0.57943, throughput: 1328.63 | 2022-05-21 12:05:21.933 [rank:7] [train], epoch: 17/50, iter: 600/834, loss: 0.34062, top1: 0.57339, throughput: 1328.16 | 2022-05-21 12:05:21.935 [rank:4] [train], epoch: 17/50, iter: 600/834, loss: 0.34142, top1: 0.57714, throughput: 1328.11 | 2022-05-21 12:05:21.936 [rank:3] [train], epoch: 17/50, iter: 700/834, loss: 0.34162, top1: 0.57422, throughput: 1327.32 | 2022-05-21 12:05:36.396 [rank:4] [train], epoch: 17/50, iter: 700/834, loss: 0.33932, top1: 0.57859, throughput: 1327.82 | 2022-05-21 12:05:36.395 [rank:0] [train], epoch: 17/50, iter: 700/834, loss: 0.34119, top1: 0.57562, throughput: 1327.21 | 2022-05-21 12:05:36.397 [rank:2] [train], epoch: 17/50, iter: 700/834, loss: 0.34168, top1: 0.57208, throughput: 1327.40 | 2022-05-21 12:05:36.395 [rank:7] [train], epoch: 17/50, iter: 700/834, loss: 0.33906, top1: 0.57349, throughput: 1327.64 | 2022-05-21 12:05:36.397 [rank:1] [train], epoch: 17/50, iter: 700/834, loss: 0.34303, top1: 0.56953, throughput: 1327.42 | 2022-05-21 12:05:36.397 [rank:5] [train], epoch: 17/50, iter: 700/834, loss: 0.34202, top1: 0.57146, throughput: 1326.96 | 2022-05-21 12:05:36.400 [rank:6] [train], epoch: 17/50, iter: 700/834, loss: 0.33934, top1: 0.57812, throughput: 1326.91 | 2022-05-21 12:05:36.400 [rank:6] [train], epoch: 17/50, iter: 800/834, loss: 0.34019, top1: 0.57443, throughput: 1327.63 | 2022-05-21 12:05:50.862 [rank:1] [train], epoch: 17/50, iter: 800/834, loss: 0.34245, top1: 0.57099, throughput: 1327.30 | 2022-05-21 12:05:50.863 [rank:3] [train], epoch: 17/50, iter: 800/834, loss: 0.34212, top1: 0.57307, throughput: 1327.01 | 2022-05-21 12:05:50.864 [rank:2] [train], epoch: 17/50, iter: 800/834, loss: 0.34478, top1: 0.56219, throughput: 1327.14 | 2022-05-21 12:05:50.863 [rank:7] [train], epoch: 17/50, iter: 800/834, loss: 0.34010, top1: 0.57698, throughput: 1327.08 | 2022-05-21 12:05:50.865 [rank:4] [train], epoch: 17/50, iter: 800/834, loss: 0.33715, top1: 0.58234, throughput: 1326.95 | 2022-05-21 12:05:50.865 [rank:0] [train], epoch: 17/50, iter: 800/834, loss: 0.34003, top1: 0.57469, throughput: 1327.29 | 2022-05-21 12:05:50.863 [rank:5] [train], epoch: 17/50, iter: 800/834, loss: 0.34109, top1: 0.57766, throughput: 1327.31 | 2022-05-21 12:05:50.865 [rank:7] [train], epoch: 17/50, iter: 834/834, loss: 0.33971, top1: 0.58027, throughput: 1328.00[rank:4] [train], epoch: 17/50, iter: 834/834, loss: 0.34079, top1: 0.57292, throughput: 1328.04 | 2022-05-21 12:05:55.780 | 2022-05-21 12:05:55.780 [rank:0] [train], epoch: 17/50, iter: 834/834, loss: 0.34596, top1: 0.56388, throughput: 1327.41 | 2022-05-21 12:05:55.781 [rank:5] [train], epoch: 17/50, iter: 834/834, loss: 0.34026, top1: 0.57843, throughput: 1328.02 | 2022-05-21 12:05:55.781 [rank:6] [train], epoch: 17/50, iter: 834/834, loss: 0.33812, top1: 0.58471, throughput: 1327.24 | 2022-05-21 12:05:55.781 [rank:3] [train], epoch: 17/50, iter: 834/834, loss: 0.33896, top1: 0.57460, throughput: 1327.62 | 2022-05-21 12:05:55.781 [rank:2] [train], epoch: 17/50, iter: 834/834, loss: 0.34071, top1: 0.57966, throughput: 1327.37 | 2022-05-21 12:05:55.781 [rank:1] [train], epoch: 17/50, iter: 834/834, loss: 0.34110, top1: 0.58012, throughput: 1327.12 | 2022-05-21 12:05:55.782 [rank:0] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.57104, throughput: 561.15 | 2022-05-21 12:06:06.919 [rank:2] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.54496, throughput: 561.07 | 2022-05-21 12:06:06.920 [rank:7] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.55872, throughput: 560.62 | 2022-05-21 12:06:06.929 [rank:4] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.55616, throughput: 560.49 | 2022-05-21 12:06:06.931 [rank:1] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.56976, throughput: 555.09 | 2022-05-21 12:06:07.041 [rank:3] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.55600, throughput: 553.51 | 2022-05-21 12:06:07.073 [rank:6] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.56064, throughput: 552.12 | 2022-05-21 12:06:07.101 [rank:5] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.54480, throughput: 541.76 | 2022-05-21 12:06:07.317 [rank:5] [train], epoch: 18/50, iter: 100/834, loss: 0.33621, top1: 0.58078, throughput: 1331.80 | 2022-05-21 12:06:21.734 [rank:6] [train], epoch: 18/50, iter: 100/834, loss: 0.33368, top1: 0.59401, throughput: 1311.97 | 2022-05-21 12:06:21.735 [rank:7] [train], epoch: 18/50, iter: 100/834, loss: 0.33538, top1: 0.58026, throughput: 1296.77 | 2022-05-21 12:06:21.735 [rank:4] [train], epoch: 18/50, iter: 100/834, loss: 0.33686, top1: 0.57974, throughput: 1296.84 | 2022-05-21 12:06:21.736 [rank:2] [train], epoch: 18/50, iter: 100/834, loss: 0.33349, top1: 0.58969, throughput: 1295.87 | 2022-05-21 12:06:21.736 [rank:0] [train], epoch: 18/50, iter: 100/834, loss: 0.33494, top1: 0.58453, throughput: 1295.71 | 2022-05-21 12:06:21.737 [rank:3] [train], epoch: 18/50, iter: 100/834, loss: 0.33063, top1: 0.59333, throughput: 1309.34 | 2022-05-21 12:06:21.737 [rank:1] [train], epoch: 18/50, iter: 100/834, loss: 0.33412, top1: 0.58458, throughput: 1306.51 | 2022-05-21 12:06:21.737 [rank:4] [train], epoch: 18/50, iter: 200/834, loss: 0.33691, top1: 0.58401, throughput: 1325.93 | 2022-05-21 12:06:36.217 [rank:3] [train], epoch: 18/50, iter: 200/834, loss: 0.33657, top1: 0.58234, throughput: 1325.93 | 2022-05-21 12:06:36.217 [rank:7] [train], epoch: 18/50, iter: 200/834, loss: 0.33127, top1: 0.58938, throughput: 1325.79 | 2022-05-21 12:06:36.217 [rank:0] [train], epoch: 18/50, iter: 200/834, loss: 0.33333, top1: 0.58802, throughput: 1325.97 | 2022-05-21 12:06:36.217 [rank:2] [train], epoch: 18/50, iter: 200/834, loss: 0.33655, top1: 0.58365, throughput: 1325.76 | 2022-05-21 12:06:36.219 [rank:1] [train], epoch: 18/50, iter: 200/834, loss: 0.33433, top1: 0.58844, throughput: 1325.84 | 2022-05-21 12:06:36.218 [rank:6] [train], epoch: 18/50, iter: 200/834, loss: 0.33594, top1: 0.58271, throughput: 1325.33 | 2022-05-21 12:06:36.222 [rank:5] [train], epoch: 18/50, iter: 200/834, loss: 0.33555, top1: 0.58557, throughput: 1325.28 | 2022-05-21 12:06:36.221 [rank:5] [train], epoch: 18/50, iter: 300/834, loss: 0.33275, top1: 0.58802, throughput: 1320.46 | 2022-05-21 12:06:50.762 [rank:3] [train], epoch: 18/50, iter: 300/834, loss: 0.33624, top1: 0.58307, throughput: 1319.83 | 2022-05-21 12:06:50.765 [rank:7] [train], epoch: 18/50, iter: 300/834, loss: 0.33957, top1: 0.57062, throughput: 1320.07 | 2022-05-21 12:06:50.761 [rank:4] [train], epoch: 18/50, iter: 300/834, loss: 0.33472, top1: 0.58349, throughput: 1319.94 | 2022-05-21 12:06:50.763 [rank:6] [train], epoch: 18/50, iter: 300/834, loss: 0.33714, top1: 0.57948, throughput: 1320.34 | 2022-05-21 12:06:50.764 [rank:1] [train], epoch: 18/50, iter: 300/834, loss: 0.33631, top1: 0.57974, throughput: 1320.00 | 2022-05-21 12:06:50.764 [rank:2] [train], epoch: 18/50, iter: 300/834, loss: 0.33710, top1: 0.58161, throughput: 1320.10 | 2022-05-21 12:06:50.763 [rank:0] [train], epoch: 18/50, iter: 300/834, loss: 0.33564, top1: 0.58042, throughput: 1319.81 | 2022-05-21 12:06:50.764 [rank:4] [train], epoch: 18/50, iter: 400/834, loss: 0.33779, top1: 0.57901, throughput: 1329.51 | 2022-05-21 12:07:05.204 [rank:0] [train], epoch: 18/50, iter: 400/834, loss: 0.33812, top1: 0.57896, throughput: 1329.45 | 2022-05-21 12:07:05.206 [rank:5] [train], epoch: 18/50, iter: 400/834, loss: 0.33840, top1: 0.58036, throughput: 1329.33 | 2022-05-21 12:07:05.205 [rank:1] [train], epoch: 18/50, iter: 400/834, loss: 0.33826, top1: 0.58151, throughput: 1329.56 | 2022-05-21 12:07:05.204 [rank:6] [train], epoch: 18/50, iter: 400/834, loss: 0.33785, top1: 0.58224, throughput: 1329.52 | 2022-05-21 12:07:05.205 [rank:2] [train], epoch: 18/50, iter: 400/834, loss: 0.33650, top1: 0.58214, throughput: 1329.34 | 2022-05-21 12:07:05.206 [rank:3] [train], epoch: 18/50, iter: 400/834, loss: 0.33843, top1: 0.57823, throughput: 1329.51 | 2022-05-21 12:07:05.206 [rank:7] [train], epoch: 18/50, iter: 400/834, loss: 0.33865, top1: 0.57604, throughput: 1329.01 | 2022-05-21 12:07:05.208 [rank:7] [train], epoch: 18/50, iter: 500/834, loss: 0.33716, top1: 0.58109, throughput: 1321.03 | 2022-05-21 12:07:19.742 [rank:6] [train], epoch: 18/50, iter: 500/834, loss: 0.33886, top1: 0.57615, throughput: 1320.71 | 2022-05-21 12:07:19.743 [rank:5] [train], epoch: 18/50, iter: 500/834, loss: 0.33902, top1: 0.57516, throughput: 1320.73 | 2022-05-21 12:07:19.742 [rank:4] [train], epoch: 18/50, iter: 500/834, loss: 0.33910, top1: 0.57708, throughput: 1320.48 | 2022-05-21 12:07:19.744 [rank:2] [train], epoch: 18/50, iter: 500/834, loss: 0.33881, top1: 0.58005, throughput: 1320.76[rank:1] [train], epoch: 18/50, iter: 500/834, loss: 0.33574, top1: 0.58552, throughput: 1320.51 | 2022-05-21 12:07:19.744 [rank:0] [train], epoch: 18/50, iter: 500/834, loss: 0.33941, top1: 0.57870, throughput: 1320.74 | 2022-05-21 12:07:19.744 | 2022-05-21 12:07:19.743 [rank:3] [train], epoch: 18/50, iter: 500/834, loss: 0.33705, top1: 0.58344, throughput: 1320.61 | 2022-05-21 12:07:19.745 [rank:4] [train], epoch: 18/50, iter: 600/834, loss: 0.33868, top1: 0.58208, throughput: 1316.37 | 2022-05-21 12:07:34.330 [rank:0] [train], epoch: 18/50, iter: 600/834, loss: 0.33591, top1: 0.58099, throughput: 1316.18 | 2022-05-21 12:07:34.331 [rank:5] [train], epoch: 18/50, iter: 600/834, loss: 0.33888, top1: 0.57641, throughput: 1316.27 | 2022-05-21 12:07:34.329 [rank:2] [train], epoch: 18/50, iter: 600/834, loss: 0.33753, top1: 0.57953, throughput: 1316.25 | 2022-05-21 12:07:34.330 [rank:7] [train], epoch: 18/50, iter: 600/834, loss: 0.34002, top1: 0.57474, throughput: 1316.25 | 2022-05-21 12:07:34.329 [rank:3] [train], epoch: 18/50, iter: 600/834, loss: 0.33779, top1: 0.58078, throughput: 1316.30 | 2022-05-21 12:07:34.331 [rank:6] [train], epoch: 18/50, iter: 600/834, loss: 0.34134, top1: 0.57193, throughput: 1316.14 | 2022-05-21 12:07:34.331 [rank:1] [train], epoch: 18/50, iter: 600/834, loss: 0.33769, top1: 0.57776, throughput: 1316.27 | 2022-05-21 12:07:34.331 [rank:7] [train], epoch: 18/50, iter: 700/834, loss: 0.33844, top1: 0.57786, throughput: 1325.79 | 2022-05-21 12:07:48.811 [rank:6] [train], epoch: 18/50, iter: 700/834, loss: 0.34053, top1: 0.57542, throughput: 1325.90 | 2022-05-21 12:07:48.812 [rank:3] [train], epoch: 18/50, iter: 700/834, loss: 0.33812, top1: 0.57714, throughput: 1325.93 | 2022-05-21 12:07:48.811 [rank:5] [train], epoch: 18/50, iter: 700/834, loss: 0.34022, top1: 0.57297, throughput: 1325.67 | 2022-05-21 12:07:48.812 [rank:0] [train], epoch: 18/50, iter: 700/834, loss: 0.33758, top1: 0.57802, throughput: 1325.88 | 2022-05-21 12:07:48.812 [rank:1] [train], epoch: 18/50, iter: 700/834, loss: 0.33895, top1: 0.57932, throughput: 1325.89 | 2022-05-21 12:07:48.812 [rank:4] [train], epoch: 18/50, iter: 700/834, loss: 0.33717, top1: 0.58198, throughput: 1325.67 | 2022-05-21 12:07:48.813 [rank:2] [train], epoch: 18/50, iter: 700/834, loss: 0.33690, top1: 0.57969, throughput: 1325.55 | 2022-05-21 12:07:48.815 [rank:5] [train], epoch: 18/50, iter: 800/834, loss: 0.33911, top1: 0.57786, throughput: 1327.28 | 2022-05-21 12:08:03.278 [rank:4] [train], epoch: 18/50, iter: 800/834, loss: 0.33510, top1: 0.58396, throughput: 1327.22 | 2022-05-21 12:08:03.279 [rank:7] [train], epoch: 18/50, iter: 800/834, loss: 0.33829, top1: 0.57938, throughput: 1327.03 | 2022-05-21 12:08:03.279 [rank:2] [train], epoch: 18/50, iter: 800/834, loss: 0.34216, top1: 0.57026, throughput: 1327.53 | 2022-05-21 12:08:03.278 [rank:1] [train], epoch: 18/50, iter: 800/834, loss: 0.33738, top1: 0.58115, throughput: 1327.08 | 2022-05-21 12:08:03.280 [rank:6] [train], epoch: 18/50, iter: 800/834, loss: 0.33707, top1: 0.58208, throughput: 1327.15[rank:0] [train], epoch: 18/50, iter: 800/834, loss: 0.33776, top1: 0.57958, throughput: 1327.13 | 2022-05-21 12:08:03.280 | 2022-05-21 12:08:03.279 [rank:3] [train], epoch: 18/50, iter: 800/834, loss: 0.33695, top1: 0.57896, throughput: 1327.05 | 2022-05-21 12:08:03.280 [rank:1] [train], epoch: 18/50, iter: 834/834, loss: 0.33489, top1: 0.58824, throughput: 1321.47 | 2022-05-21 12:08:08.220 [rank:5] [train], epoch: 18/50, iter: 834/834, loss: 0.33520, top1: 0.58471, throughput: 1320.73 | 2022-05-21 12:08:08.221 [rank:6] [train], epoch: 18/50, iter: 834/834, loss: 0.33636, top1: 0.58410, throughput: 1320.91 | 2022-05-21 12:08:08.221 [rank:0] [train], epoch: 18/50, iter: 834/834, loss: 0.33514, top1: 0.58578, throughput: 1321.16 | 2022-05-21 12:08:08.221 [rank:7] [train], epoch: 18/50, iter: 834/834, loss: 0.33723, top1: 0.58716, throughput: 1320.98 | 2022-05-21 12:08:08.221 [rank:4] [train], epoch: 18/50, iter: 834/834, loss: 0.33993, top1: 0.57812, throughput: 1321.11 | 2022-05-21 12:08:08.221 [rank:2] [train], epoch: 18/50, iter: 834/834, loss: 0.33582, top1: 0.58012, throughput: 1319.66 | 2022-05-21 12:08:08.224 [rank:3] [train], epoch: 18/50, iter: 834/834, loss: 0.33576, top1: 0.58226, throughput: 1320.16 | 2022-05-21 12:08:08.224 [rank:7] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.58592, throughput: 572.30 | 2022-05-21 12:08:19.142 [rank:0] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.58080, throughput: 572.02 | 2022-05-21 12:08:19.147 [rank:2] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.57632, throughput: 568.58 | 2022-05-21 12:08:19.217 [rank:4] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.58560, throughput: 568.30 | 2022-05-21 12:08:19.218 [rank:6] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.58224, throughput: 565.39 | 2022-05-21 12:08:19.275 [rank:3] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.57632, throughput: 564.35 | 2022-05-21 12:08:19.299 [rank:1] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.58160, throughput: 554.43 | 2022-05-21 12:08:19.492 [rank:5] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.57072, throughput: 554.42 | 2022-05-21 12:08:19.494 [rank:6] [train], epoch: 19/50, iter: 100/834, loss: 0.33412, top1: 0.58641, throughput: 1310.70 | 2022-05-21 12:08:33.924 [rank:7] [train], epoch: 19/50, iter: 100/834, loss: 0.33379, top1: 0.58536, throughput: 1298.90 | 2022-05-21 12:08:33.924 [rank:1] [train], epoch: 19/50, iter: 100/834, loss: 0.32904, top1: 0.59599, throughput: 1330.38 | 2022-05-21 12:08:33.924 [rank:4] [train], epoch: 19/50, iter: 100/834, loss: 0.33267, top1: 0.58859, throughput: 1305.70 | 2022-05-21 12:08:33.923 [rank:3] [train], epoch: 19/50, iter: 100/834, loss: 0.33612, top1: 0.58354, throughput: 1312.74 | 2022-05-21 12:08:33.925 [rank:5] [train], epoch: 19/50, iter: 100/834, loss: 0.33119, top1: 0.59016, throughput: 1330.36 | 2022-05-21 12:08:33.926 [rank:2] [train], epoch: 19/50, iter: 100/834, loss: 0.33344, top1: 0.58854, throughput: 1305.31 | 2022-05-21 12:08:33.926 [rank:0] [train], epoch: 19/50, iter: 100/834, loss: 0.32965, top1: 0.59583, throughput: 1299.18 | 2022-05-21 12:08:33.926 [rank:7] [train], epoch: 19/50, iter: 200/834, loss: 0.33153, top1: 0.58594, throughput: 1331.60 | 2022-05-21 12:08:48.343 [rank:6] [train], epoch: 19/50, iter: 200/834, loss: 0.33156, top1: 0.59146, throughput: 1331.54 | 2022-05-21 12:08:48.343 [rank:2] [train], epoch: 19/50, iter: 200/834, loss: 0.33270, top1: 0.58807, throughput: 1331.76 | 2022-05-21 12:08:48.343 [rank:4] [train], epoch: 19/50, iter: 200/834, loss: 0.33402, top1: 0.58604, throughput: 1331.45[rank:0] [train], epoch: 19/50, iter: 200/834, loss: 0.33253, top1: 0.59115, throughput: 1331.70 | 2022-05-21 12:08:48.344 | 2022-05-21 12:08:48.343 [rank:5] [train], epoch: 19/50, iter: 200/834, loss: 0.33275, top1: 0.59036, throughput: 1331.60 | 2022-05-21 12:08:48.345 [rank:3] [train], epoch: 19/50, iter: 200/834, loss: 0.33295, top1: 0.58766, throughput: 1331.50 | 2022-05-21 12:08:48.345 [rank:1] [train], epoch: 19/50, iter: 200/834, loss: 0.33384, top1: 0.59021, throughput: 1331.44 | 2022-05-21 12:08:48.345 [rank:3] [train], epoch: 19/50, iter: 300/834, loss: 0.33113, top1: 0.59448, throughput: 1322.15 | 2022-05-21 12:09:02.867 [rank:2] [train], epoch: 19/50, iter: 300/834, loss: 0.33740, top1: 0.58438, throughput: 1322.06 | 2022-05-21 12:09:02.866 [rank:6] [train], epoch: 19/50, iter: 300/834, loss: 0.33377, top1: 0.58698, throughput: 1321.88[rank:7] [train], epoch: 19/50, iter: 300/834, loss: 0.33074, top1: 0.59495, throughput: 1321.86 | 2022-05-21 12:09:02.868| 2022-05-21 12:09:02.868 [rank:4] [train], epoch: 19/50, iter: 300/834, loss: 0.33292, top1: 0.58911, throughput: 1321.93 | 2022-05-21 12:09:02.868 [rank:5] [train], epoch: 19/50, iter: 300/834, loss: 0.33579, top1: 0.58146, throughput: 1322.01 | 2022-05-21 12:09:02.868 [rank:0] [train], epoch: 19/50, iter: 300/834, loss: 0.33445, top1: 0.58703, throughput: 1321.99 | 2022-05-21 12:09:02.867 [rank:1] [train], epoch: 19/50, iter: 300/834, loss: 0.33261, top1: 0.58922, throughput: 1322.02 | 2022-05-21 12:09:02.868 [rank:0] [train], epoch: 19/50, iter: 400/834, loss: 0.33394, top1: 0.58510, throughput: 1328.45 | 2022-05-21 12:09:17.320 [rank:3] [train], epoch: 19/50, iter: 400/834, loss: 0.33096, top1: 0.59479, throughput: 1328.53 | 2022-05-21 12:09:17.319 [rank:7] [train], epoch: 19/50, iter: 400/834, loss: 0.33625, top1: 0.58323, throughput: 1328.61 | 2022-05-21 12:09:17.319 [rank:6] [train], epoch: 19/50, iter: 400/834, loss: 0.33330, top1: 0.58547, throughput: 1328.58 | 2022-05-21 12:09:17.320 [rank:4] [train], epoch: 19/50, iter: 400/834, loss: 0.33489, top1: 0.58797, throughput: 1328.58 | 2022-05-21 12:09:17.319 [rank:5] [train], epoch: 19/50, iter: 400/834, loss: 0.33466, top1: 0.58370, throughput: 1328.46 | 2022-05-21 12:09:17.321 [rank:2] [train], epoch: 19/50, iter: 400/834, loss: 0.33624, top1: 0.58516, throughput: 1328.30 | 2022-05-21 12:09:17.320 [rank:1] [train], epoch: 19/50, iter: 400/834, loss: 0.33729, top1: 0.58099, throughput: 1328.45 | 2022-05-21 12:09:17.321 [rank:7] [train], epoch: 19/50, iter: 500/834, loss: 0.33470, top1: 0.58578, throughput: 1329.15 | 2022-05-21 12:09:31.764 [rank:4] [train], epoch: 19/50, iter: 500/834, loss: 0.33364, top1: 0.58771, throughput: 1329.14 | 2022-05-21 12:09:31.765 [rank:5] [train], epoch: 19/50, iter: 500/834, loss: 0.33482, top1: 0.59078, throughput: 1329.20 | 2022-05-21 12:09:31.766 [rank:6] [train], epoch: 19/50, iter: 500/834, loss: 0.33551, top1: 0.58073, throughput: 1329.12 | 2022-05-21 12:09:31.765 [rank:3] [train], epoch: 19/50, iter: 500/834, loss: 0.33547, top1: 0.57911, throughput: 1329.04 | 2022-05-21 12:09:31.765 [rank:0] [train], epoch: 19/50, iter: 500/834, loss: 0.33415, top1: 0.58635, throughput: 1329.14 | 2022-05-21 12:09:31.765 [rank:1] [train], epoch: 19/50, iter: 500/834, loss: 0.33536, top1: 0.58312, throughput: 1329.25 | 2022-05-21 12:09:31.765 [rank:2] [train], epoch: 19/50, iter: 500/834, loss: 0.33245, top1: 0.59167, throughput: 1329.17 | 2022-05-21 12:09:31.765 [rank:1] [train], epoch: 19/50, iter: 600/834, loss: 0.33723, top1: 0.58089, throughput: 1328.02[rank:6] [train], epoch: 19/50, iter: 600/834, loss: 0.33499, top1: 0.58760, throughput: 1328.01 | 2022-05-21 12:09:46.223| 2022-05-21 12:09:46.223 [rank:7] [train], epoch: 19/50, iter: 600/834, loss: 0.33429, top1: 0.58932, throughput: 1327.94 | 2022-05-21 12:09:46.223 [rank:3] [train], epoch: 19/50, iter: 600/834, loss: 0.33644, top1: 0.58141, throughput: 1327.83 | 2022-05-21 12:09:46.225 [rank:4] [train], epoch: 19/50, iter: 600/834, loss: 0.33345, top1: 0.59000, throughput: 1327.87 | 2022-05-21 12:09:46.224[rank:5] [train], epoch: 19/50, iter: 600/834, loss: 0.33787, top1: 0.57932, throughput: 1327.92 | 2022-05-21 12:09:46.224 [rank:2] [train], epoch: 19/50, iter: 600/834, loss: 0.33490, top1: 0.58417, throughput: 1327.80 | 2022-05-21 12:09:46.225 [rank:0] [train], epoch: 19/50, iter: 600/834, loss: 0.33466, top1: 0.58536, throughput: 1327.80 | 2022-05-21 12:09:46.225 [rank:5] [train], epoch: 19/50, iter: 700/834, loss: 0.33689, top1: 0.58365, throughput: 1327.29 | 2022-05-21 12:10:00.690 [rank:7] [train], epoch: 19/50, iter: 700/834, loss: 0.33516, top1: 0.57917, throughput: 1327.35[rank:6] [train], epoch: 19/50, iter: 700/834, loss: 0.33464, top1: 0.58583, throughput: 1327.27 | 2022-05-21 12:10:00.689 [rank:1] [train], epoch: 19/50, iter: 700/834, loss: 0.33403, top1: 0.58958, throughput: 1327.08 | 2022-05-21 12:10:00.691 | 2022-05-21 12:10:00.688 [rank:0] [train], epoch: 19/50, iter: 700/834, loss: 0.33578, top1: 0.58411, throughput: 1327.49 | 2022-05-21 12:10:00.688 [rank:4] [train], epoch: 19/50, iter: 700/834, loss: 0.33562, top1: 0.58599, throughput: 1327.26 | 2022-05-21 12:10:00.690 [rank:3] [train], epoch: 19/50, iter: 700/834, loss: 0.33656, top1: 0.58500, throughput: 1327.34 | 2022-05-21 12:10:00.690 [rank:2] [train], epoch: 19/50, iter: 700/834, loss: 0.33363, top1: 0.58734, throughput: 1327.48 | 2022-05-21 12:10:00.689 [rank:4] [train], epoch: 19/50, iter: 800/834, loss: 0.33634, top1: 0.58516, throughput: 1326.35 | 2022-05-21 12:10:15.166 [rank:5] [train], epoch: 19/50, iter: 800/834, loss: 0.33189, top1: 0.59344, throughput: 1326.38 | 2022-05-21 12:10:15.165 [rank:0] [train], epoch: 19/50, iter: 800/834, loss: 0.33739, top1: 0.57927, throughput: 1326.19 | 2022-05-21 12:10:15.166 [rank:3] [train], epoch: 19/50, iter: 800/834, loss: 0.33317, top1: 0.58583, throughput: 1326.16 | 2022-05-21 12:10:15.168 [rank:2] [train], epoch: 19/50, iter: 800/834, loss: 0.33359, top1: 0.58932, throughput: 1326.27[rank:7] [train], epoch: 19/50, iter: 800/834, loss: 0.33380, top1: 0.58198, throughput: 1325.97 | 2022-05-21 12:10:15.168 | 2022-05-21 12:10:15.165 [rank:1] [train], epoch: 19/50, iter: 800/834, loss: 0.33455, top1: 0.58500, throughput: 1326.35 | 2022-05-21 12:10:15.166 [rank:6] [train], epoch: 19/50, iter: 800/834, loss: 0.33437, top1: 0.58755, throughput: 1326.03 | 2022-05-21 12:10:15.168 [rank:4] [train], epoch: 19/50, iter: 834/834, loss: 0.33352, top1: 0.58885, throughput: 1322.78 | 2022-05-21 12:10:20.101 [rank:6] [train], epoch: 19/50, iter: 834/834, loss: 0.33514, top1: 0.58241, throughput: 1323.36 | 2022-05-21 12:10:20.101 [rank:7] [train], epoch: 19/50, iter: 834/834, loss: 0.33654, top1: 0.58318, throughput: 1323.24 | 2022-05-21 12:10:20.101 [rank:3] [train], epoch: 19/50, iter: 834/834, loss: 0.33775, top1: 0.57843, throughput: 1323.13 | 2022-05-21 12:10:20.101 [rank:5] [train], epoch: 19/50, iter: 834/834, loss: 0.33653, top1: 0.57981, throughput: 1322.64 | 2022-05-21 12:10:20.101 [rank:2] [train], epoch: 19/50, iter: 834/834, loss: 0.33388, top1: 0.58487, throughput: 1322.59 | 2022-05-21 12:10:20.101 [rank:1] [train], epoch: 19/50, iter: 834/834, loss: 0.33283, top1: 0.58318, throughput: 1322.67 | 2022-05-21 12:10:20.102 [rank:0] [train], epoch: 19/50, iter: 834/834, loss: 0.33707, top1: 0.57935, throughput: 1322.54 | 2022-05-21 12:10:20.102 [rank:0] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.61952, throughput: 578.01 | 2022-05-21 12:10:30.915 [rank:7] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.60432, throughput: 574.98 | 2022-05-21 12:10:30.971 [rank:6] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.60528, throughput: 568.92 | 2022-05-21 12:10:31.087 [rank:3] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.59728, throughput: 566.80 | 2022-05-21 12:10:31.128 [rank:2] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.59568, throughput: 566.45 | 2022-05-21 12:10:31.135 [rank:1] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.60832, throughput: 564.12 | 2022-05-21 12:10:31.181 [rank:4] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.59136, throughput: 563.26 | 2022-05-21 12:10:31.197 [rank:5] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.59696, throughput: 558.52 | 2022-05-21 12:10:31.291 [rank:2] [train], epoch: 20/50, iter: 100/834, loss: 0.32878, top1: 0.59490, throughput: 1308.69 | 2022-05-21 12:10:45.806 [rank:4] [train], epoch: 20/50, iter: 100/834, loss: 0.33064, top1: 0.59573, throughput: 1314.17 | 2022-05-21 12:10:45.807 [rank:5] [train], epoch: 20/50, iter: 100/834, loss: 0.33150, top1: 0.59255, throughput: 1322.61 | 2022-05-21 12:10:45.808 [rank:0] [train], epoch: 20/50, iter: 100/834, loss: 0.32711, top1: 0.59766, throughput: 1289.17 | 2022-05-21 12:10:45.808 [rank:1] [train], epoch: 20/50, iter: 100/834, loss: 0.33126, top1: 0.59276, throughput: 1312.62 | 2022-05-21 12:10:45.808 [rank:3] [train], epoch: 20/50, iter: 100/834, loss: 0.32904, top1: 0.59589, throughput: 1307.86 | 2022-05-21 12:10:45.809 [rank:6] [train], epoch: 20/50, iter: 100/834, loss: 0.33029, top1: 0.59536, throughput: 1304.14 | 2022-05-21 12:10:45.809[rank:7] [train], epoch: 20/50, iter: 100/834, loss: 0.32750, top1: 0.60073, throughput: 1294.00 | 2022-05-21 12:10:45.809 [rank:4] [train], epoch: 20/50, iter: 200/834, loss: 0.33032, top1: 0.59250, throughput: 1329.91 | 2022-05-21 12:11:00.244 [rank:6] [train], epoch: 20/50, iter: 200/834, loss: 0.33046, top1: 0.59078, throughput: 1330.11 | 2022-05-21 12:11:00.244 [rank:2] [train], epoch: 20/50, iter: 200/834, loss: 0.32977, top1: 0.59734, throughput: 1329.82 | 2022-05-21 12:11:00.244 [rank:0] [train], epoch: 20/50, iter: 200/834, loss: 0.33245, top1: 0.59411, throughput: 1329.98 | 2022-05-21 12:11:00.244 [rank:3] [train], epoch: 20/50, iter: 200/834, loss: 0.33152, top1: 0.58531, throughput: 1330.09 | 2022-05-21 12:11:00.244 [rank:1] [train], epoch: 20/50, iter: 200/834, loss: 0.32845, top1: 0.59792, throughput: 1330.12 | 2022-05-21 12:11:00.243 [rank:5] [train], epoch: 20/50, iter: 200/834, loss: 0.32742, top1: 0.59891, throughput: 1329.92 | 2022-05-21 12:11:00.245 [rank:7] [train], epoch: 20/50, iter: 200/834, loss: 0.32733, top1: 0.59990, throughput: 1330.07 | 2022-05-21 12:11:00.244 [rank:4] [train], epoch: 20/50, iter: 300/834, loss: 0.33008, top1: 0.59167, throughput: 1328.72 | 2022-05-21 12:11:14.694 [rank:7] [train], epoch: 20/50, iter: 300/834, loss: 0.32949, top1: 0.59271, throughput: 1328.76 | 2022-05-21 12:11:14.694 [rank:5] [train], epoch: 20/50, iter: 300/834, loss: 0.33142, top1: 0.58948, throughput: 1328.69 | 2022-05-21 12:11:14.695 [rank:3] [train], epoch: 20/50, iter: 300/834, loss: 0.33217, top1: 0.58979, throughput: 1328.47 | 2022-05-21 12:11:14.696 [rank:1] [train], epoch: 20/50, iter: 300/834, loss: 0.32675, top1: 0.60031, throughput: 1328.62 | 2022-05-21 12:11:14.694 [rank:6] [train], epoch: 20/50, iter: 300/834, loss: 0.33013, top1: 0.59406, throughput: 1328.52 | 2022-05-21 12:11:14.696 [rank:2] [train], epoch: 20/50, iter: 300/834, loss: 0.33039, top1: 0.59266, throughput: 1328.59 | 2022-05-21 12:11:14.696 [rank:0] [train], epoch: 20/50, iter: 300/834, loss: 0.32932, top1: 0.59583, throughput: 1328.54 | 2022-05-21 12:11:14.696 [rank:5] [train], epoch: 20/50, iter: 400/834, loss: 0.33014, top1: 0.59432, throughput: 1328.19 | 2022-05-21 12:11:29.151 [rank:7] [train], epoch: 20/50, iter: 400/834, loss: 0.33252, top1: 0.58698, throughput: 1328.02 | 2022-05-21 12:11:29.151 [rank:3] [train], epoch: 20/50, iter: 400/834, loss: 0.33011, top1: 0.59245, throughput: 1328.24 | 2022-05-21 12:11:29.152 [rank:1] [train], epoch: 20/50, iter: 400/834, loss: 0.32980, top1: 0.59125, throughput: 1328.01 | 2022-05-21 12:11:29.152 [rank:6] [train], epoch: 20/50, iter: 400/834, loss: 0.32931, top1: 0.59630, throughput: 1328.15 | 2022-05-21 12:11:29.152 [rank:0] [train], epoch: 20/50, iter: 400/834, loss: 0.33183, top1: 0.58938, throughput: 1328.22 | 2022-05-21 12:11:29.152 [rank:2] [train], epoch: 20/50, iter: 400/834, loss: 0.33062, top1: 0.59479, throughput: 1327.98 | 2022-05-21 12:11:29.154 [rank:4] [train], epoch: 20/50, iter: 400/834, loss: 0.32857, top1: 0.59578, throughput: 1327.83 | 2022-05-21 12:11:29.154 [rank:7] [train], epoch: 20/50, iter: 500/834, loss: 0.33169, top1: 0.59214, throughput: 1328.16 | 2022-05-21 12:11:43.607 [rank:5] [train], epoch: 20/50, iter: 500/834, loss: 0.33275, top1: 0.58812, throughput: 1328.04 | 2022-05-21 12:11:43.609 [rank:1] [train], epoch: 20/50, iter: 500/834, loss: 0.32994, top1: 0.59141, throughput: 1328.15 | 2022-05-21 12:11:43.608 [rank:2] [train], epoch: 20/50, iter: 500/834, loss: 0.32955, top1: 0.59219, throughput: 1328.35 | 2022-05-21 12:11:43.608 [rank:3] [train], epoch: 20/50, iter: 500/834, loss: 0.33395, top1: 0.58781, throughput: 1327.97 | 2022-05-21 12:11:43.610 [rank:0] [train], epoch: 20/50, iter: 500/834, loss: 0.33211, top1: 0.58812, throughput: 1328.00 | 2022-05-21 12:11:43.610 [rank:6] [train], epoch: 20/50, iter: 500/834, loss: 0.33196, top1: 0.59005, throughput: 1327.80 | 2022-05-21 12:11:43.612 [rank:4] [train], epoch: 20/50, iter: 500/834, loss: 0.33051, top1: 0.59104, throughput: 1327.92 | 2022-05-21 12:11:43.612 [rank:5] [train], epoch: 20/50, iter: 600/834, loss: 0.33065, top1: 0.59193, throughput: 1324.95 | 2022-05-21 12:11:58.100 [rank:4] [train], epoch: 20/50, iter: 600/834, loss: 0.33101, top1: 0.59135, throughput: 1325.25 | 2022-05-21 12:11:58.100 [rank:7] [train], epoch: 20/50, iter: 600/834, loss: 0.33103, top1: 0.59057, throughput: 1324.81 | 2022-05-21 12:11:58.100 [rank:2] [train], epoch: 20/50, iter: 600/834, loss: 0.33228, top1: 0.58698, throughput: 1324.81 | 2022-05-21 12:11:58.100 [rank:6] [train], epoch: 20/50, iter: 600/834, loss: 0.33151, top1: 0.59005, throughput: 1325.06 | 2022-05-21 12:11:58.102 [rank:3] [train], epoch: 20/50, iter: 600/834, loss: 0.33226, top1: 0.58958, throughput: 1324.86 | 2022-05-21 12:11:58.102 [rank:0] [train], epoch: 20/50, iter: 600/834, loss: 0.33602, top1: 0.58422, throughput: 1324.90 | 2022-05-21 12:11:58.101 [rank:1] [train], epoch: 20/50, iter: 600/834, loss: 0.33583, top1: 0.58583, throughput: 1324.73 | 2022-05-21 12:11:58.102 [rank:3] [train], epoch: 20/50, iter: 700/834, loss: 0.32970, top1: 0.59630, throughput: 1328.51 | 2022-05-21 12:12:12.554 [rank:4] [train], epoch: 20/50, iter: 700/834, loss: 0.33260, top1: 0.58531, throughput: 1328.38 | 2022-05-21 12:12:12.554 [rank:7] [train], epoch: 20/50, iter: 700/834, loss: 0.33093, top1: 0.59219, throughput: 1328.19 | 2022-05-21 12:12:12.556 [rank:6] [train], epoch: 20/50, iter: 700/834, loss: 0.33286, top1: 0.58630, throughput: 1328.43 | 2022-05-21 12:12:12.555 [rank:5] [train], epoch: 20/50, iter: 700/834, loss: 0.33158, top1: 0.59661, throughput: 1328.16 | 2022-05-21 12:12:12.556 [rank:0] [train], epoch: 20/50, iter: 700/834, loss: 0.33167, top1: 0.58969, throughput: 1328.33 | 2022-05-21 12:12:12.555 [rank:2] [train], epoch: 20/50, iter: 700/834, loss: 0.33296, top1: 0.59104, throughput: 1328.24 | 2022-05-21 12:12:12.556 [rank:1] [train], epoch: 20/50, iter: 700/834, loss: 0.33263, top1: 0.58781, throughput: 1328.24 | 2022-05-21 12:12:12.557 [rank:5] [train], epoch: 20/50, iter: 800/834, loss: 0.33201, top1: 0.58672, throughput: 1327.93 | 2022-05-21 12:12:27.014 [rank:3] [train], epoch: 20/50, iter: 800/834, loss: 0.33355, top1: 0.58786, throughput: 1327.83 | 2022-05-21 12:12:27.014 [rank:1] [train], epoch: 20/50, iter: 800/834, loss: 0.33397, top1: 0.58526, throughput: 1328.06 | 2022-05-21 12:12:27.014 [rank:2] [train], epoch: 20/50, iter: 800/834, loss: 0.33152, top1: 0.58958, throughput: 1327.97 | 2022-05-21 12:12:27.014 [rank:7] [train], epoch: 20/50, iter: 800/834, loss: 0.33492, top1: 0.59021, throughput: 1327.83 | 2022-05-21 12:12:27.015 [rank:6] [train], epoch: 20/50, iter: 800/834, loss: 0.33257, top1: 0.58797, throughput: 1327.70 | 2022-05-21 12:12:27.016 [rank:4] [train], epoch: 20/50, iter: 800/834, loss: 0.33264, top1: 0.59104, throughput: 1327.59 | 2022-05-21 12:12:27.016 [rank:0] [train], epoch: 20/50, iter: 800/834, loss: 0.33118, top1: 0.58948, throughput: 1327.73 | 2022-05-21 12:12:27.016 [rank:3] [train], epoch: 20/50, iter: 834/834, loss: 0.33542, top1: 0.59053, throughput: 1325.07 | 2022-05-21 12:12:31.940 [rank:5] [train], epoch: 20/50, iter: 834/834, loss: 0.33156, top1: 0.59390, throughput: 1325.23 | 2022-05-21 12:12:31.940 [rank:7] [train], epoch: 20/50, iter: 834/834, loss: 0.33668, top1: 0.58195, throughput: 1325.44 | 2022-05-21 12:12:31.941 [rank:2] [train], epoch: 20/50, iter: 834/834, loss: 0.33517, top1: 0.58211, throughput: 1325.03 | 2022-05-21 12:12:31.940 [rank:6] [train], epoch: 20/50, iter: 834/834, loss: 0.33255, top1: 0.58885, throughput: 1325.57[rank:0] [train], epoch: 20/50, iter: 834/834, loss: 0.33691, top1: 0.57874, throughput: 1325.51 | 2022-05-21 12:12:31.941| 2022-05-21 12:12:31.941 [rank:4] [train], epoch: 20/50, iter: 834/834, loss: 0.32843, top1: 0.59498, throughput: 1325.36 | 2022-05-21 12:12:31.942 [rank:1] [train], epoch: 20/50, iter: 834/834, loss: 0.32845, top1: 0.59957, throughput: 1324.53 | 2022-05-21 12:12:31.943 [rank:7] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.58384, throughput: 579.32 | 2022-05-21 12:12:42.729 [rank:0] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.58624, throughput: 579.31 | 2022-05-21 12:12:42.730 [rank:2] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.57136, throughput: 572.42 | 2022-05-21 12:12:42.859 [rank:6] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.56800, throughput: 571.96 | 2022-05-21 12:12:42.868 [rank:4] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.57920, throughput: 570.66 | 2022-05-21 12:12:42.894 [rank:3] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.57904, throughput: 570.19 | 2022-05-21 12:12:42.902 [rank:1] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.58416, throughput: 561.31 | 2022-05-21 12:12:43.077 [rank:5] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.56560, throughput: 560.70 | 2022-05-21 12:12:43.087 [rank:2] [train], epoch: 21/50, iter: 100/834, loss: 0.32586, top1: 0.60125, throughput: 1310.61 | 2022-05-21 12:12:57.509 [rank:1] [train], epoch: 21/50, iter: 100/834, loss: 0.32621, top1: 0.59844, throughput: 1330.42 | 2022-05-21 12:12:57.509 [rank:3] [train], epoch: 21/50, iter: 100/834, loss: 0.32607, top1: 0.60266, throughput: 1314.43 | 2022-05-21 12:12:57.509 [rank:7] [train], epoch: 21/50, iter: 100/834, loss: 0.32148, top1: 0.61573, throughput: 1299.08 | 2022-05-21 12:12:57.509 [rank:4] [train], epoch: 21/50, iter: 100/834, loss: 0.32547, top1: 0.60313, throughput: 1313.58 | 2022-05-21 12:12:57.510 [rank:0] [train], epoch: 21/50, iter: 100/834, loss: 0.32873, top1: 0.59422, throughput: 1299.12 | 2022-05-21 12:12:57.509 [rank:6] [train], epoch: 21/50, iter: 100/834, loss: 0.32986, top1: 0.59667, throughput: 1310.95 | 2022-05-21 12:12:57.514 [rank:5] [train], epoch: 21/50, iter: 100/834, loss: 0.32510, top1: 0.59948, throughput: 1330.91 | 2022-05-21 12:12:57.513 [rank:6] [train], epoch: 21/50, iter: 200/834, loss: 0.32818, top1: 0.60042, throughput: 1327.68 | 2022-05-21 12:13:11.976 [rank:7] [train], epoch: 21/50, iter: 200/834, loss: 0.32852, top1: 0.59536, throughput: 1327.24 | 2022-05-21 12:13:11.975 [rank:2] [train], epoch: 21/50, iter: 200/834, loss: 0.32636, top1: 0.60135, throughput: 1327.25 | 2022-05-21 12:13:11.975 [rank:5] [train], epoch: 21/50, iter: 200/834, loss: 0.32644, top1: 0.59875, throughput: 1327.56 | 2022-05-21 12:13:11.976 [rank:3] [train], epoch: 21/50, iter: 200/834, loss: 0.32454, top1: 0.60219, throughput: 1326.74 | 2022-05-21 12:13:11.980 [rank:4] [train], epoch: 21/50, iter: 200/834, loss: 0.32850, top1: 0.59635, throughput: 1327.14 | 2022-05-21 12:13:11.978 [rank:0] [train], epoch: 21/50, iter: 200/834, loss: 0.33092, top1: 0.58776, throughput: 1327.02 | 2022-05-21 12:13:11.977 [rank:1] [train], epoch: 21/50, iter: 200/834, loss: 0.32738, top1: 0.59604, throughput: 1326.88 | 2022-05-21 12:13:11.979 [rank:5] [train], epoch: 21/50, iter: 300/834, loss: 0.32710, top1: 0.60125, throughput: 1331.16 | 2022-05-21 12:13:26.399 [rank:2] [train], epoch: 21/50, iter: 300/834, loss: 0.32413, top1: 0.60344, throughput: 1331.05 | 2022-05-21 12:13:26.399 [rank:7] [train], epoch: 21/50, iter: 300/834, loss: 0.33008, top1: 0.59146, throughput: 1330.86 | 2022-05-21 12:13:26.401 [rank:3] [train], epoch: 21/50, iter: 300/834, loss: 0.32454, top1: 0.60609, throughput: 1331.31 | 2022-05-21 12:13:26.402 [rank:0] [train], epoch: 21/50, iter: 300/834, loss: 0.32792, top1: 0.59901, throughput: 1331.04 | 2022-05-21 12:13:26.402 [rank:6] [train], epoch: 21/50, iter: 300/834, loss: 0.32713, top1: 0.59625, throughput: 1330.88 | 2022-05-21 12:13:26.402 [rank:4] [train], epoch: 21/50, iter: 300/834, loss: 0.32699, top1: 0.59156, throughput: 1331.01 | 2022-05-21 12:13:26.403 [rank:1] [train], epoch: 21/50, iter: 300/834, loss: 0.32811, top1: 0.59453, throughput: 1331.15 | 2022-05-21 12:13:26.402 [rank:0] [train], epoch: 21/50, iter: 400/834, loss: 0.32888, top1: 0.60156, throughput: 1328.12 | 2022-05-21 12:13:40.859 [rank:4] [train], epoch: 21/50, iter: 400/834, loss: 0.32770, top1: 0.59490, throughput: 1328.10 | 2022-05-21 12:13:40.859 [rank:6] [train], epoch: 21/50, iter: 400/834, loss: 0.32992, top1: 0.59130, throughput: 1328.08 | 2022-05-21 12:13:40.859 [rank:3] [train], epoch: 21/50, iter: 400/834, loss: 0.32898, top1: 0.59490, throughput: 1327.96 | 2022-05-21 12:13:40.860 [rank:7] [train], epoch: 21/50, iter: 400/834, loss: 0.32616, top1: 0.59818, throughput: 1328.02 | 2022-05-21 12:13:40.859 [rank:2] [train], epoch: 21/50, iter: 400/834, loss: 0.32904, top1: 0.59563, throughput: 1327.78 | 2022-05-21 12:13:40.859 [rank:1] [train], epoch: 21/50, iter: 400/834, loss: 0.32635, top1: 0.60094, throughput: 1327.99 | 2022-05-21 12:13:40.860 [rank:5] [train], epoch: 21/50, iter: 400/834, loss: 0.32945, top1: 0.59849, throughput: 1327.64 | 2022-05-21 12:13:40.861 [rank:5] [train], epoch: 21/50, iter: 500/834, loss: 0.32707, top1: 0.59776, throughput: 1329.14 | 2022-05-21 12:13:55.307 [rank:3] [train], epoch: 21/50, iter: 500/834, loss: 0.32680, top1: 0.59464, throughput: 1329.04 | 2022-05-21 12:13:55.307 [rank:7] [train], epoch: 21/50, iter: 500/834, loss: 0.32702, top1: 0.60089, throughput: 1328.94 [rank:4] [train], epoch: 21/50, iter: 500/834, loss: 0.32837, top1: 0.59573, throughput: 1328.94| 2022-05-21 12:13:55.307 | 2022-05-21 12:13:55.307 [rank:6] [train], epoch: 21/50, iter: 500/834, loss: 0.33175, top1: 0.59307, throughput: 1328.83 | 2022-05-21 12:13:55.308 [rank:2] [train], epoch: 21/50, iter: 500/834, loss: 0.32860, top1: 0.59755, throughput: 1328.90 | 2022-05-21 12:13:55.308 [rank:0] [train], epoch: 21/50, iter: 500/834, loss: 0.32947, top1: 0.59302, throughput: 1328.81 | 2022-05-21 12:13:55.308 [rank:1] [train], epoch: 21/50, iter: 500/834, loss: 0.32855, top1: 0.59891, throughput: 1328.83 | 2022-05-21 12:13:55.309 [rank:5] [train], epoch: 21/50, iter: 600/834, loss: 0.32860, top1: 0.60078, throughput: 1326.52 | 2022-05-21 12:14:09.780 [rank:4] [train], epoch: 21/50, iter: 600/834, loss: 0.33039, top1: 0.59865, throughput: 1326.48 | 2022-05-21 12:14:09.781 [rank:7] [train], epoch: 21/50, iter: 600/834, loss: 0.33063, top1: 0.59646, throughput: 1326.43 | 2022-05-21 12:14:09.782 [rank:6] [train], epoch: 21/50, iter: 600/834, loss: 0.32762, top1: 0.59771, throughput: 1326.55 | 2022-05-21 12:14:09.782 [rank:0] [train], epoch: 21/50, iter: 600/834, loss: 0.32778, top1: 0.59510, throughput: 1326.54 | 2022-05-21 12:14:09.782 [rank:2] [train], epoch: 21/50, iter: 600/834, loss: 0.32557, top1: 0.60365, throughput: 1326.54 | 2022-05-21 12:14:09.781 [rank:3] [train], epoch: 21/50, iter: 600/834, loss: 0.32811, top1: 0.59792, throughput: 1326.26 | 2022-05-21 12:14:09.784 [rank:1] [train], epoch: 21/50, iter: 600/834, loss: 0.33001, top1: 0.59432, throughput: 1326.48 | 2022-05-21 12:14:09.784 [rank:5] [train], epoch: 21/50, iter: 700/834, loss: 0.32947, top1: 0.59724, throughput: 1325.49 | 2022-05-21 12:14:24.266 [rank:6] [train], epoch: 21/50, iter: 700/834, loss: 0.32761, top1: 0.60005, throughput: 1325.40 | 2022-05-21 12:14:24.268 [rank:3] [train], epoch: 21/50, iter: 700/834, loss: 0.32987, top1: 0.59427, throughput: 1325.59 | 2022-05-21 12:14:24.268 [rank:4] [train], epoch: 21/50, iter: 700/834, loss: 0.32581, top1: 0.60141, throughput: 1325.40 | 2022-05-21 12:14:24.268 [rank:1] [train], epoch: 21/50, iter: 700/834, loss: 0.32989, top1: 0.59719, throughput: 1325.58 | 2022-05-21 12:14:24.268 [rank:0] [train], epoch: 21/50, iter: 700/834, loss: 0.33192, top1: 0.59151, throughput: 1325.37 | 2022-05-21 12:14:24.268 [rank:7] [train], epoch: 21/50, iter: 700/834, loss: 0.32933, top1: 0.59187, throughput: 1325.43 | 2022-05-21 12:14:24.267 [rank:2] [train], epoch: 21/50, iter: 700/834, loss: 0.33061, top1: 0.58937, throughput: 1325.50 | 2022-05-21 12:14:24.266 [rank:2] [train], epoch: 21/50, iter: 800/834, loss: 0.32821, top1: 0.59359, throughput: 1328.86 | 2022-05-21 12:14:38.715 [rank:0] [train], epoch: 21/50, iter: 800/834, loss: 0.33004, top1: 0.59292, throughput: 1328.90 | 2022-05-21 12:14:38.716 [rank:4] [train], epoch: 21/50, iter: 800/834, loss: 0.32927, top1: 0.59724, throughput: 1328.74 | 2022-05-21 12:14:38.717 [rank:5] [train], epoch: 21/50, iter: 800/834, loss: 0.32844, top1: 0.59563, throughput: 1328.59 | 2022-05-21 12:14:38.717 [rank:3] [train], epoch: 21/50, iter: 800/834, loss: 0.32717, top1: 0.59474, throughput: 1328.87 | 2022-05-21 12:14:38.716 [rank:7] [train], epoch: 21/50, iter: 800/834, loss: 0.32848, top1: 0.59755, throughput: 1328.79 | 2022-05-21 12:14:38.717 [rank:1] [train], epoch: 21/50, iter: 800/834, loss: 0.32953, top1: 0.59943, throughput: 1328.74 | 2022-05-21 12:14:38.718 [rank:6] [train], epoch: 21/50, iter: 800/834, loss: 0.32686, top1: 0.59823, throughput: 1328.72 | 2022-05-21 12:14:38.718 [rank:1] [train], epoch: 21/50, iter: 834/834, loss: 0.32540, top1: 0.60110, throughput: 1326.17 | 2022-05-21 12:14:43.640 [rank:0] [train], epoch: 21/50, iter: 834/834, loss: 0.33247, top1: 0.59145, throughput: 1325.47 | 2022-05-21 12:14:43.641[rank:6] [train], epoch: 21/50, iter: 834/834, loss: 0.32712, top1: 0.59115, throughput: 1325.86 | 2022-05-21 12:14:43.641 [rank:3] [train], epoch: 21/50, iter: 834/834, loss: 0.32630, top1: 0.60218, throughput: 1325.51 | 2022-05-21 12:14:43.641 [rank:4] [train], epoch: 21/50, iter: 834/834, loss: 0.32621, top1: 0.59957, throughput: 1325.38 | 2022-05-21 12:14:43.643 [rank:2] [train], epoch: 21/50, iter: 834/834, loss: 0.32568, top1: 0.59911, throughput: 1324.53 | 2022-05-21 12:14:43.643 [rank:7] [train], epoch: 21/50, iter: 834/834, loss: 0.32854, top1: 0.59467, throughput: 1324.51 | 2022-05-21 12:14:43.645 [rank:5] [train], epoch: 21/50, iter: 834/834, loss: 0.32975, top1: 0.59161, throughput: 1324.61 | 2022-05-21 12:14:43.645 [rank:0] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.62560, throughput: 569.83 | 2022-05-21 12:14:54.609 [rank:7] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.61376, throughput: 569.67 | 2022-05-21 12:14:54.616 [rank:2] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.60960, throughput: 569.55 | 2022-05-21 12:14:54.617 [rank:3] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.60896, throughput: 568.45 | 2022-05-21 12:14:54.636 [rank:4] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.61696, throughput: 567.88 | 2022-05-21 12:14:54.649 [rank:6] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.61616, throughput: 566.73 | 2022-05-21 12:14:54.669 [rank:1] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.61936, throughput: 560.45 | 2022-05-21 12:14:54.792 [rank:5] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.60976, throughput: 555.61 | 2022-05-21 12:14:54.894 [rank:5] [train], epoch: 22/50, iter: 100/834, loss: 0.32362, top1: 0.60578, throughput: 1330.22 | 2022-05-21 12:15:09.328 [rank:4] [train], epoch: 22/50, iter: 100/834, loss: 0.32366, top1: 0.60469, throughput: 1307.93 | 2022-05-21 12:15:09.328 [rank:7] [train], epoch: 22/50, iter: 100/834, loss: 0.32652, top1: 0.59802, throughput: 1305.09 | 2022-05-21 12:15:09.328 [rank:6] [train], epoch: 22/50, iter: 100/834, loss: 0.32485, top1: 0.60234, throughput: 1309.68 | 2022-05-21 12:15:09.330 [rank:1] [train], epoch: 22/50, iter: 100/834, loss: 0.32057, top1: 0.60755, throughput: 1320.71 | 2022-05-21 12:15:09.329 [rank:2] [train], epoch: 22/50, iter: 100/834, loss: 0.32187, top1: 0.61057, throughput: 1305.03 | 2022-05-21 12:15:09.329 [rank:3] [train], epoch: 22/50, iter: 100/834, loss: 0.31905, top1: 0.61557, throughput: 1306.63 | 2022-05-21 12:15:09.330 [rank:0] [train], epoch: 22/50, iter: 100/834, loss: 0.32357, top1: 0.60818, throughput: 1304.19 | 2022-05-21 12:15:09.331 [rank:7] [train], epoch: 22/50, iter: 200/834, loss: 0.32552, top1: 0.60333, throughput: 1327.64 | 2022-05-21 12:15:23.790 [rank:0] [train], epoch: 22/50, iter: 200/834, loss: 0.32313, top1: 0.60469, throughput: 1327.99 | 2022-05-21 12:15:23.789 [rank:4] [train], epoch: 22/50, iter: 200/834, loss: 0.32519, top1: 0.60130, throughput: 1327.64 | 2022-05-21 12:15:23.790 [rank:5] [train], epoch: 22/50, iter: 200/834, loss: 0.32447, top1: 0.60563, throughput: 1327.60 | 2022-05-21 12:15:23.790 [rank:6] [train], epoch: 22/50, iter: 200/834, loss: 0.32788, top1: 0.59964, throughput: 1327.75 | 2022-05-21 12:15:23.790 [rank:1] [train], epoch: 22/50, iter: 200/834, loss: 0.32426, top1: 0.60490, throughput: 1327.83 | 2022-05-21 12:15:23.789 [rank:3] [train], epoch: 22/50, iter: 200/834, loss: 0.32557, top1: 0.60005, throughput: 1327.77 | 2022-05-21 12:15:23.791 [rank:2] [train], epoch: 22/50, iter: 200/834, loss: 0.32249, top1: 0.60740, throughput: 1327.68 | 2022-05-21 12:15:23.791 [rank:5] [train], epoch: 22/50, iter: 300/834, loss: 0.32217, top1: 0.60427, throughput: 1330.64 | 2022-05-21 12:15:38.219 [rank:7] [train], epoch: 22/50, iter: 300/834, loss: 0.32413, top1: 0.60625, throughput: 1330.58 | 2022-05-21 12:15:38.220 [rank:3] [train], epoch: 22/50, iter: 300/834, loss: 0.32597, top1: 0.59911, throughput: 1330.62 | 2022-05-21 12:15:38.220 [rank:2] [train], epoch: 22/50, iter: 300/834, loss: 0.32585, top1: 0.60646, throughput: 1330.65 | 2022-05-21 12:15:38.220 [rank:6] [train], epoch: 22/50, iter: 300/834, loss: 0.32561, top1: 0.60370, throughput: 1330.48 | 2022-05-21 12:15:38.221 [rank:4] [train], epoch: 22/50, iter: 300/834, loss: 0.32491, top1: 0.60583, throughput: 1330.44 | 2022-05-21 12:15:38.221 [rank:1] [train], epoch: 22/50, iter: 300/834, loss: 0.32419, top1: 0.60833, throughput: 1330.31 | 2022-05-21 12:15:38.222 [rank:0] [train], epoch: 22/50, iter: 300/834, loss: 0.32541, top1: 0.60276, throughput: 1330.33 | 2022-05-21 12:15:38.222 [rank:3] [train], epoch: 22/50, iter: 400/834, loss: 0.32554, top1: 0.59979, throughput: 1320.96 | 2022-05-21 12:15:52.755 [rank:1] [train], epoch: 22/50, iter: 400/834, loss: 0.32469, top1: 0.60224, throughput: 1321.14 | 2022-05-21 12:15:52.755 [rank:7] [train], epoch: 22/50, iter: 400/834, loss: 0.32524, top1: 0.59948, throughput: 1320.92 | 2022-05-21 12:15:52.755 [rank:2] [train], epoch: 22/50, iter: 400/834, loss: 0.32621, top1: 0.60146, throughput: 1320.80 | 2022-05-21 12:15:52.756 [rank:5] [train], epoch: 22/50, iter: 400/834, loss: 0.32343, top1: 0.60198, throughput: 1320.73 | 2022-05-21 12:15:52.757 [rank:4] [train], epoch: 22/50, iter: 400/834, loss: 0.32746, top1: 0.59719, throughput: 1320.86 | 2022-05-21 12:15:52.757 [rank:6] [train], epoch: 22/50, iter: 400/834, loss: 0.32292, top1: 0.60375, throughput: 1320.78 | 2022-05-21 12:15:52.758 [rank:0] [train], epoch: 22/50, iter: 400/834, loss: 0.32441, top1: 0.60375, throughput: 1321.04 | 2022-05-21 12:15:52.756 [rank:0] [train], epoch: 22/50, iter: 500/834, loss: 0.32587, top1: 0.60167, throughput: 1331.24 | 2022-05-21 12:16:07.178 [rank:4] [train], epoch: 22/50, iter: 500/834, loss: 0.32611, top1: 0.60021, throughput: 1331.33 | 2022-05-21 12:16:07.179 [rank:2] [train], epoch: 22/50, iter: 500/834, loss: 0.32621, top1: 0.60073, throughput: 1331.30[rank:7] [train], epoch: 22/50, iter: 500/834, loss: 0.32674, top1: 0.60443, throughput: 1331.05 | 2022-05-21 12:16:07.178 | 2022-05-21 12:16:07.180 [rank:5] [train], epoch: 22/50, iter: 500/834, loss: 0.32479, top1: 0.60646, throughput: 1331.30 | 2022-05-21 12:16:07.179 [rank:6] [train], epoch: 22/50, iter: 500/834, loss: 0.32463, top1: 0.60479, throughput: 1331.21 | 2022-05-21 12:16:07.181 [rank:1] [train], epoch: 22/50, iter: 500/834, loss: 0.32721, top1: 0.60120, throughput: 1331.00 | 2022-05-21 12:16:07.180 [rank:3] [train], epoch: 22/50, iter: 500/834, loss: 0.32665, top1: 0.59802, throughput: 1330.96 | 2022-05-21 12:16:07.181 [rank:5] [train], epoch: 22/50, iter: 600/834, loss: 0.32229, top1: 0.60818, throughput: 1321.79 | 2022-05-21 12:16:21.705 [rank:6] [train], epoch: 22/50, iter: 600/834, loss: 0.32814, top1: 0.60318, throughput: 1322.02 | 2022-05-21 12:16:21.704 [rank:4] [train], epoch: 22/50, iter: 600/834, loss: 0.32527, top1: 0.60245, throughput: 1321.84 | 2022-05-21 12:16:21.704 [rank:7] [train], epoch: 22/50, iter: 600/834, loss: 0.32477, top1: 0.60245, throughput: 1321.96 | 2022-05-21 12:16:21.704 [rank:2] [train], epoch: 22/50, iter: 600/834, loss: 0.32488, top1: 0.60297, throughput: 1321.78 | 2022-05-21 12:16:21.704 [rank:3] [train], epoch: 22/50, iter: 600/834, loss: 0.32660, top1: 0.59818, throughput: 1321.90 | 2022-05-21 12:16:21.705 [rank:0] [train], epoch: 22/50, iter: 600/834, loss: 0.32781, top1: 0.60068, throughput: 1321.78 | 2022-05-21 12:16:21.704 [rank:1] [train], epoch: 22/50, iter: 600/834, loss: 0.32488, top1: 0.60401, throughput: 1321.81 | 2022-05-21 12:16:21.705 [rank:6] [train], epoch: 22/50, iter: 700/834, loss: 0.32604, top1: 0.59781, throughput: 1329.52 | 2022-05-21 12:16:36.145 [rank:3] [train], epoch: 22/50, iter: 700/834, loss: 0.32839, top1: 0.59479, throughput: 1329.57 | 2022-05-21 12:16:36.146 [rank:5] [train], epoch: 22/50, iter: 700/834, loss: 0.32568, top1: 0.60318, throughput: 1329.53 | 2022-05-21 12:16:36.146 [rank:4] [train], epoch: 22/50, iter: 700/834, loss: 0.32439, top1: 0.60130, throughput: 1329.47 | 2022-05-21 12:16:36.146 [rank:0] [train], epoch: 22/50, iter: 700/834, loss: 0.32503, top1: 0.60198, throughput: 1329.51 | 2022-05-21 12:16:36.146 [rank:1] [train], epoch: 22/50, iter: 700/834, loss: 0.32347, top1: 0.60760, throughput: 1329.21 | 2022-05-21 12:16:36.150 [rank:7] [train], epoch: 22/50, iter: 700/834, loss: 0.32813, top1: 0.59521, throughput: 1329.27 | 2022-05-21 12:16:36.148 [rank:2] [train], epoch: 22/50, iter: 700/834, loss: 0.32453, top1: 0.60531, throughput: 1329.27 | 2022-05-21 12:16:36.148 [rank:6] [train], epoch: 22/50, iter: 800/834, loss: 0.32531, top1: 0.60125, throughput: 1328.79[rank:7] [train], epoch: 22/50, iter: 800/834, loss: 0.32733, top1: 0.59938, throughput: 1329.01 | 2022-05-21 12:16:50.595 | 2022-05-21 12:16:50.595 [rank:3] [train], epoch: 22/50, iter: 800/834, loss: 0.32668, top1: 0.60198, throughput: 1328.81 | 2022-05-21 12:16:50.595 [rank:1] [train], epoch: 22/50, iter: 800/834, loss: 0.32452, top1: 0.60865, throughput: 1329.26 | 2022-05-21 12:16:50.594 [rank:5] [train], epoch: 22/50, iter: 800/834, loss: 0.32855, top1: 0.59385, throughput: 1328.65 | 2022-05-21 12:16:50.597 [rank:4] [train], epoch: 22/50, iter: 800/834, loss: 0.32661, top1: 0.59859, throughput: 1328.68 | 2022-05-21 12:16:50.596 [rank:0] [train], epoch: 22/50, iter: 800/834, loss: 0.32254, top1: 0.61255, throughput: 1328.31 | 2022-05-21 12:16:50.600 [rank:2] [train], epoch: 22/50, iter: 800/834, loss: 0.32434, top1: 0.60583, throughput: 1328.57 | 2022-05-21 12:16:50.600 [rank:6] [train], epoch: 22/50, iter: 834/834, loss: 0.32602, top1: 0.60340, throughput: 1325.40 | 2022-05-21 12:16:55.520 [rank:1] [train], epoch: 22/50, iter: 834/834, loss: 0.32791, top1: 0.59421, throughput: 1325.34 | 2022-05-21 12:16:55.520 [rank:4] [train], epoch: 22/50, iter: 834/834, loss: 0.32489, top1: 0.60080, throughput: 1325.74 | 2022-05-21 12:16:55.520 [rank:5] [train], epoch: 22/50, iter: 834/834, loss: 0.32759, top1: 0.59743, throughput: 1325.95 | 2022-05-21 12:16:55.520 [rank:2] [train], epoch: 22/50, iter: 834/834, loss: 0.32610, top1: 0.59850, throughput: 1326.68 | 2022-05-21 12:16:55.520 [rank:7] [train], epoch: 22/50, iter: 834/834, loss: 0.32602, top1: 0.60432, throughput: 1325.02 | 2022-05-21 12:16:55.521 [rank:3] [train], epoch: 22/50, iter: 834/834, loss: 0.32895, top1: 0.60447, throughput: 1324.55 | 2022-05-21 12:16:55.523 [rank:0] [train], epoch: 22/50, iter: 834/834, loss: 0.32590, top1: 0.60034, throughput: 1325.82 | 2022-05-21 12:16:55.524 [rank:0] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.61024, throughput: 564.36 | 2022-05-21 12:17:06.598 [rank:7] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.60240, throughput: 563.81 | 2022-05-21 12:17:06.607 [rank:1] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.60512, throughput: 561.77 | 2022-05-21 12:17:06.645 [rank:6] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.59904, throughput: 561.37 | 2022-05-21 12:17:06.653 [rank:3] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.59424, throughput: 560.52 | 2022-05-21 12:17:06.674 [rank:2] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.58912, throughput: 558.52 | 2022-05-21 12:17:06.711 [rank:4] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.60832, throughput: 557.12 | 2022-05-21 12:17:06.739 [rank:5] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.59712, throughput: 549.79 | 2022-05-21 12:17:06.888 [rank:4] [train], epoch: 23/50, iter: 100/834, loss: 0.32185, top1: 0.60990, throughput: 1320.62 | 2022-05-21 12:17:21.277 [rank:7] [train], epoch: 23/50, iter: 100/834, loss: 0.32079, top1: 0.61026, throughput: 1308.70 | 2022-05-21 12:17:21.278 [rank:6] [train], epoch: 23/50, iter: 100/834, loss: 0.32330, top1: 0.60411, throughput: 1312.71 | 2022-05-21 12:17:21.280 [rank:5] [train], epoch: 23/50, iter: 100/834, loss: 0.32108, top1: 0.60974, throughput: 1334.22 | 2022-05-21 12:17:21.278 [rank:2] [train], epoch: 23/50, iter: 100/834, loss: 0.32064, top1: 0.60937, throughput: 1318.03 | 2022-05-21 12:17:21.278 [rank:3] [train], epoch: 23/50, iter: 100/834, loss: 0.32038, top1: 0.61307, throughput: 1314.57 | 2022-05-21 12:17:21.279 [rank:1] [train], epoch: 23/50, iter: 100/834, loss: 0.31537, top1: 0.62104, throughput: 1312.02 | 2022-05-21 12:17:21.279 [rank:0] [train], epoch: 23/50, iter: 100/834, loss: 0.31905, top1: 0.61146, throughput: 1307.72 | 2022-05-21 12:17:21.280 [rank:5] [train], epoch: 23/50, iter: 200/834, loss: 0.32255, top1: 0.60917, throughput: 1331.53 | 2022-05-21 12:17:35.698 [rank:3] [train], epoch: 23/50, iter: 200/834, loss: 0.31682, top1: 0.61901, throughput: 1331.58 | 2022-05-21 12:17:35.698 [rank:4] [train], epoch: 23/50, iter: 200/834, loss: 0.32172, top1: 0.61214, throughput: 1331.46 | 2022-05-21 12:17:35.698 [rank:6] [train], epoch: 23/50, iter: 200/834, loss: 0.32182, top1: 0.61083, throughput: 1331.64 | 2022-05-21 12:17:35.698 [rank:7] [train], epoch: 23/50, iter: 200/834, loss: 0.32144, top1: 0.60974, throughput: 1331.37 | 2022-05-21 12:17:35.699 [rank:1] [train], epoch: 23/50, iter: 200/834, loss: 0.32404, top1: 0.60891, throughput: 1331.46 | 2022-05-21 12:17:35.699 [rank:0] [train], epoch: 23/50, iter: 200/834, loss: 0.31982, top1: 0.61833, throughput: 1331.40 | 2022-05-21 12:17:35.701 [rank:2] [train], epoch: 23/50, iter: 200/834, loss: 0.31953, top1: 0.61375, throughput: 1331.14 | 2022-05-21 12:17:35.701 [rank:2] [train], epoch: 23/50, iter: 300/834, loss: 0.32267, top1: 0.60516, throughput: 1320.84 | 2022-05-21 12:17:50.238 [rank:4] [train], epoch: 23/50, iter: 300/834, loss: 0.32072, top1: 0.60922, throughput: 1320.48 | 2022-05-21 12:17:50.238 [rank:0] [train], epoch: 23/50, iter: 300/834, loss: 0.32001, top1: 0.61026, throughput: 1320.82 | 2022-05-21 12:17:50.238 [rank:1] [train], epoch: 23/50, iter: 300/834, loss: 0.32123, top1: 0.60979, throughput: 1320.66 | 2022-05-21 12:17:50.238 [rank:3] [train], epoch: 23/50, iter: 300/834, loss: 0.32225, top1: 0.60896, throughput: 1320.57 | 2022-05-21 12:17:50.238 [rank:7] [train], epoch: 23/50, iter: 300/834, loss: 0.32420, top1: 0.60547, throughput: 1320.44 | 2022-05-21 12:17:50.239 [rank:5] [train], epoch: 23/50, iter: 300/834, loss: 0.32152, top1: 0.61068, throughput: 1320.31 | 2022-05-21 12:17:50.240 [rank:6] [train], epoch: 23/50, iter: 300/834, loss: 0.31992, top1: 0.61161, throughput: 1320.33 | 2022-05-21 12:17:50.240 [rank:7] [train], epoch: 23/50, iter: 400/834, loss: 0.32400, top1: 0.60703, throughput: 1327.46 | 2022-05-21 12:18:04.703 [rank:3] [train], epoch: 23/50, iter: 400/834, loss: 0.32119, top1: 0.61245, throughput: 1327.30 | 2022-05-21 12:18:04.703 [rank:4] [train], epoch: 23/50, iter: 400/834, loss: 0.32243, top1: 0.61120, throughput: 1327.30 | 2022-05-21 12:18:04.703 [rank:2] [train], epoch: 23/50, iter: 400/834, loss: 0.32262, top1: 0.60630, throughput: 1327.30 | 2022-05-21 12:18:04.703 [rank:6] [train], epoch: 23/50, iter: 400/834, loss: 0.32106, top1: 0.60646, throughput: 1327.49 | 2022-05-21 12:18:04.703 [rank:5] [train], epoch: 23/50, iter: 400/834, loss: 0.31914, top1: 0.61661, throughput: 1327.49 | 2022-05-21 12:18:04.703 [rank:0] [train], epoch: 23/50, iter: 400/834, loss: 0.32012, top1: 0.61312, throughput: 1327.08 | 2022-05-21 12:18:04.705 [rank:1] [train], epoch: 23/50, iter: 400/834, loss: 0.32399, top1: 0.60776, throughput: 1327.06 | 2022-05-21 12:18:04.706 [rank:4] [train], epoch: 23/50, iter: 500/834, loss: 0.32340, top1: 0.60302, throughput: 1328.92 | 2022-05-21 12:18:19.151 [rank:1] [train], epoch: 23/50, iter: 500/834, loss: 0.32609, top1: 0.59870, throughput: 1329.17 | 2022-05-21 12:18:19.151 [rank:5] [train], epoch: 23/50, iter: 500/834, loss: 0.32264, top1: 0.60771, throughput: 1328.86 | 2022-05-21 12:18:19.152 [rank:3] [train], epoch: 23/50, iter: 500/834, loss: 0.31935, top1: 0.61573, throughput: 1328.85 | 2022-05-21 12:18:19.152 [rank:6] [train], epoch: 23/50, iter: 500/834, loss: 0.32370, top1: 0.60682, throughput: 1328.81 | 2022-05-21 12:18:19.152 [rank:2] [train], epoch: 23/50, iter: 500/834, loss: 0.32504, top1: 0.60385, throughput: 1328.82 | 2022-05-21 12:18:19.152 [rank:7] [train], epoch: 23/50, iter: 500/834, loss: 0.32348, top1: 0.61146, throughput: 1328.74 | 2022-05-21 12:18:19.153 [rank:0] [train], epoch: 23/50, iter: 500/834, loss: 0.32247, top1: 0.60406, throughput: 1328.81 | 2022-05-21 12:18:19.154 [rank:6] [train], epoch: 23/50, iter: 600/834, loss: 0.32224, top1: 0.60526, throughput: 1328.68 | 2022-05-21 12:18:33.603 [rank:5] [train], epoch: 23/50, iter: 600/834, loss: 0.32133, top1: 0.60812, throughput: 1328.56 | 2022-05-21 12:18:33.603 [rank:3] [train], epoch: 23/50, iter: 600/834, loss: 0.32448, top1: 0.60344, throughput: 1328.58 | 2022-05-21 12:18:33.603 [rank:7] [train], epoch: 23/50, iter: 600/834, loss: 0.32107, top1: 0.60958, throughput: 1328.71 | 2022-05-21 12:18:33.603 [rank:1] [train], epoch: 23/50, iter: 600/834, loss: 0.32196, top1: 0.60760, throughput: 1328.50 | 2022-05-21 12:18:33.603 [rank:4] [train], epoch: 23/50, iter: 600/834, loss: 0.32328, top1: 0.60370, throughput: 1328.42 | 2022-05-21 12:18:33.604 [rank:2] [train], epoch: 23/50, iter: 600/834, loss: 0.32240, top1: 0.60693, throughput: 1328.48 | 2022-05-21 12:18:33.605 [rank:0] [train], epoch: 23/50, iter: 600/834, loss: 0.32228, top1: 0.60865, throughput: 1328.59 | 2022-05-21 12:18:33.606 [rank:7] [train], epoch: 23/50, iter: 700/834, loss: 0.32320, top1: 0.60849, throughput: 1326.46 | 2022-05-21 12:18:48.078 [rank:6] [train], epoch: 23/50, iter: 700/834, loss: 0.32253, top1: 0.60833, throughput: 1326.17 | 2022-05-21 12:18:48.080 [rank:2] [train], epoch: 23/50, iter: 700/834, loss: 0.32255, top1: 0.60943, throughput: 1326.64 | 2022-05-21 12:18:48.077 [rank:3] [train], epoch: 23/50, iter: 700/834, loss: 0.32438, top1: 0.59880, throughput: 1326.50 | 2022-05-21 12:18:48.077 [rank:4] [train], epoch: 23/50, iter: 700/834, loss: 0.31917, top1: 0.61120, throughput: 1326.50 | 2022-05-21 12:18:48.079 [rank:5] [train], epoch: 23/50, iter: 700/834, loss: 0.32383, top1: 0.60365, throughput: 1326.42 | 2022-05-21 12:18:48.078 [rank:1] [train], epoch: 23/50, iter: 700/834, loss: 0.32115, top1: 0.60698, throughput: 1326.43 | 2022-05-21 12:18:48.078 [rank:0] [train], epoch: 23/50, iter: 700/834, loss: 0.32023, top1: 0.61146, throughput: 1326.46 | 2022-05-21 12:18:48.081 [rank:0] [train], epoch: 23/50, iter: 800/834, loss: 0.32442, top1: 0.60432, throughput: 1328.53 | 2022-05-21 12:19:02.533 [rank:2] [train], epoch: 23/50, iter: 800/834, loss: 0.32367, top1: 0.60609, throughput: 1328.25 | 2022-05-21 12:19:02.532 [rank:3] [train], epoch: 23/50, iter: 800/834, loss: 0.32427, top1: 0.60625, throughput: 1328.27 | 2022-05-21 12:19:02.532 [rank:4] [train], epoch: 23/50, iter: 800/834, loss: 0.32325, top1: 0.60563, throughput: 1328.33 | 2022-05-21 12:19:02.533 [rank:5] [train], epoch: 23/50, iter: 800/834, loss: 0.32207, top1: 0.60870, throughput: 1328.33 | 2022-05-21 12:19:02.533 [rank:7] [train], epoch: 23/50, iter: 800/834, loss: 0.32470, top1: 0.60380, throughput: 1328.26 | 2022-05-21 12:19:02.533 [rank:6] [train], epoch: 23/50, iter: 800/834, loss: 0.32502, top1: 0.60063, throughput: 1328.40 | 2022-05-21 12:19:02.534 [rank:1] [train], epoch: 23/50, iter: 800/834, loss: 0.32416, top1: 0.60432, throughput: 1328.04 | 2022-05-21 12:19:02.536 [rank:2] [train], epoch: 23/50, iter: 834/834, loss: 0.32191, top1: 0.60432, throughput: 1319.78 | 2022-05-21 12:19:07.479 [rank:5] [train], epoch: 23/50, iter: 834/834, loss: 0.31941, top1: 0.61213, throughput: 1319.74 | 2022-05-21 12:19:07.479 [rank:7] [train], epoch: 23/50, iter: 834/834, loss: 0.32606, top1: 0.60325, throughput: 1319.56 | 2022-05-21 12:19:07.480 [rank:6] [train], epoch: 23/50, iter: 834/834, loss: 0.32101, top1: 0.61412, throughput: 1319.95 | 2022-05-21 12:19:07.480 [rank:0] [train], epoch: 23/50, iter: 834/834, loss: 0.31981, top1: 0.61520, throughput: 1319.57 | 2022-05-21 12:19:07.480 [rank:4] [train], epoch: 23/50, iter: 834/834, loss: 0.31818, top1: 0.61566, throughput: 1319.34 | 2022-05-21 12:19:07.481 [rank:3] [train], epoch: 23/50, iter: 834/834, loss: 0.32973, top1: 0.60034, throughput: 1319.13 | 2022-05-21 12:19:07.481 [rank:1] [train], epoch: 23/50, iter: 834/834, loss: 0.32052, top1: 0.60723, throughput: 1320.18 | 2022-05-21 12:19:07.480 [rank:7] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.61472, throughput: 568.04 | 2022-05-21 12:19:18.483 [rank:0] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.61696, throughput: 567.66 | 2022-05-21 12:19:18.490 [rank:4] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.61200, throughput: 566.90 | 2022-05-21 12:19:18.506 [rank:2] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.60752, throughput: 560.71 | 2022-05-21 12:19:18.625 [rank:3] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.61024, throughput: 558.13 | 2022-05-21 12:19:18.679 [rank:6] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.61568, throughput: 555.94 | 2022-05-21 12:19:18.722 [rank:5] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.59936, throughput: 554.46 | 2022-05-21 12:19:18.751 [rank:1] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.61680, throughput: 550.34 | 2022-05-21 12:19:18.837 [rank:7] [train], epoch: 24/50, iter: 100/834, loss: 0.31876, top1: 0.61828, throughput: 1302.50 | 2022-05-21 12:19:33.223 [rank:3] [train], epoch: 24/50, iter: 100/834, loss: 0.31875, top1: 0.61432, throughput: 1319.95 | 2022-05-21 12:19:33.225 [rank:6] [train], epoch: 24/50, iter: 100/834, loss: 0.31577, top1: 0.62063, throughput: 1323.85 | 2022-05-21 12:19:33.225 [rank:2] [train], epoch: 24/50, iter: 100/834, loss: 0.31854, top1: 0.61510, throughput: 1315.07 | 2022-05-21 12:19:33.225 [rank:1] [train], epoch: 24/50, iter: 100/834, loss: 0.31649, top1: 0.61818, throughput: 1334.56 | 2022-05-21 12:19:33.224 [rank:4] [train], epoch: 24/50, iter: 100/834, loss: 0.31680, top1: 0.61937, throughput: 1304.15 | 2022-05-21 12:19:33.228 [rank:5] [train], epoch: 24/50, iter: 100/834, loss: 0.31980, top1: 0.61094, throughput: 1326.25 | 2022-05-21 12:19:33.228 [rank:0] [train], epoch: 24/50, iter: 100/834, loss: 0.31723, top1: 0.61906, throughput: 1302.97 | 2022-05-21 12:19:33.225 [rank:6] [train], epoch: 24/50, iter: 200/834, loss: 0.31869, top1: 0.61318, throughput: 1330.43 | 2022-05-21 12:19:47.656 [rank:4] [train], epoch: 24/50, iter: 200/834, loss: 0.31770, top1: 0.61516, throughput: 1330.64 | 2022-05-21 12:19:47.657 [rank:5] [train], epoch: 24/50, iter: 200/834, loss: 0.32251, top1: 0.60776, throughput: 1330.52 | 2022-05-21 12:19:47.658 [rank:7] [train], epoch: 24/50, iter: 200/834, loss: 0.31900, top1: 0.61568, throughput: 1330.09 | 2022-05-21 12:19:47.659 [rank:2] [train], epoch: 24/50, iter: 200/834, loss: 0.31798, top1: 0.61469, throughput: 1330.39 | 2022-05-21 12:19:47.657 [rank:1] [train], epoch: 24/50, iter: 200/834, loss: 0.31561, top1: 0.62229, throughput: 1330.20 | 2022-05-21 12:19:47.658 [rank:0] [train], epoch: 24/50, iter: 200/834, loss: 0.32021, top1: 0.61375, throughput: 1330.19 | 2022-05-21 12:19:47.659 [rank:3] [train], epoch: 24/50, iter: 200/834, loss: 0.31653, top1: 0.61547, throughput: 1330.19 | 2022-05-21 12:19:47.659 [rank:7] [train], epoch: 24/50, iter: 300/834, loss: 0.32089, top1: 0.61354, throughput: 1321.47 | 2022-05-21 12:20:02.188 [rank:3] [train], epoch: 24/50, iter: 300/834, loss: 0.31648, top1: 0.61536, throughput: 1321.52 | 2022-05-21 12:20:02.188 [rank:5] [train], epoch: 24/50, iter: 300/834, loss: 0.31959, top1: 0.61437, throughput: 1321.45 | 2022-05-21 12:20:02.188 [rank:1] [train], epoch: 24/50, iter: 300/834, loss: 0.31842, top1: 0.61349, throughput: 1321.43 | 2022-05-21 12:20:02.188 [rank:6] [train], epoch: 24/50, iter: 300/834, loss: 0.31933, top1: 0.61719, throughput: 1321.18 | 2022-05-21 12:20:02.189 [rank:0] [train], epoch: 24/50, iter: 300/834, loss: 0.31654, top1: 0.62099, throughput: 1321.50 | 2022-05-21 12:20:02.188 [rank:4] [train], epoch: 24/50, iter: 300/834, loss: 0.31912, top1: 0.61406, throughput: 1321.13 | 2022-05-21 12:20:02.190 [rank:2] [train], epoch: 24/50, iter: 300/834, loss: 0.32245, top1: 0.61026, throughput: 1321.33 | 2022-05-21 12:20:02.188 [rank:2] [train], epoch: 24/50, iter: 400/834, loss: 0.31550, top1: 0.61583, throughput: 1328.01 | 2022-05-21 12:20:16.646 [rank:6] [train], epoch: 24/50, iter: 400/834, loss: 0.31961, top1: 0.60995, throughput: 1328.08 | 2022-05-21 12:20:16.646 [rank:7] [train], epoch: 24/50, iter: 400/834, loss: 0.32028, top1: 0.61505, throughput: 1327.88 | 2022-05-21 12:20:16.647 [rank:0] [train], epoch: 24/50, iter: 400/834, loss: 0.31661, top1: 0.61948, throughput: 1328.05 | 2022-05-21 12:20:16.646 [rank:4] [train], epoch: 24/50, iter: 400/834, loss: 0.31970, top1: 0.61250, throughput: 1328.02 | 2022-05-21 12:20:16.648 [rank:3] [train], epoch: 24/50, iter: 400/834, loss: 0.31799, top1: 0.61068, throughput: 1327.88 | 2022-05-21 12:20:16.647 [rank:5] [train], epoch: 24/50, iter: 400/834, loss: 0.31867, top1: 0.61219, throughput: 1327.83 | 2022-05-21 12:20:16.648 [rank:1] [train], epoch: 24/50, iter: 400/834, loss: 0.32119, top1: 0.60943, throughput: 1327.65 | 2022-05-21 12:20:16.649 [rank:5] [train], epoch: 24/50, iter: 500/834, loss: 0.32055, top1: 0.61005, throughput: 1327.22 | 2022-05-21 12:20:31.114 [rank:4] [train], epoch: 24/50, iter: 500/834, loss: 0.32060, top1: 0.61141, throughput: 1327.20 | 2022-05-21 12:20:31.114 [rank:6] [train], epoch: 24/50, iter: 500/834, loss: 0.31977, top1: 0.61359, throughput: 1326.98 | 2022-05-21 12:20:31.115 [rank:7] [train], epoch: 24/50, iter: 500/834, loss: 0.31990, top1: 0.61104, throughput: 1327.12 | 2022-05-21 12:20:31.114 [rank:1] [train], epoch: 24/50, iter: 500/834, loss: 0.31701, top1: 0.61698, throughput: 1327.32 | 2022-05-21 12:20:31.114 [rank:2] [train], epoch: 24/50, iter: 500/834, loss: 0.32052, top1: 0.61521, throughput: 1326.88[rank:3] [train], epoch: 24/50, iter: 500/834, loss: 0.32069, top1: 0.61016, throughput: 1326.92 | 2022-05-21 12:20:31.116 | 2022-05-21 12:20:31.117 [rank:0] [train], epoch: 24/50, iter: 500/834, loss: 0.32078, top1: 0.61328, throughput: 1326.82 | 2022-05-21 12:20:31.116 [rank:7] [train], epoch: 24/50, iter: 600/834, loss: 0.32067, top1: 0.60958, throughput: 1329.04 | 2022-05-21 12:20:45.561 [rank:5] [train], epoch: 24/50, iter: 600/834, loss: 0.32200, top1: 0.60411, throughput: 1328.98 | 2022-05-21 12:20:45.561 [rank:0] [train], epoch: 24/50, iter: 600/834, loss: 0.31857, top1: 0.61531, throughput: 1329.19 | 2022-05-21 12:20:45.561 [rank:3] [train], epoch: 24/50, iter: 600/834, loss: 0.31930, top1: 0.61260, throughput: 1329.15 | 2022-05-21 12:20:45.562 [rank:6] [train], epoch: 24/50, iter: 600/834, loss: 0.31769, top1: 0.61703, throughput: 1328.81 | 2022-05-21 12:20:45.564 [rank:1] [train], epoch: 24/50, iter: 600/834, loss: 0.32022, top1: 0.61005, throughput: 1328.79 | 2022-05-21 12:20:45.564 [rank:4] [train], epoch: 24/50, iter: 600/834, loss: 0.32043, top1: 0.61031, throughput: 1328.78 | 2022-05-21 12:20:45.564 [rank:2] [train], epoch: 24/50, iter: 600/834, loss: 0.32369, top1: 0.60536, throughput: 1328.91 | 2022-05-21 12:20:45.564 [rank:4] [train], epoch: 24/50, iter: 700/834, loss: 0.31968, top1: 0.61391, throughput: 1330.07 | 2022-05-21 12:20:59.999 [rank:5] [train], epoch: 24/50, iter: 700/834, loss: 0.32043, top1: 0.61375, throughput: 1329.69 | 2022-05-21 12:21:00.001 [rank:0] [train], epoch: 24/50, iter: 700/834, loss: 0.31878, top1: 0.61292, throughput: 1329.83 | 2022-05-21 12:20:59.999 [rank:1] [train], epoch: 24/50, iter: 700/834, loss: 0.32007, top1: 0.61229, throughput: 1330.11 | 2022-05-21 12:20:59.999 [rank:6] [train], epoch: 24/50, iter: 700/834, loss: 0.32017, top1: 0.61349, throughput: 1329.91 | 2022-05-21 12:21:00.001 [rank:7] [train], epoch: 24/50, iter: 700/834, loss: 0.31590, top1: 0.62109, throughput: 1329.53 | 2022-05-21 12:21:00.002 [rank:2] [train], epoch: 24/50, iter: 700/834, loss: 0.31939, top1: 0.61703, throughput: 1329.90 | 2022-05-21 12:21:00.001 [rank:3] [train], epoch: 24/50, iter: 700/834, loss: 0.31966, top1: 0.61266, throughput: 1329.71 | 2022-05-21 12:21:00.001 [rank:7] [train], epoch: 24/50, iter: 800/834, loss: 0.32077, top1: 0.61151, throughput: 1327.88 | 2022-05-21 12:21:14.461 [rank:3] [train], epoch: 24/50, iter: 800/834, loss: 0.32040, top1: 0.61484, throughput: 1327.78 | 2022-05-21 12:21:14.461 [rank:5] [train], epoch: 24/50, iter: 800/834, loss: 0.31814, top1: 0.61891, throughput: 1327.71 | 2022-05-21 12:21:14.462 [rank:2] [train], epoch: 24/50, iter: 800/834, loss: 0.32044, top1: 0.61161, throughput: 1327.74 | 2022-05-21 12:21:14.461 [rank:6] [train], epoch: 24/50, iter: 800/834, loss: 0.32493, top1: 0.60245, throughput: 1327.67 | 2022-05-21 12:21:14.462 [rank:0] [train], epoch: 24/50, iter: 800/834, loss: 0.32091, top1: 0.61354, throughput: 1327.51 | 2022-05-21 12:21:14.462 [rank:1] [train], epoch: 24/50, iter: 800/834, loss: 0.32104, top1: 0.61099, throughput: 1327.46 | 2022-05-21 12:21:14.462 [rank:4] [train], epoch: 24/50, iter: 800/834, loss: 0.32189, top1: 0.60724, throughput: 1327.30 | 2022-05-21 12:21:14.464 [rank:4] [train], epoch: 24/50, iter: 834/834, loss: 0.32461, top1: 0.60478, throughput: 1324.11 | 2022-05-21 12:21:19.394 [rank:5] [train], epoch: 24/50, iter: 834/834, loss: 0.31827, top1: 0.61994, throughput: 1323.46 | 2022-05-21 12:21:19.394 [rank:3] [train], epoch: 24/50, iter: 834/834, loss: 0.31538, top1: 0.62714, throughput: 1323.23 | 2022-05-21 12:21:19.395 [rank:2] [train], epoch: 24/50, iter: 834/834, loss: 0.32657, top1: 0.59850, throughput: 1323.01 | 2022-05-21 12:21:19.396 [rank:6] [train], epoch: 24/50, iter: 834/834, loss: 0.31848, top1: 0.61137, throughput: 1322.97 | 2022-05-21 12:21:19.397 [rank:7] [train], epoch: 24/50, iter: 834/834, loss: 0.31724, top1: 0.61443, throughput: 1322.77 | 2022-05-21 12:21:19.396 [rank:1] [train], epoch: 24/50, iter: 834/834, loss: 0.32239, top1: 0.60401, throughput: 1322.90 | 2022-05-21 12:21:19.397 [rank:0] [train], epoch: 24/50, iter: 834/834, loss: 0.32580, top1: 0.60202, throughput: 1322.64 | 2022-05-21 12:21:19.398 [rank:0] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.62336, throughput: 572.09 | 2022-05-21 12:21:30.323 [rank:7] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.61952, throughput: 571.90 | 2022-05-21 12:21:30.325 [rank:4] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.62112, throughput: 570.06 | 2022-05-21 12:21:30.358 [rank:2] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.61488, throughput: 568.57 | 2022-05-21 12:21:30.388 [rank:6] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.61712, throughput: 565.51 | 2022-05-21 12:21:30.448 [rank:3] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.61632, throughput: 564.08 | 2022-05-21 12:21:30.475 [rank:5] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.60432, throughput: 560.41 | 2022-05-21 12:21:30.547 [rank:1] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.62624, throughput: 553.67 | 2022-05-21 12:21:30.685 [rank:5] [train], epoch: 25/50, iter: 100/834, loss: 0.31170, top1: 0.62526, throughput: 1319.53 | 2022-05-21 12:21:45.097 [rank:6] [train], epoch: 25/50, iter: 100/834, loss: 0.31292, top1: 0.62510, throughput: 1310.66 | 2022-05-21 12:21:45.098 [rank:7] [train], epoch: 25/50, iter: 100/834, loss: 0.31259, top1: 0.62625, throughput: 1299.74 | 2022-05-21 12:21:45.097 [rank:1] [train], epoch: 25/50, iter: 100/834, loss: 0.31053, top1: 0.63010, throughput: 1332.25 | 2022-05-21 12:21:45.097 [rank:4] [train], epoch: 25/50, iter: 100/834, loss: 0.31442, top1: 0.62349, throughput: 1302.52 | 2022-05-21 12:21:45.099 [rank:3] [train], epoch: 25/50, iter: 100/834, loss: 0.31430, top1: 0.62359, throughput: 1312.86 | 2022-05-21 12:21:45.099 [rank:2] [train], epoch: 25/50, iter: 100/834, loss: 0.31714, top1: 0.61630, throughput: 1305.13 | 2022-05-21 12:21:45.099 [rank:0] [train], epoch: 25/50, iter: 100/834, loss: 0.31488, top1: 0.62125, throughput: 1299.25 | 2022-05-21 12:21:45.101 [rank:3] [train], epoch: 25/50, iter: 200/834, loss: 0.31361, top1: 0.62698, throughput: 1330.80 | 2022-05-21 12:21:59.527 [rank:7] [train], epoch: 25/50, iter: 200/834, loss: 0.31368, top1: 0.62359, throughput: 1330.67 | 2022-05-21 12:21:59.526 [rank:6] [train], epoch: 25/50, iter: 200/834, loss: 0.31360, top1: 0.62500, throughput: 1330.56 | 2022-05-21 12:21:59.528 [rank:0] [train], epoch: 25/50, iter: 200/834, loss: 0.31260, top1: 0.62724, throughput: 1330.88 | 2022-05-21 12:21:59.527 [rank:5] [train], epoch: 25/50, iter: 200/834, loss: 0.31495, top1: 0.62573, throughput: 1330.43 | 2022-05-21 12:21:59.529 [rank:4] [train], epoch: 25/50, iter: 200/834, loss: 0.31181, top1: 0.62891, throughput: 1330.57 | 2022-05-21 12:21:59.529 [rank:2] [train], epoch: 25/50, iter: 200/834, loss: 0.31338, top1: 0.62620, throughput: 1330.57 | 2022-05-21 12:21:59.529 [rank:1] [train], epoch: 25/50, iter: 200/834, loss: 0.31585, top1: 0.62078, throughput: 1330.41 | 2022-05-21 12:21:59.529 [rank:7] [train], epoch: 25/50, iter: 300/834, loss: 0.31771, top1: 0.62063, throughput: 1328.11 | 2022-05-21 12:22:13.982 [rank:2] [train], epoch: 25/50, iter: 300/834, loss: 0.31774, top1: 0.61323, throughput: 1328.38 | 2022-05-21 12:22:13.983 [rank:5] [train], epoch: 25/50, iter: 300/834, loss: 0.31707, top1: 0.61604, throughput: 1328.33 | 2022-05-21 12:22:13.983 [rank:1] [train], epoch: 25/50, iter: 300/834, loss: 0.31468, top1: 0.61870, throughput: 1328.25 | 2022-05-21 12:22:13.984 [rank:4] [train], epoch: 25/50, iter: 300/834, loss: 0.31758, top1: 0.62083, throughput: 1328.21 | 2022-05-21 12:22:13.984 [rank:6] [train], epoch: 25/50, iter: 300/834, loss: 0.31478, top1: 0.62203, throughput: 1327.97 | 2022-05-21 12:22:13.986 [rank:0] [train], epoch: 25/50, iter: 300/834, loss: 0.31561, top1: 0.61854, throughput: 1327.96 | 2022-05-21 12:22:13.985 [rank:3] [train], epoch: 25/50, iter: 300/834, loss: 0.31286, top1: 0.62297, throughput: 1327.81 | 2022-05-21 12:22:13.987 [rank:6] [train], epoch: 25/50, iter: 400/834, loss: 0.31595, top1: 0.61750, throughput: 1328.55 | 2022-05-21 12:22:28.438 [rank:5] [train], epoch: 25/50, iter: 400/834, loss: 0.31890, top1: 0.61146, throughput: 1328.24 | 2022-05-21 12:22:28.438 [rank:4] [train], epoch: 25/50, iter: 400/834, loss: 0.31774, top1: 0.61578, throughput: 1328.33 | 2022-05-21 12:22:28.439 [rank:1] [train], epoch: 25/50, iter: 400/834, loss: 0.31762, top1: 0.61469, throughput: 1328.27 | 2022-05-21 12:22:28.439 [rank:3] [train], epoch: 25/50, iter: 400/834, loss: 0.31696, top1: 0.61937, throughput: 1328.38 | 2022-05-21 12:22:28.440 [rank:0] [train], epoch: 25/50, iter: 400/834, loss: 0.31814, top1: 0.61266, throughput: 1328.39 | 2022-05-21 12:22:28.439 [rank:2] [train], epoch: 25/50, iter: 400/834, loss: 0.31396, top1: 0.61932, throughput: 1328.15 | 2022-05-21 12:22:28.439 [rank:7] [train], epoch: 25/50, iter: 400/834, loss: 0.31382, top1: 0.62281, throughput: 1327.91 | 2022-05-21 12:22:28.441 [rank:4] [train], epoch: 25/50, iter: 500/834, loss: 0.31967, top1: 0.61714, throughput: 1331.09 | 2022-05-21 12:22:42.863 [rank:7] [train], epoch: 25/50, iter: 500/834, loss: 0.31520, top1: 0.62182, throughput: 1331.30 | 2022-05-21 12:22:42.863 [rank:6] [train], epoch: 25/50, iter: 500/834, loss: 0.31443, top1: 0.62276, throughput: 1330.93 | 2022-05-21 12:22:42.864 [rank:1] [train], epoch: 25/50, iter: 500/834, loss: 0.31510, top1: 0.62089, throughput: 1331.04 | 2022-05-21 12:22:42.864 [rank:0] [train], epoch: 25/50, iter: 500/834, loss: 0.31656, top1: 0.61964, throughput: 1331.02 | 2022-05-21 12:22:42.864 [rank:5] [train], epoch: 25/50, iter: 500/834, loss: 0.31834, top1: 0.61974, throughput: 1330.82 | 2022-05-21 12:22:42.865 [rank:2] [train], epoch: 25/50, iter: 500/834, loss: 0.31723, top1: 0.61802, throughput: 1331.01 | 2022-05-21 12:22:42.864 [rank:3] [train], epoch: 25/50, iter: 500/834, loss: 0.31546, top1: 0.61818, throughput: 1331.02 | 2022-05-21 12:22:42.865 [rank:5] [train], epoch: 25/50, iter: 600/834, loss: 0.31668, top1: 0.61599, throughput: 1332.53 | 2022-05-21 12:22:57.274 [rank:6] [train], epoch: 25/50, iter: 600/834, loss: 0.31636, top1: 0.61693, throughput: 1332.11 | 2022-05-21 12:22:57.277 [rank:3] [train], epoch: 25/50, iter: 600/834, loss: 0.31546, top1: 0.62078, throughput: 1332.41 | 2022-05-21 12:22:57.275 [rank:2] [train], epoch: 25/50, iter: 600/834, loss: 0.31846, top1: 0.61557, throughput: 1332.36 | 2022-05-21 12:22:57.275 [rank:4] [train], epoch: 25/50, iter: 600/834, loss: 0.31916, top1: 0.61557, throughput: 1332.24 | 2022-05-21 12:22:57.275 [rank:7] [train], epoch: 25/50, iter: 600/834, loss: 0.31961, top1: 0.61052, throughput: 1332.22 | 2022-05-21 12:22:57.275 [rank:1] [train], epoch: 25/50, iter: 600/834, loss: 0.31519, top1: 0.62276, throughput: 1332.10 | 2022-05-21 12:22:57.277 [rank:0] [train], epoch: 25/50, iter: 600/834, loss: 0.31716, top1: 0.62021, throughput: 1332.18 | 2022-05-21 12:22:57.276 [rank:7] [train], epoch: 25/50, iter: 700/834, loss: 0.31798, top1: 0.61786, throughput: 1328.30 | 2022-05-21 12:23:11.730 [rank:6] [train], epoch: 25/50, iter: 700/834, loss: 0.31731, top1: 0.61865, throughput: 1328.29 | 2022-05-21 12:23:11.731 [rank:2] [train], epoch: 25/50, iter: 700/834, loss: 0.31594, top1: 0.62063, throughput: 1328.22 | 2022-05-21 12:23:11.730 [rank:3] [train], epoch: 25/50, iter: 700/834, loss: 0.31794, top1: 0.61870, throughput: 1328.17 | 2022-05-21 12:23:11.731 [rank:0] [train], epoch: 25/50, iter: 700/834, loss: 0.31587, top1: 0.62193, throughput: 1328.35 | 2022-05-21 12:23:11.730 [rank:1] [train], epoch: 25/50, iter: 700/834, loss: 0.31676, top1: 0.61589, throughput: 1328.26 | 2022-05-21 12:23:11.732 [rank:4] [train], epoch: 25/50, iter: 700/834, loss: 0.31790, top1: 0.61734, throughput: 1327.95 | 2022-05-21 12:23:11.733 [rank:5] [train], epoch: 25/50, iter: 700/834, loss: 0.31986, top1: 0.61271, throughput: 1327.90 | 2022-05-21 12:23:11.733 [rank:7] [train], epoch: 25/50, iter: 800/834, loss: 0.31668, top1: 0.61906, throughput: 1327.76 | 2022-05-21 12:23:26.190 [rank:3] [train], epoch: 25/50, iter: 800/834, loss: 0.31542, top1: 0.61750, throughput: 1327.83 | 2022-05-21 12:23:26.191[rank:5] [train], epoch: 25/50, iter: 800/834, loss: 0.31819, top1: 0.61359, throughput: 1327.96 | 2022-05-21 12:23:26.191 [rank:6] [train], epoch: 25/50, iter: 800/834, loss: 0.31656, top1: 0.61927, throughput: 1327.60 | 2022-05-21 12:23:26.194 [rank:0] [train], epoch: 25/50, iter: 800/834, loss: 0.31752, top1: 0.61849, throughput: 1327.55 | 2022-05-21 12:23:26.193 [rank:2] [train], epoch: 25/50, iter: 800/834, loss: 0.31689, top1: 0.61703, throughput: 1327.51 | 2022-05-21 12:23:26.194 [rank:4] [train], epoch: 25/50, iter: 800/834, loss: 0.31820, top1: 0.61589, throughput: 1327.67 | 2022-05-21 12:23:26.194 [rank:1] [train], epoch: 25/50, iter: 800/834, loss: 0.31711, top1: 0.61495, throughput: 1327.63 | 2022-05-21 12:23:26.194 [rank:1] [train], epoch: 25/50, iter: 834/834, loss: 0.31470, top1: 0.61841, throughput: 1331.97 | 2022-05-21 12:23:31.095 [rank:4] [train], epoch: 25/50, iter: 834/834, loss: 0.31997, top1: 0.61458, throughput: 1331.99 | 2022-05-21 12:23:31.095 [rank:2] [train], epoch: 25/50, iter: 834/834, loss: 0.31689, top1: 0.61841, throughput: 1331.87 | 2022-05-21 12:23:31.095 [rank:3] [train], epoch: 25/50, iter: 834/834, loss: 0.31735, top1: 0.61765, throughput: 1330.73 | 2022-05-21 12:23:31.097 [rank:0] [train], epoch: 25/50, iter: 834/834, loss: 0.31499, top1: 0.62531, throughput: 1331.23[rank:6] [train], epoch: 25/50, iter: 834/834, loss: 0.32057, top1: 0.60187, throughput: 1331.31 | 2022-05-21 12:23:31.097 | 2022-05-21 12:23:31.097 [rank:5] [train], epoch: 25/50, iter: 834/834, loss: 0.31890, top1: 0.62010, throughput: 1330.61 | 2022-05-21 12:23:31.097 [rank:7] [train], epoch: 25/50, iter: 834/834, loss: 0.31951, top1: 0.60830, throughput: 1330.39 | 2022-05-21 12:23:31.097 [rank:0] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.62608, throughput: 569.05 | 2022-05-21 12:23:42.080 [rank:4] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.62464, throughput: 568.94 | 2022-05-21 12:23:42.081 [rank:7] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.62944, throughput: 568.67 | 2022-05-21 12:23:42.088 [rank:2] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.62256, throughput: 567.95 | 2022-05-21 12:23:42.099 [rank:6] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.62992, throughput: 563.87 | 2022-05-21 12:23:42.181 [rank:3] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.62384, throughput: 562.97 | 2022-05-21 12:23:42.198 [rank:1] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.63312, throughput: 555.28 | 2022-05-21 12:23:42.350 [rank:5] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.61440, throughput: 549.93 | 2022-05-21 12:23:42.462 [rank:5] [train], epoch: 26/50, iter: 100/834, loss: 0.30953, top1: 0.63365, throughput: 1331.26 | 2022-05-21 12:23:56.885 [rank:0] [train], epoch: 26/50, iter: 100/834, loss: 0.31036, top1: 0.63177, throughput: 1296.89 | 2022-05-21 12:23:56.885 [rank:7] [train], epoch: 26/50, iter: 100/834, loss: 0.30965, top1: 0.63286, throughput: 1297.55 | 2022-05-21 12:23:56.885 [rank:6] [train], epoch: 26/50, iter: 100/834, loss: 0.31269, top1: 0.62969, throughput: 1305.75 | 2022-05-21 12:23:56.885 [rank:2] [train], epoch: 26/50, iter: 100/834, loss: 0.31205, top1: 0.63042, throughput: 1298.60 | 2022-05-21 12:23:56.884 [rank:4] [train], epoch: 26/50, iter: 100/834, loss: 0.30919, top1: 0.62849, throughput: 1296.85 | 2022-05-21 12:23:56.886 [rank:3] [train], epoch: 26/50, iter: 100/834, loss: 0.31326, top1: 0.63104, throughput: 1307.23 | 2022-05-21 12:23:56.886 [rank:1] [train], epoch: 26/50, iter: 100/834, loss: 0.31345, top1: 0.62531, throughput: 1320.97 | 2022-05-21 12:23:56.885 [rank:5] [train], epoch: 26/50, iter: 200/834, loss: 0.31074, top1: 0.62781, throughput: 1328.99[rank:4] [train], epoch: 26/50, iter: 200/834, loss: 0.31358, top1: 0.62349, throughput: 1329.06 | 2022-05-21 12:24:11.332| 2022-05-21 12:24:11.332 [rank:6] [train], epoch: 26/50, iter: 200/834, loss: 0.31318, top1: 0.62339, throughput: 1329.00 | 2022-05-21 12:24:11.332 [rank:2] [train], epoch: 26/50, iter: 200/834, loss: 0.31180, top1: 0.62781, throughput: 1329.03 | 2022-05-21 12:24:11.331 [rank:0] [train], epoch: 26/50, iter: 200/834, loss: 0.31165, top1: 0.62974, throughput: 1328.96 | 2022-05-21 12:24:11.332 [rank:3] [train], epoch: 26/50, iter: 200/834, loss: 0.30899, top1: 0.63250, throughput: 1328.96 | 2022-05-21 12:24:11.333 [rank:7] [train], epoch: 26/50, iter: 200/834, loss: 0.31113, top1: 0.63047, throughput: 1328.82 | 2022-05-21 12:24:11.334 [rank:1] [train], epoch: 26/50, iter: 200/834, loss: 0.31376, top1: 0.62401, throughput: 1328.81 | 2022-05-21 12:24:11.334 [rank:7] [train], epoch: 26/50, iter: 300/834, loss: 0.31521, top1: 0.62146, throughput: 1332.26 | 2022-05-21 12:24:25.745 [rank:4] [train], epoch: 26/50, iter: 300/834, loss: 0.31256, top1: 0.62776, throughput: 1332.03 | 2022-05-21 12:24:25.746 [rank:2] [train], epoch: 26/50, iter: 300/834, loss: 0.31410, top1: 0.62260, throughput: 1332.00 | 2022-05-21 12:24:25.745 [rank:1] [train], epoch: 26/50, iter: 300/834, loss: 0.31305, top1: 0.62771, throughput: 1332.28 | 2022-05-21 12:24:25.746 [rank:3] [train], epoch: 26/50, iter: 300/834, loss: 0.31276, top1: 0.62578, throughput: 1332.00 | 2022-05-21 12:24:25.748 [rank:6] [train], epoch: 26/50, iter: 300/834, loss: 0.31235, top1: 0.62589, throughput: 1331.96 | 2022-05-21 12:24:25.747 [rank:5] [train], epoch: 26/50, iter: 300/834, loss: 0.31118, top1: 0.63115, throughput: 1331.87 | 2022-05-21 12:24:25.748 [rank:0] [train], epoch: 26/50, iter: 300/834, loss: 0.31241, top1: 0.62943, throughput: 1331.87 | 2022-05-21 12:24:25.748 [rank:3] [train], epoch: 26/50, iter: 400/834, loss: 0.31592, top1: 0.61354, throughput: 1330.63 | 2022-05-21 12:24:40.177 [rank:5] [train], epoch: 26/50, iter: 400/834, loss: 0.31438, top1: 0.62193, throughput: 1330.61 | 2022-05-21 12:24:40.177 [rank:7] [train], epoch: 26/50, iter: 400/834, loss: 0.31414, top1: 0.62167, throughput: 1330.41 | 2022-05-21 12:24:40.177 [rank:1] [train], epoch: 26/50, iter: 400/834, loss: 0.31197, top1: 0.62719, throughput: 1330.35[rank:6] [train], epoch: 26/50, iter: 400/834, loss: 0.31221, top1: 0.62474, throughput: 1330.56 | 2022-05-21 12:24:40.178 | 2022-05-21 12:24:40.177 [rank:4] [train], epoch: 26/50, iter: 400/834, loss: 0.31238, top1: 0.62219, throughput: 1330.35 | 2022-05-21 12:24:40.178 [rank:0] [train], epoch: 26/50, iter: 400/834, loss: 0.31314, top1: 0.62557, throughput: 1330.61 | 2022-05-21 12:24:40.177 [rank:2] [train], epoch: 26/50, iter: 400/834, loss: 0.31174, top1: 0.62969, throughput: 1330.31 | 2022-05-21 12:24:40.178 [rank:3] [train], epoch: 26/50, iter: 500/834, loss: 0.31265, top1: 0.62573, throughput: 1328.86 | 2022-05-21 12:24:54.625 [rank:4] [train], epoch: 26/50, iter: 500/834, loss: 0.31075, top1: 0.62714, throughput: 1328.97 | 2022-05-21 12:24:54.626[rank:0] [train], epoch: 26/50, iter: 500/834, loss: 0.30916, top1: 0.63313, throughput: 1328.87 | 2022-05-21 12:24:54.626 [rank:6] [train], epoch: 26/50, iter: 500/834, loss: 0.31373, top1: 0.62312, throughput: 1328.82 | 2022-05-21 12:24:54.626 [rank:7] [train], epoch: 26/50, iter: 500/834, loss: 0.31511, top1: 0.61745, throughput: 1328.67 | 2022-05-21 12:24:54.628 [rank:5] [train], epoch: 26/50, iter: 500/834, loss: 0.31397, top1: 0.62208, throughput: 1328.72 | 2022-05-21 12:24:54.627 [rank:1] [train], epoch: 26/50, iter: 500/834, loss: 0.31784, top1: 0.61693, throughput: 1328.71 | 2022-05-21 12:24:54.628 [rank:2] [train], epoch: 26/50, iter: 500/834, loss: 0.31359, top1: 0.62276, throughput: 1328.88 | 2022-05-21 12:24:54.626 [rank:5] [train], epoch: 26/50, iter: 600/834, loss: 0.31368, top1: 0.62245, throughput: 1329.78 | 2022-05-21 12:25:09.065 [rank:6] [train], epoch: 26/50, iter: 600/834, loss: 0.31740, top1: 0.61833, throughput: 1329.73 | 2022-05-21 12:25:09.065 [rank:1] [train], epoch: 26/50, iter: 600/834, loss: 0.31366, top1: 0.62677, throughput: 1329.98 | 2022-05-21 12:25:09.064 [rank:4] [train], epoch: 26/50, iter: 600/834, loss: 0.31274, top1: 0.62630, throughput: 1329.50 | 2022-05-21 12:25:09.067 [rank:3] [train], epoch: 26/50, iter: 600/834, loss: 0.31295, top1: 0.62297, throughput: 1329.47 | 2022-05-21 12:25:09.067 [rank:7] [train], epoch: 26/50, iter: 600/834, loss: 0.31280, top1: 0.62865, throughput: 1329.71 | 2022-05-21 12:25:09.067 [rank:2] [train], epoch: 26/50, iter: 600/834, loss: 0.31191, top1: 0.62563, throughput: 1329.52 | 2022-05-21 12:25:09.068 [rank:0] [train], epoch: 26/50, iter: 600/834, loss: 0.31614, top1: 0.62594, throughput: 1329.46 | 2022-05-21 12:25:09.068 [rank:4] [train], epoch: 26/50, iter: 700/834, loss: 0.31426, top1: 0.62313, throughput: 1326.74 | 2022-05-21 12:25:23.539 [rank:0] [train], epoch: 26/50, iter: 700/834, loss: 0.31429, top1: 0.62156, throughput: 1326.68 | 2022-05-21 12:25:23.540 [rank:7] [train], epoch: 26/50, iter: 700/834, loss: 0.31454, top1: 0.62245, throughput: 1326.63 | 2022-05-21 12:25:23.540 [rank:5] [train], epoch: 26/50, iter: 700/834, loss: 0.31691, top1: 0.62146, throughput: 1326.40 | 2022-05-21 12:25:23.541 [rank:6] [train], epoch: 26/50, iter: 700/834, loss: 0.31295, top1: 0.62318, throughput: 1326.30 | 2022-05-21 12:25:23.541 [rank:3] [train], epoch: 26/50, iter: 700/834, loss: 0.31125, top1: 0.62193, throughput: 1326.58 | 2022-05-21 12:25:23.541 [rank:1] [train], epoch: 26/50, iter: 700/834, loss: 0.31479, top1: 0.61781, throughput: 1326.32 | 2022-05-21 12:25:23.540 [rank:2] [train], epoch: 26/50, iter: 700/834, loss: 0.31556, top1: 0.61865, throughput: 1326.73 | 2022-05-21 12:25:23.539 [rank:5] [train], epoch: 26/50, iter: 800/834, loss: 0.31252, top1: 0.62130, throughput: 1325.49 | 2022-05-21 12:25:38.026 [rank:7] [train], epoch: 26/50, iter: 800/834, loss: 0.31754, top1: 0.61604, throughput: 1325.44 | 2022-05-21 12:25:38.025 [rank:4] [train], epoch: 26/50, iter: 800/834, loss: 0.31491, top1: 0.62245, throughput: 1325.33 | 2022-05-21 12:25:38.026 [rank:0] [train], epoch: 26/50, iter: 800/834, loss: 0.31510, top1: 0.61833, throughput: 1325.32 | 2022-05-21 12:25:38.027 [rank:1] [train], epoch: 26/50, iter: 800/834, loss: 0.31762, top1: 0.61594, throughput: 1325.31 | 2022-05-21 12:25:38.028 [rank:2] [train], epoch: 26/50, iter: 800/834, loss: 0.31390, top1: 0.62344, throughput: 1325.24 | 2022-05-21 12:25:38.027 [rank:3] [train], epoch: 26/50, iter: 800/834, loss: 0.31309, top1: 0.62349, throughput: 1325.36 | 2022-05-21 12:25:38.027 [rank:6] [train], epoch: 26/50, iter: 800/834, loss: 0.31452, top1: 0.62271, throughput: 1325.11 | 2022-05-21 12:25:38.031 [rank:5] [train], epoch: 26/50, iter: 834/834, loss: 0.31694, top1: 0.61749, throughput: 1329.81 | 2022-05-21 12:25:42.935 [rank:4] [train], epoch: 26/50, iter: 834/834, loss: 0.31701, top1: 0.61949, throughput: 1329.71 | 2022-05-21 12:25:42.935 [rank:3] [train], epoch: 26/50, iter: 834/834, loss: 0.31183, top1: 0.61841, throughput: 1330.11 | 2022-05-21 12:25:42.935 [rank:1] [train], epoch: 26/50, iter: 834/834, loss: 0.30934, top1: 0.62914, throughput: 1330.14 | 2022-05-21 12:25:42.935 [rank:0] [train], epoch: 26/50, iter: 834/834, loss: 0.31378, top1: 0.62255, throughput: 1329.94 | 2022-05-21 12:25:42.935 [rank:6] [train], epoch: 26/50, iter: 834/834, loss: 0.31405, top1: 0.62822, throughput: 1330.81 | 2022-05-21 12:25:42.936 [rank:2] [train], epoch: 26/50, iter: 834/834, loss: 0.31101, top1: 0.63802, throughput: 1330.17 | 2022-05-21 12:25:42.935 [rank:7] [train], epoch: 26/50, iter: 834/834, loss: 0.31027, top1: 0.63388, throughput: 1329.21 | 2022-05-21 12:25:42.937 [rank:0] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.63936, throughput: 576.42 | 2022-05-21 12:25:53.778 [rank:7] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.63616, throughput: 573.81 | 2022-05-21 12:25:53.829 [rank:3] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.63408, throughput: 567.82 | 2022-05-21 12:25:53.942 [rank:6] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.64608, throughput: 567.73 | 2022-05-21 12:25:53.945 [rank:2] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.63360, throughput: 567.05 | 2022-05-21 12:25:53.957 [rank:1] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.64464, throughput: 565.29 | 2022-05-21 12:25:53.992 [rank:5] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.63184, throughput: 558.63 | 2022-05-21 12:25:54.123 [rank:4] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.63840, throughput: 558.05 | 2022-05-21 12:25:54.135 [rank:3] [train], epoch: 27/50, iter: 100/834, loss: 0.30624, top1: 0.64255, throughput: 1314.82 | 2022-05-21 12:26:08.545 [rank:6] [train], epoch: 27/50, iter: 100/834, loss: 0.30827, top1: 0.63391, throughput: 1315.02 | 2022-05-21 12:26:08.545 [rank:7] [train], epoch: 27/50, iter: 100/834, loss: 0.30807, top1: 0.63490, throughput: 1304.64 | 2022-05-21 12:26:08.545 [rank:5] [train], epoch: 27/50, iter: 100/834, loss: 0.30828, top1: 0.63255, throughput: 1331.18 | 2022-05-21 12:26:08.546 [rank:0] [train], epoch: 27/50, iter: 100/834, loss: 0.31218, top1: 0.62708, throughput: 1300.12 | 2022-05-21 12:26:08.546 [rank:4] [train], epoch: 27/50, iter: 100/834, loss: 0.30752, top1: 0.63734, throughput: 1332.22 | 2022-05-21 12:26:08.547 [rank:1] [train], epoch: 27/50, iter: 100/834, loss: 0.31003, top1: 0.63115, throughput: 1319.12 | 2022-05-21 12:26:08.547 [rank:2] [train], epoch: 27/50, iter: 100/834, loss: 0.30705, top1: 0.63714, throughput: 1315.97 | 2022-05-21 12:26:08.547 [rank:6] [train], epoch: 27/50, iter: 200/834, loss: 0.30965, top1: 0.63286, throughput: 1326.30 | 2022-05-21 12:26:23.022 [rank:4] [train], epoch: 27/50, iter: 200/834, loss: 0.30897, top1: 0.63260, throughput: 1326.44[rank:7] [train], epoch: 27/50, iter: 200/834, loss: 0.30992, top1: 0.63349, throughput: 1326.38 | 2022-05-21 12:26:23.022 | 2022-05-21 12:26:23.021 [rank:0] [train], epoch: 27/50, iter: 200/834, loss: 0.30772, top1: 0.63432, throughput: 1326.32 | 2022-05-21 12:26:23.022 [rank:5] [train], epoch: 27/50, iter: 200/834, loss: 0.31067, top1: 0.63484, throughput: 1326.25 | 2022-05-21 12:26:23.023 [rank:3] [train], epoch: 27/50, iter: 200/834, loss: 0.31028, top1: 0.62974, throughput: 1326.04 | 2022-05-21 12:26:23.024 [rank:2] [train], epoch: 27/50, iter: 200/834, loss: 0.31180, top1: 0.62755, throughput: 1326.37 | 2022-05-21 12:26:23.023 [rank:1] [train], epoch: 27/50, iter: 200/834, loss: 0.30740, top1: 0.63682, throughput: 1326.19 | 2022-05-21 12:26:23.024 [rank:4] [train], epoch: 27/50, iter: 300/834, loss: 0.30800, top1: 0.63917, throughput: 1330.35 | 2022-05-21 12:26:37.454 [rank:3] [train], epoch: 27/50, iter: 300/834, loss: 0.30894, top1: 0.63229, throughput: 1330.54 | 2022-05-21 12:26:37.454 [rank:7] [train], epoch: 27/50, iter: 300/834, loss: 0.30536, top1: 0.64224, throughput: 1330.21 | 2022-05-21 12:26:37.455 [rank:5] [train], epoch: 27/50, iter: 300/834, loss: 0.31009, top1: 0.62828, throughput: 1330.42 | 2022-05-21 12:26:37.455 [rank:6] [train], epoch: 27/50, iter: 300/834, loss: 0.30836, top1: 0.63172, throughput: 1330.25 | 2022-05-21 12:26:37.455 [rank:2] [train], epoch: 27/50, iter: 300/834, loss: 0.30760, top1: 0.63594, throughput: 1330.28 | 2022-05-21 12:26:37.456 [rank:1] [train], epoch: 27/50, iter: 300/834, loss: 0.30974, top1: 0.62854, throughput: 1330.38 | 2022-05-21 12:26:37.456 [rank:0] [train], epoch: 27/50, iter: 300/834, loss: 0.30978, top1: 0.63177, throughput: 1330.15 | 2022-05-21 12:26:37.457 [rank:5] [train], epoch: 27/50, iter: 400/834, loss: 0.30776, top1: 0.63547, throughput: 1329.04 | 2022-05-21 12:26:51.901 [rank:4] [train], epoch: 27/50, iter: 400/834, loss: 0.31030, top1: 0.62792, throughput: 1328.91 | 2022-05-21 12:26:51.902 [rank:0] [train], epoch: 27/50, iter: 400/834, loss: 0.30997, top1: 0.62839, throughput: 1329.21 | 2022-05-21 12:26:51.901 [rank:6] [train], epoch: 27/50, iter: 400/834, loss: 0.30821, top1: 0.63698, throughput: 1329.06 | 2022-05-21 12:26:51.901 [rank:3] [train], epoch: 27/50, iter: 400/834, loss: 0.31214, top1: 0.62786, throughput: 1328.78 | 2022-05-21 12:26:51.904 [rank:7] [train], epoch: 27/50, iter: 400/834, loss: 0.30800, top1: 0.62891, throughput: 1329.02 | 2022-05-21 12:26:51.901 [rank:2] [train], epoch: 27/50, iter: 400/834, loss: 0.30945, top1: 0.63391, throughput: 1329.10 | 2022-05-21 12:26:51.901 [rank:1] [train], epoch: 27/50, iter: 400/834, loss: 0.30979, top1: 0.63401, throughput: 1329.16 | 2022-05-21 12:26:51.902 [rank:7] [train], epoch: 27/50, iter: 500/834, loss: 0.31188, top1: 0.62807, throughput: 1328.31 | 2022-05-21 12:27:06.356 [rank:3] [train], epoch: 27/50, iter: 500/834, loss: 0.31173, top1: 0.62557, throughput: 1328.55 | 2022-05-21 12:27:06.356 [rank:5] [train], epoch: 27/50, iter: 500/834, loss: 0.31124, top1: 0.62875, throughput: 1328.33 | 2022-05-21 12:27:06.355 [rank:1] [train], epoch: 27/50, iter: 500/834, loss: 0.30963, top1: 0.63151, throughput: 1328.39 [rank:2] [train], epoch: 27/50, iter: 500/834, loss: 0.31067, top1: 0.62615, throughput: 1328.37| 2022-05-21 12:27:06.355 | 2022-05-21 12:27:06.355 [rank:6] [train], epoch: 27/50, iter: 500/834, loss: 0.30951, top1: 0.63016, throughput: 1327.91 | 2022-05-21 12:27:06.360 [rank:0] [train], epoch: 27/50, iter: 500/834, loss: 0.31194, top1: 0.62859, throughput: 1328.28 | 2022-05-21 12:27:06.356 [rank:4] [train], epoch: 27/50, iter: 500/834, loss: 0.30978, top1: 0.63099, throughput: 1327.99 | 2022-05-21 12:27:06.360 [rank:0] [train], epoch: 27/50, iter: 600/834, loss: 0.31001, top1: 0.62969, throughput: 1323.81 | 2022-05-21 12:27:20.860 [rank:5] [train], epoch: 27/50, iter: 600/834, loss: 0.31048, top1: 0.62948, throughput: 1323.84 | 2022-05-21 12:27:20.859 [rank:6] [train], epoch: 27/50, iter: 600/834, loss: 0.31037, top1: 0.63057, throughput: 1324.24 | 2022-05-21 12:27:20.859 [rank:3] [train], epoch: 27/50, iter: 600/834, loss: 0.31143, top1: 0.63130, throughput: 1323.81 | 2022-05-21 12:27:20.859 [rank:7] [train], epoch: 27/50, iter: 600/834, loss: 0.31011, top1: 0.62943, throughput: 1323.72 | 2022-05-21 12:27:20.860 [rank:1] [train], epoch: 27/50, iter: 600/834, loss: 0.31315, top1: 0.62297, throughput: 1323.59 | 2022-05-21 12:27:20.861 [rank:2] [train], epoch: 27/50, iter: 600/834, loss: 0.30745, top1: 0.63448, throughput: 1323.71 | 2022-05-21 12:27:20.860 [rank:4] [train], epoch: 27/50, iter: 600/834, loss: 0.31189, top1: 0.62813, throughput: 1323.84 | 2022-05-21 12:27:20.863 [rank:5] [train], epoch: 27/50, iter: 700/834, loss: 0.30821, top1: 0.63724, throughput: 1332.60 | 2022-05-21 12:27:35.266 [rank:4] [train], epoch: 27/50, iter: 700/834, loss: 0.31128, top1: 0.62745, throughput: 1333.02 | 2022-05-21 12:27:35.266 [rank:1] [train], epoch: 27/50, iter: 700/834, loss: 0.31144, top1: 0.62786, throughput: 1332.87 | 2022-05-21 12:27:35.266 [rank:7] [train], epoch: 27/50, iter: 700/834, loss: 0.31023, top1: 0.62599, throughput: 1332.72 | 2022-05-21 12:27:35.267 [rank:6] [train], epoch: 27/50, iter: 700/834, loss: 0.31254, top1: 0.62771, throughput: 1332.52 | 2022-05-21 12:27:35.268 [rank:3] [train], epoch: 27/50, iter: 700/834, loss: 0.30875, top1: 0.63406, throughput: 1332.45 | 2022-05-21 12:27:35.269 [rank:2] [train], epoch: 27/50, iter: 700/834, loss: 0.30940, top1: 0.63047, throughput: 1332.48 | 2022-05-21 12:27:35.269 [rank:0] [train], epoch: 27/50, iter: 700/834, loss: 0.31125, top1: 0.62979, throughput: 1332.48 | 2022-05-21 12:27:35.269 [rank:5] [train], epoch: 27/50, iter: 800/834, loss: 0.31275, top1: 0.62427, throughput: 1330.14 | 2022-05-21 12:27:49.701 [rank:4] [train], epoch: 27/50, iter: 800/834, loss: 0.31052, top1: 0.63229, throughput: 1330.08 | 2022-05-21 12:27:49.702 [rank:7] [train], epoch: 27/50, iter: 800/834, loss: 0.31162, top1: 0.63000, throughput: 1330.04 | 2022-05-21 12:27:49.703 [rank:1] [train], epoch: 27/50, iter: 800/834, loss: 0.30947, top1: 0.63359, throughput: 1330.05 | 2022-05-21 12:27:49.702 [rank:2] [train], epoch: 27/50, iter: 800/834, loss: 0.31056, top1: 0.62688, throughput: 1330.33 | 2022-05-21 12:27:49.702 [rank:0] [train], epoch: 27/50, iter: 800/834, loss: 0.31186, top1: 0.62682, throughput: 1330.31 | 2022-05-21 12:27:49.702 [rank:3] [train], epoch: 27/50, iter: 800/834, loss: 0.31221, top1: 0.62427, throughput: 1330.12 | 2022-05-21 12:27:49.703 [rank:6] [train], epoch: 27/50, iter: 800/834, loss: 0.30970, top1: 0.63359, throughput: 1330.04 | 2022-05-21 12:27:49.704 [rank:7] [train], epoch: 27/50, iter: 834/834, loss: 0.31434, top1: 0.61841, throughput: 1323.88 | 2022-05-21 12:27:54.634 [rank:0] [train], epoch: 27/50, iter: 834/834, loss: 0.30869, top1: 0.63006, throughput: 1323.17 | 2022-05-21 12:27:54.635 [rank:5] [train], epoch: 27/50, iter: 834/834, loss: 0.30780, top1: 0.63006, throughput: 1322.81[rank:3] [train], epoch: 27/50, iter: 834/834, loss: 0.31305, top1: 0.62117, throughput: 1323.56 | 2022-05-21 12:27:54.636 | 2022-05-21 12:27:54.636 [rank:4] [train], epoch: 27/50, iter: 834/834, loss: 0.31093, top1: 0.62454, throughput: 1322.74 | 2022-05-21 12:27:54.637 [rank:6] [train], epoch: 27/50, iter: 834/834, loss: 0.30946, top1: 0.62822, throughput: 1323.20 | 2022-05-21 12:27:54.637 [rank:2] [train], epoch: 27/50, iter: 834/834, loss: 0.30972, top1: 0.63189, throughput: 1322.71 | 2022-05-21 12:27:54.637 [rank:1] [train], epoch: 27/50, iter: 834/834, loss: 0.31248, top1: 0.62684, throughput: 1322.76 | 2022-05-21 12:27:54.637 [rank:7] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.60688, throughput: 582.58 | 2022-05-21 12:28:05.362 [rank:0] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.61792, throughput: 582.58 | 2022-05-21 12:28:05.363 [rank:2] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.60448, throughput: 575.21 | 2022-05-21 12:28:05.502 [rank:6] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.61360, throughput: 574.64 | 2022-05-21 12:28:05.514 [rank:3] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.60912, throughput: 572.68 | 2022-05-21 12:28:05.549 [rank:4] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.60880, throughput: 567.01 | 2022-05-21 12:28:05.660 [rank:5] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.59408, throughput: 565.05 | 2022-05-21 12:28:05.697 [rank:1] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.62000, throughput: 564.81 | 2022-05-21 12:28:05.702 [rank:3] [train], epoch: 28/50, iter: 100/834, loss: 0.30359, top1: 0.64182, throughput: 1316.20 | 2022-05-21 12:28:20.137 [rank:7] [train], epoch: 28/50, iter: 100/834, loss: 0.30731, top1: 0.63349, throughput: 1299.49 | 2022-05-21 12:28:20.137 [rank:1] [train], epoch: 28/50, iter: 100/834, loss: 0.30718, top1: 0.63953, throughput: 1330.13 | 2022-05-21 12:28:20.137 [rank:5] [train], epoch: 28/50, iter: 100/834, loss: 0.30289, top1: 0.64365, throughput: 1329.62 | 2022-05-21 12:28:20.137 [rank:2] [train], epoch: 28/50, iter: 100/834, loss: 0.30793, top1: 0.63625, throughput: 1311.94 | 2022-05-21 12:28:20.137 [rank:4] [train], epoch: 28/50, iter: 100/834, loss: 0.30343, top1: 0.64505, throughput: 1325.99 | 2022-05-21 12:28:20.139 [rank:6] [train], epoch: 28/50, iter: 100/834, loss: 0.30335, top1: 0.64354, throughput: 1312.72[rank:0] [train], epoch: 28/50, iter: 100/834, loss: 0.30419, top1: 0.64219, throughput: 1299.41 | 2022-05-21 12:28:20.139 | 2022-05-21 12:28:20.140 [rank:3] [train], epoch: 28/50, iter: 200/834, loss: 0.30719, top1: 0.64177, throughput: 1329.05 | 2022-05-21 12:28:34.583 [rank:5] [train], epoch: 28/50, iter: 200/834, loss: 0.30353, top1: 0.64932, throughput: 1329.17 | 2022-05-21 12:28:34.582 [rank:2] [train], epoch: 28/50, iter: 200/834, loss: 0.30427, top1: 0.63958, throughput: 1329.15 | 2022-05-21 12:28:34.583 [rank:7] [train], epoch: 28/50, iter: 200/834, loss: 0.30619, top1: 0.63589, throughput: 1328.97 | 2022-05-21 12:28:34.584 [rank:1] [train], epoch: 28/50, iter: 200/834, loss: 0.30675, top1: 0.63651, throughput: 1329.11 | 2022-05-21 12:28:34.583 [rank:0] [train], epoch: 28/50, iter: 200/834, loss: 0.30734, top1: 0.63708, throughput: 1329.12 | 2022-05-21 12:28:34.585 [rank:6] [train], epoch: 28/50, iter: 200/834, loss: 0.30855, top1: 0.63031, throughput: 1329.13 | 2022-05-21 12:28:34.585 [rank:4] [train], epoch: 28/50, iter: 200/834, loss: 0.30693, top1: 0.63599, throughput: 1328.87 | 2022-05-21 12:28:34.588 [rank:7] [train], epoch: 28/50, iter: 300/834, loss: 0.30700, top1: 0.63339, throughput: 1319.74 | 2022-05-21 12:28:49.132 [rank:5] [train], epoch: 28/50, iter: 300/834, loss: 0.30380, top1: 0.64302, throughput: 1319.52 | 2022-05-21 12:28:49.133 [rank:4] [train], epoch: 28/50, iter: 300/834, loss: 0.30587, top1: 0.63823, throughput: 1320.02 | 2022-05-21 12:28:49.133 [rank:1] [train], epoch: 28/50, iter: 300/834, loss: 0.30635, top1: 0.63797, throughput: 1319.61 | 2022-05-21 12:28:49.133 [rank:6] [train], epoch: 28/50, iter: 300/834, loss: 0.30559, top1: 0.63776, throughput: 1319.74 | 2022-05-21 12:28:49.133 [rank:0] [train], epoch: 28/50, iter: 300/834, loss: 0.30790, top1: 0.63672, throughput: 1319.72 | 2022-05-21 12:28:49.134 [rank:3] [train], epoch: 28/50, iter: 300/834, loss: 0.30743, top1: 0.63464, throughput: 1319.33 | 2022-05-21 12:28:49.136 [rank:2] [train], epoch: 28/50, iter: 300/834, loss: 0.30671, top1: 0.63547, throughput: 1319.30 | 2022-05-21 12:28:49.136 [rank:3] [train], epoch: 28/50, iter: 400/834, loss: 0.30516, top1: 0.63911, throughput: 1330.25 | 2022-05-21 12:29:03.569 [rank:4] [train], epoch: 28/50, iter: 400/834, loss: 0.30691, top1: 0.63448, throughput: 1329.86 | 2022-05-21 12:29:03.570 [rank:0] [train], epoch: 28/50, iter: 400/834, loss: 0.30831, top1: 0.63781, throughput: 1329.99 | 2022-05-21 12:29:03.570 [rank:7] [train], epoch: 28/50, iter: 400/834, loss: 0.30634, top1: 0.63557, throughput: 1329.81 | 2022-05-21 12:29:03.571 [rank:2] [train], epoch: 28/50, iter: 400/834, loss: 0.30577, top1: 0.63802, throughput: 1330.11 | 2022-05-21 12:29:03.571 [rank:1] [train], epoch: 28/50, iter: 400/834, loss: 0.30603, top1: 0.64016, throughput: 1329.77 | 2022-05-21 12:29:03.571 [rank:5] [train], epoch: 28/50, iter: 400/834, loss: 0.30845, top1: 0.63182, throughput: 1329.57 | 2022-05-21 12:29:03.574 [rank:6] [train], epoch: 28/50, iter: 400/834, loss: 0.30868, top1: 0.63521, throughput: 1329.59 | 2022-05-21 12:29:03.574 [rank:4] [train], epoch: 28/50, iter: 500/834, loss: 0.30696, top1: 0.63672, throughput: 1330.28 | 2022-05-21 12:29:18.004 [rank:6] [train], epoch: 28/50, iter: 500/834, loss: 0.30732, top1: 0.63557, throughput: 1330.52 | 2022-05-21 12:29:18.004 [rank:2] [train], epoch: 28/50, iter: 500/834, loss: 0.30845, top1: 0.63563, throughput: 1330.26 | 2022-05-21 12:29:18.004 [rank:0] [train], epoch: 28/50, iter: 500/834, loss: 0.30747, top1: 0.63943, throughput: 1330.18 | 2022-05-21 12:29:18.004 [rank:7] [train], epoch: 28/50, iter: 500/834, loss: 0.30366, top1: 0.64495, throughput: 1330.22 | 2022-05-21 12:29:18.004 [rank:5] [train], epoch: 28/50, iter: 500/834, loss: 0.30626, top1: 0.63672, throughput: 1330.47 | 2022-05-21 12:29:18.005 [rank:1] [train], epoch: 28/50, iter: 500/834, loss: 0.30896, top1: 0.63625, throughput: 1330.16 | 2022-05-21 12:29:18.006 [rank:3] [train], epoch: 28/50, iter: 500/834, loss: 0.30754, top1: 0.63318, throughput: 1329.94 | 2022-05-21 12:29:18.006 [rank:2] [train], epoch: 28/50, iter: 600/834, loss: 0.30733, top1: 0.63214, throughput: 1328.56 | 2022-05-21 12:29:32.456 [rank:4] [train], epoch: 28/50, iter: 600/834, loss: 0.30594, top1: 0.63641, throughput: 1328.53 | 2022-05-21 12:29:32.456 [rank:3] [train], epoch: 28/50, iter: 600/834, loss: 0.30716, top1: 0.63500, throughput: 1328.68 | 2022-05-21 12:29:32.456 [rank:1] [train], epoch: 28/50, iter: 600/834, loss: 0.30454, top1: 0.64318, throughput: 1328.73 | 2022-05-21 12:29:32.455 [rank:0] [train], epoch: 28/50, iter: 600/834, loss: 0.30864, top1: 0.63130, throughput: 1328.50[rank:7] [train], epoch: 28/50, iter: 600/834, loss: 0.30892, top1: 0.63432, throughput: 1328.62 | 2022-05-21 12:29:32.455 | 2022-05-21 12:29:32.456 [rank:6] [train], epoch: 28/50, iter: 600/834, loss: 0.30796, top1: 0.63401, throughput: 1328.31 | 2022-05-21 12:29:32.459 [rank:5] [train], epoch: 28/50, iter: 600/834, loss: 0.30931, top1: 0.63260, throughput: 1328.39 | 2022-05-21 12:29:32.458 [rank:1] [train], epoch: 28/50, iter: 700/834, loss: 0.30898, top1: 0.63146, throughput: 1328.96 | 2022-05-21 12:29:46.903 [rank:4] [train], epoch: 28/50, iter: 700/834, loss: 0.30653, top1: 0.63458, throughput: 1328.95 | 2022-05-21 12:29:46.903 [rank:7] [train], epoch: 28/50, iter: 700/834, loss: 0.30680, top1: 0.63615, throughput: 1328.89 | 2022-05-21 12:29:46.904 [rank:3] [train], epoch: 28/50, iter: 700/834, loss: 0.30762, top1: 0.63755, throughput: 1328.90 | 2022-05-21 12:29:46.904 [rank:6] [train], epoch: 28/50, iter: 700/834, loss: 0.30688, top1: 0.63599, throughput: 1329.16 | 2022-05-21 12:29:46.904 [rank:5] [train], epoch: 28/50, iter: 700/834, loss: 0.30546, top1: 0.63615, throughput: 1329.14 | 2022-05-21 12:29:46.904 [rank:0] [train], epoch: 28/50, iter: 700/834, loss: 0.30718, top1: 0.63094, throughput: 1328.84 | 2022-05-21 12:29:46.905 [rank:2] [train], epoch: 28/50, iter: 700/834, loss: 0.30572, top1: 0.64031, throughput: 1328.70 | 2022-05-21 12:29:46.906 [rank:5] [train], epoch: 28/50, iter: 800/834, loss: 0.30779, top1: 0.63682, throughput: 1330.69 | 2022-05-21 12:30:01.332 [rank:6] [train], epoch: 28/50, iter: 800/834, loss: 0.31026, top1: 0.63188, throughput: 1330.61 | 2022-05-21 12:30:01.334 [rank:7] [train], epoch: 28/50, iter: 800/834, loss: 0.31006, top1: 0.63057, throughput: 1330.66 | 2022-05-21 12:30:01.332 [rank:2] [train], epoch: 28/50, iter: 800/834, loss: 0.30906, top1: 0.63307, throughput: 1330.58 | 2022-05-21 12:30:01.336 [rank:4] [train], epoch: 28/50, iter: 800/834, loss: 0.30847, top1: 0.63219, throughput: 1330.52 | 2022-05-21 12:30:01.334 [rank:3] [train], epoch: 28/50, iter: 800/834, loss: 0.30735, top1: 0.63563, throughput: 1330.42 | 2022-05-21 12:30:01.336 [rank:1] [train], epoch: 28/50, iter: 800/834, loss: 0.30938, top1: 0.63375, throughput: 1330.31 | 2022-05-21 12:30:01.336 [rank:0] [train], epoch: 28/50, iter: 800/834, loss: 0.30486, top1: 0.64312, throughput: 1330.48 | 2022-05-21 12:30:01.336 [rank:1] [train], epoch: 28/50, iter: 834/834, loss: 0.30566, top1: 0.63680, throughput: 1328.92 | 2022-05-21 12:30:06.248 [rank:7] [train], epoch: 28/50, iter: 834/834, loss: 0.30752, top1: 0.62975, throughput: 1327.94 [rank:6] [train], epoch: 28/50, iter: 834/834, loss: 0.30668, top1: 0.64170, throughput: 1328.32| 2022-05-21 12:30:06.248 | 2022-05-21 12:30:06.248 [rank:2] [train], epoch: 28/50, iter: 834/834, loss: 0.30865, top1: 0.63741, throughput: 1328.93 | 2022-05-21 12:30:06.248 [rank:0] [train], epoch: 28/50, iter: 834/834, loss: 0.30616, top1: 0.63618, throughput: 1328.58 | 2022-05-21 12:30:06.249 [rank:3] [train], epoch: 28/50, iter: 834/834, loss: 0.30382, top1: 0.65104, throughput: 1328.33 | 2022-05-21 12:30:06.250 [rank:5] [train], epoch: 28/50, iter: 834/834, loss: 0.30365, top1: 0.63955, throughput: 1326.69 | 2022-05-21 12:30:06.253 [rank:4] [train], epoch: 28/50, iter: 834/834, loss: 0.30503, top1: 0.64415, throughput: 1326.95 | 2022-05-21 12:30:06.253 [rank:0] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.61200, throughput: 572.65 | 2022-05-21 12:30:17.164 [rank:7] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.60832, throughput: 572.49 | 2022-05-21 12:30:17.166 [rank:2] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.60304, throughput: 569.15 | 2022-05-21 12:30:17.229 [rank:6] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.59456, throughput: 565.48 | 2022-05-21 12:30:17.301 [rank:4] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.60832, throughput: 565.61 | 2022-05-21 12:30:17.303 [rank:3] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.59888, throughput: 564.44 | 2022-05-21 12:30:17.323 [rank:1] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.60720, throughput: 555.49 | 2022-05-21 12:30:17.499 [rank:5] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.59712, throughput: 554.20 | 2022-05-21 12:30:17.530 [rank:5] [train], epoch: 29/50, iter: 100/834, loss: 0.29892, top1: 0.65214, throughput: 1329.89 | 2022-05-21 12:30:31.968 [rank:1] [train], epoch: 29/50, iter: 100/834, loss: 0.30246, top1: 0.64844, throughput: 1326.99 | 2022-05-21 12:30:31.968 [rank:7] [train], epoch: 29/50, iter: 100/834, loss: 0.30036, top1: 0.65135, throughput: 1297.02 | 2022-05-21 12:30:31.969 [rank:3] [train], epoch: 29/50, iter: 100/834, loss: 0.30041, top1: 0.64604, throughput: 1310.81 | 2022-05-21 12:30:31.971 [rank:2] [train], epoch: 29/50, iter: 100/834, loss: 0.29939, top1: 0.64771, throughput: 1302.48 | 2022-05-21 12:30:31.970 [rank:6] [train], epoch: 29/50, iter: 100/834, loss: 0.30051, top1: 0.65391, throughput: 1308.73 | 2022-05-21 12:30:31.971 [rank:0] [train], epoch: 29/50, iter: 100/834, loss: 0.30187, top1: 0.64557, throughput: 1296.84 | 2022-05-21 12:30:31.969 [rank:4] [train], epoch: 29/50, iter: 100/834, loss: 0.30122, top1: 0.64786, throughput: 1308.88 | 2022-05-21 12:30:31.972 [rank:6] [train], epoch: 29/50, iter: 200/834, loss: 0.30092, top1: 0.64943, throughput: 1327.50[rank:2] [train], epoch: 29/50, iter: 200/834, loss: 0.30217, top1: 0.64667, throughput: 1327.49 | 2022-05-21 12:30:46.435 | 2022-05-21 12:30:46.434 [rank:1] [train], epoch: 29/50, iter: 200/834, loss: 0.30363, top1: 0.64318, throughput: 1327.25 | 2022-05-21 12:30:46.434 [rank:7] [train], epoch: 29/50, iter: 200/834, loss: 0.30134, top1: 0.64578, throughput: 1327.28 | 2022-05-21 12:30:46.434 [rank:5] [train], epoch: 29/50, iter: 200/834, loss: 0.30247, top1: 0.64516, throughput: 1327.19 | 2022-05-21 12:30:46.434 [rank:4] [train], epoch: 29/50, iter: 200/834, loss: 0.29965, top1: 0.64630, throughput: 1327.39 | 2022-05-21 12:30:46.437 [rank:3] [train], epoch: 29/50, iter: 200/834, loss: 0.30104, top1: 0.64552, throughput: 1327.23 | 2022-05-21 12:30:46.437 [rank:0] [train], epoch: 29/50, iter: 200/834, loss: 0.30167, top1: 0.64865, throughput: 1327.04 | 2022-05-21 12:30:46.437 [rank:3] [train], epoch: 29/50, iter: 300/834, loss: 0.30345, top1: 0.63964, throughput: 1329.08 | 2022-05-21 12:31:00.883 [rank:4] [train], epoch: 29/50, iter: 300/834, loss: 0.30338, top1: 0.64417, throughput: 1329.04 | 2022-05-21 12:31:00.883 [rank:2] [train], epoch: 29/50, iter: 300/834, loss: 0.30298, top1: 0.64438, throughput: 1328.81 | 2022-05-21 12:31:00.883 [rank:1] [train], epoch: 29/50, iter: 300/834, loss: 0.30367, top1: 0.64323, throughput: 1328.62 | 2022-05-21 12:31:00.885 [rank:6] [train], epoch: 29/50, iter: 300/834, loss: 0.30469, top1: 0.63964, throughput: 1328.71 | 2022-05-21 12:31:00.885 [rank:0] [train], epoch: 29/50, iter: 300/834, loss: 0.30262, top1: 0.64448, throughput: 1328.98 | 2022-05-21 12:31:00.884 [rank:7] [train], epoch: 29/50, iter: 300/834, loss: 0.30268, top1: 0.64719, throughput: 1328.45 | 2022-05-21 12:31:00.887 [rank:5] [train], epoch: 29/50, iter: 300/834, loss: 0.30473, top1: 0.64073, throughput: 1328.44 | 2022-05-21 12:31:00.888 [rank:6] [train], epoch: 29/50, iter: 400/834, loss: 0.30073, top1: 0.65302, throughput: 1330.56 | 2022-05-21 12:31:15.315 [rank:3] [train], epoch: 29/50, iter: 400/834, loss: 0.30079, top1: 0.64979, throughput: 1330.35 | 2022-05-21 12:31:15.315 [rank:0] [train], epoch: 29/50, iter: 400/834, loss: 0.30363, top1: 0.64188, throughput: 1330.56 | 2022-05-21 12:31:15.314 [rank:2] [train], epoch: 29/50, iter: 400/834, loss: 0.30528, top1: 0.63823, throughput: 1330.34 | 2022-05-21 12:31:15.315 [rank:4] [train], epoch: 29/50, iter: 400/834, loss: 0.30399, top1: 0.63880, throughput: 1330.21 | 2022-05-21 12:31:15.317 [rank:1] [train], epoch: 29/50, iter: 400/834, loss: 0.30510, top1: 0.64443, throughput: 1330.50 | 2022-05-21 12:31:15.316 [rank:7] [train], epoch: 29/50, iter: 400/834, loss: 0.30510, top1: 0.64172, throughput: 1330.41 | 2022-05-21 12:31:15.319 [rank:5] [train], epoch: 29/50, iter: 400/834, loss: 0.30439, top1: 0.64135, throughput: 1330.37 | 2022-05-21 12:31:15.320 [rank:5] [train], epoch: 29/50, iter: 500/834, loss: 0.30219, top1: 0.64188, throughput: 1328.39 | 2022-05-21 12:31:29.773 [rank:6] [train], epoch: 29/50, iter: 500/834, loss: 0.30306, top1: 0.64411, throughput: 1327.91 | 2022-05-21 12:31:29.774 [rank:4] [train], epoch: 29/50, iter: 500/834, loss: 0.30257, top1: 0.64729, throughput: 1328.09 | 2022-05-21 12:31:29.774 [rank:0] [train], epoch: 29/50, iter: 500/834, loss: 0.30049, top1: 0.65083, throughput: 1327.82 | 2022-05-21 12:31:29.774 [rank:7] [train], epoch: 29/50, iter: 500/834, loss: 0.30312, top1: 0.64854, throughput: 1328.14 | 2022-05-21 12:31:29.775 [rank:1] [train], epoch: 29/50, iter: 500/834, loss: 0.30355, top1: 0.64260, throughput: 1327.84 | 2022-05-21 12:31:29.775 [rank:3] [train], epoch: 29/50, iter: 500/834, loss: 0.30447, top1: 0.64016, throughput: 1327.80 | 2022-05-21 12:31:29.775 [rank:2] [train], epoch: 29/50, iter: 500/834, loss: 0.30226, top1: 0.64792, throughput: 1327.77 | 2022-05-21 12:31:29.775 [rank:7] [train], epoch: 29/50, iter: 600/834, loss: 0.30798, top1: 0.63745, throughput: 1323.23 | 2022-05-21 12:31:44.285 [rank:6] [train], epoch: 29/50, iter: 600/834, loss: 0.30814, top1: 0.63359, throughput: 1323.06 | 2022-05-21 12:31:44.285 [rank:4] [train], epoch: 29/50, iter: 600/834, loss: 0.30529, top1: 0.63927, throughput: 1322.84 | 2022-05-21 12:31:44.288 [rank:0] [train], epoch: 29/50, iter: 600/834, loss: 0.30327, top1: 0.64141, throughput: 1323.09 | 2022-05-21 12:31:44.286[rank:1] [train], epoch: 29/50, iter: 600/834, loss: 0.30341, top1: 0.63760, throughput: 1323.21 | 2022-05-21 12:31:44.286 [rank:5] [train], epoch: 29/50, iter: 600/834, loss: 0.30484, top1: 0.64333, throughput: 1323.03 | 2022-05-21 12:31:44.285 [rank:3] [train], epoch: 29/50, iter: 600/834, loss: 0.30314, top1: 0.64193, throughput: 1323.02 | 2022-05-21 12:31:44.288 [rank:2] [train], epoch: 29/50, iter: 600/834, loss: 0.30547, top1: 0.64271, throughput: 1323.04 | 2022-05-21 12:31:44.287 [rank:5] [train], epoch: 29/50, iter: 700/834, loss: 0.30277, top1: 0.64323, throughput: 1329.24 | 2022-05-21 12:31:58.730 [rank:7] [train], epoch: 29/50, iter: 700/834, loss: 0.30425, top1: 0.63906, throughput: 1329.23 | 2022-05-21 12:31:58.730 [rank:6] [train], epoch: 29/50, iter: 700/834, loss: 0.30372, top1: 0.64323, throughput: 1329.15 | 2022-05-21 12:31:58.731 [rank:3] [train], epoch: 29/50, iter: 700/834, loss: 0.30222, top1: 0.64656, throughput: 1329.33 | 2022-05-21 12:31:58.731 [rank:2] [train], epoch: 29/50, iter: 700/834, loss: 0.30246, top1: 0.64568, throughput: 1329.27 | 2022-05-21 12:31:58.731 [rank:1] [train], epoch: 29/50, iter: 700/834, loss: 0.30160, top1: 0.64729, throughput: 1329.16 | 2022-05-21 12:31:58.731 [rank:4] [train], epoch: 29/50, iter: 700/834, loss: 0.30474, top1: 0.64177, throughput: 1328.99 | 2022-05-21 12:31:58.735 [rank:0] [train], epoch: 29/50, iter: 700/834, loss: 0.30593, top1: 0.63807, throughput: 1329.03 | 2022-05-21 12:31:58.732 [rank:6] [train], epoch: 29/50, iter: 800/834, loss: 0.30608, top1: 0.64146, throughput: 1328.63 | 2022-05-21 12:32:13.182 [rank:7] [train], epoch: 29/50, iter: 800/834, loss: 0.30559, top1: 0.63990, throughput: 1328.49 | 2022-05-21 12:32:13.182 [rank:2] [train], epoch: 29/50, iter: 800/834, loss: 0.30412, top1: 0.64245, throughput: 1328.87 | 2022-05-21 12:32:13.180 [rank:4] [train], epoch: 29/50, iter: 800/834, loss: 0.30515, top1: 0.64099, throughput: 1329.06 | 2022-05-21 12:32:13.181 [rank:5] [train], epoch: 29/50, iter: 800/834, loss: 0.30426, top1: 0.64385, throughput: 1328.45 | 2022-05-21 12:32:13.183 [rank:3] [train], epoch: 29/50, iter: 800/834, loss: 0.30508, top1: 0.63932, throughput: 1328.49 | 2022-05-21 12:32:13.183 [rank:0] [train], epoch: 29/50, iter: 800/834, loss: 0.30424, top1: 0.64724, throughput: 1328.58 | 2022-05-21 12:32:13.184 [rank:1] [train], epoch: 29/50, iter: 800/834, loss: 0.30213, top1: 0.64755, throughput: 1328.48 | 2022-05-21 12:32:13.183 [rank:7] [train], epoch: 29/50, iter: 834/834, loss: 0.30453, top1: 0.64032, throughput: 1326.67 | 2022-05-21 12:32:18.103 [rank:4] [train], epoch: 29/50, iter: 834/834, loss: 0.30273, top1: 0.64752, throughput: 1326.37 | 2022-05-21 12:32:18.103 [rank:2] [train], epoch: 29/50, iter: 834/834, loss: 0.30679, top1: 0.63312, throughput: 1326.02 | 2022-05-21 12:32:18.103 [rank:0] [train], epoch: 29/50, iter: 834/834, loss: 0.30323, top1: 0.64936, throughput: 1326.78 | 2022-05-21 12:32:18.104 [rank:5] [train], epoch: 29/50, iter: 834/834, loss: 0.30414, top1: 0.64292, throughput: 1326.21 | 2022-05-21 12:32:18.105 [rank:6] [train], epoch: 29/50, iter: 834/834, loss: 0.30537, top1: 0.63343, throughput: 1325.99 | 2022-05-21 12:32:18.105 [rank:1] [train], epoch: 29/50, iter: 834/834, loss: 0.30380, top1: 0.64461, throughput: 1325.74 | 2022-05-21 12:32:18.107 [rank:3] [train], epoch: 29/50, iter: 834/834, loss: 0.30670, top1: 0.63680, throughput: 1325.77 | 2022-05-21 12:32:18.107 [rank:7] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.64064, throughput: 570.49 | 2022-05-21 12:32:29.058 [rank:0] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.65408, throughput: 570.53 | 2022-05-21 12:32:29.059 [rank:2] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.63584, throughput: 570.36 | 2022-05-21 12:32:29.061 [rank:4] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.63504, throughput: 566.44 | 2022-05-21 12:32:29.137 [rank:3] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.63472, throughput: 566.09 | 2022-05-21 12:32:29.148 [rank:1] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.64080, throughput: 563.41 | 2022-05-21 12:32:29.201 [rank:6] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.64192, throughput: 563.12 | 2022-05-21 12:32:29.204 [rank:5] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.62880, throughput: 553.93 | 2022-05-21 12:32:29.388 [rank:5] [train], epoch: 30/50, iter: 100/834, loss: 0.29983, top1: 0.65005, throughput: 1334.94 | 2022-05-21 12:32:43.771 [rank:6] [train], epoch: 30/50, iter: 100/834, loss: 0.29806, top1: 0.65547, throughput: 1317.98 | 2022-05-21 12:32:43.771 [rank:7] [train], epoch: 30/50, iter: 100/834, loss: 0.29816, top1: 0.65620, throughput: 1304.99 | 2022-05-21 12:32:43.771 [rank:2] [train], epoch: 30/50, iter: 100/834, loss: 0.29926, top1: 0.65203, throughput: 1305.23 | 2022-05-21 12:32:43.771 [rank:4] [train], epoch: 30/50, iter: 100/834, loss: 0.30008, top1: 0.64948, throughput: 1311.98 | 2022-05-21 12:32:43.771 [rank:3] [train], epoch: 30/50, iter: 100/834, loss: 0.29933, top1: 0.64849, throughput: 1312.88 | 2022-05-21 12:32:43.772 [rank:1] [train], epoch: 30/50, iter: 100/834, loss: 0.29493, top1: 0.65771, throughput: 1317.69 | 2022-05-21 12:32:43.772 [rank:0] [train], epoch: 30/50, iter: 100/834, loss: 0.29753, top1: 0.65526, throughput: 1304.81 | 2022-05-21 12:32:43.773 [rank:7] [train], epoch: 30/50, iter: 200/834, loss: 0.30163, top1: 0.64776, throughput: 1328.20 | 2022-05-21 12:32:58.227 [rank:3] [train], epoch: 30/50, iter: 200/834, loss: 0.29954, top1: 0.65302, throughput: 1328.26 | 2022-05-21 12:32:58.227 [rank:4] [train], epoch: 30/50, iter: 200/834, loss: 0.29931, top1: 0.65214, throughput: 1328.17 | 2022-05-21 12:32:58.227 [rank:0] [train], epoch: 30/50, iter: 200/834, loss: 0.30082, top1: 0.64818, throughput: 1328.37 | 2022-05-21 12:32:58.227 [rank:5] [train], epoch: 30/50, iter: 200/834, loss: 0.29938, top1: 0.65469, throughput: 1328.05 | 2022-05-21 12:32:58.228 [rank:6] [train], epoch: 30/50, iter: 200/834, loss: 0.29570, top1: 0.65682, throughput: 1328.01 | 2022-05-21 12:32:58.229 [rank:1] [train], epoch: 30/50, iter: 200/834, loss: 0.29885, top1: 0.65120, throughput: 1328.04 | 2022-05-21 12:32:58.229 [rank:2] [train], epoch: 30/50, iter: 200/834, loss: 0.29781, top1: 0.65224, throughput: 1327.95 | 2022-05-21 12:32:58.229 [rank:1] [train], epoch: 30/50, iter: 300/834, loss: 0.30128, top1: 0.64510, throughput: 1329.84 | 2022-05-21 12:33:12.667 [rank:2] [train], epoch: 30/50, iter: 300/834, loss: 0.29995, top1: 0.64974, throughput: 1329.81[rank:4] [train], epoch: 30/50, iter: 300/834, loss: 0.30329, top1: 0.64438, throughput: 1329.51 | 2022-05-21 12:33:12.667 | 2022-05-21 12:33:12.668 [rank:7] [train], epoch: 30/50, iter: 300/834, loss: 0.30024, top1: 0.64729, throughput: 1329.49 | 2022-05-21 12:33:12.668 [rank:5] [train], epoch: 30/50, iter: 300/834, loss: 0.29733, top1: 0.65526, throughput: 1329.31 | 2022-05-21 12:33:12.671 [rank:3] [train], epoch: 30/50, iter: 300/834, loss: 0.29986, top1: 0.65625, throughput: 1329.37 | 2022-05-21 12:33:12.670 [rank:6] [train], epoch: 30/50, iter: 300/834, loss: 0.29953, top1: 0.65224, throughput: 1329.36 | 2022-05-21 12:33:12.672 [rank:0] [train], epoch: 30/50, iter: 300/834, loss: 0.29840, top1: 0.65484, throughput: 1329.32 | 2022-05-21 12:33:12.671 [rank:3] [train], epoch: 30/50, iter: 400/834, loss: 0.29780, top1: 0.65286, throughput: 1328.74 | 2022-05-21 12:33:27.120 [rank:6] [train], epoch: 30/50, iter: 400/834, loss: 0.29764, top1: 0.65177, throughput: 1328.86 | 2022-05-21 12:33:27.121 [rank:4] [train], epoch: 30/50, iter: 400/834, loss: 0.30208, top1: 0.64708, throughput: 1328.53 | 2022-05-21 12:33:27.120 [rank:5] [train], epoch: 30/50, iter: 400/834, loss: 0.29990, top1: 0.65281, throughput: 1328.82 | 2022-05-21 12:33:27.120 [rank:1] [train], epoch: 30/50, iter: 400/834, loss: 0.30078, top1: 0.65313, throughput: 1328.21 | 2022-05-21 12:33:27.122 [rank:0] [train], epoch: 30/50, iter: 400/834, loss: 0.29926, top1: 0.65333, throughput: 1328.53 | 2022-05-21 12:33:27.123 [rank:7] [train], epoch: 30/50, iter: 400/834, loss: 0.29863, top1: 0.65297, throughput: 1328.33 | 2022-05-21 12:33:27.123 [rank:2] [train], epoch: 30/50, iter: 400/834, loss: 0.30016, top1: 0.65302, throughput: 1328.26 | 2022-05-21 12:33:27.122 [rank:3] [train], epoch: 30/50, iter: 500/834, loss: 0.30212, top1: 0.64635, throughput: 1331.09 | 2022-05-21 12:33:41.544 [rank:7] [train], epoch: 30/50, iter: 500/834, loss: 0.29898, top1: 0.65078, throughput: 1331.34 | 2022-05-21 12:33:41.544 [rank:6] [train], epoch: 30/50, iter: 500/834, loss: 0.30037, top1: 0.64990, throughput: 1331.12 | 2022-05-21 12:33:41.545 [rank:5] [train], epoch: 30/50, iter: 500/834, loss: 0.29883, top1: 0.64995, throughput: 1331.09 | 2022-05-21 12:33:41.545 [rank:4] [train], epoch: 30/50, iter: 500/834, loss: 0.29935, top1: 0.65318, throughput: 1331.01 | 2022-05-21 12:33:41.546 [rank:2] [train], epoch: 30/50, iter: 500/834, loss: 0.30334, top1: 0.64141, throughput: 1331.14 | 2022-05-21 12:33:41.546 [rank:1] [train], epoch: 30/50, iter: 500/834, loss: 0.30053, top1: 0.64786, throughput: 1331.33 | 2022-05-21 12:33:41.544 [rank:0] [train], epoch: 30/50, iter: 500/834, loss: 0.30137, top1: 0.64589, throughput: 1331.11 | 2022-05-21 12:33:41.547 [rank:4] [train], epoch: 30/50, iter: 600/834, loss: 0.30244, top1: 0.64349, throughput: 1329.06 | 2022-05-21 12:33:55.992 [rank:5] [train], epoch: 30/50, iter: 600/834, loss: 0.30243, top1: 0.64656, throughput: 1329.04 | 2022-05-21 12:33:55.991 [rank:6] [train], epoch: 30/50, iter: 600/834, loss: 0.29860, top1: 0.65266, throughput: 1328.97 | 2022-05-21 12:33:55.992 [rank:0] [train], epoch: 30/50, iter: 600/834, loss: 0.29853, top1: 0.65052, throughput: 1329.08 | 2022-05-21 12:33:55.993 [rank:7] [train], epoch: 30/50, iter: 600/834, loss: 0.30049, top1: 0.64865, throughput: 1328.85 | 2022-05-21 12:33:55.993[rank:1] [train], epoch: 30/50, iter: 600/834, loss: 0.30064, top1: 0.64891, throughput: 1328.91 | 2022-05-21 12:33:55.992 [rank:3] [train], epoch: 30/50, iter: 600/834, loss: 0.30245, top1: 0.64708, throughput: 1328.83 | 2022-05-21 12:33:55.993 [rank:2] [train], epoch: 30/50, iter: 600/834, loss: 0.30207, top1: 0.64469, throughput: 1329.00 | 2022-05-21 12:33:55.993 [rank:5] [train], epoch: 30/50, iter: 700/834, loss: 0.30152, top1: 0.64698, throughput: 1326.55 | 2022-05-21 12:34:10.465 [rank:3] [train], epoch: 30/50, iter: 700/834, loss: 0.29915, top1: 0.65135, throughput: 1326.71 | 2022-05-21 12:34:10.465 [rank:4] [train], epoch: 30/50, iter: 700/834, loss: 0.30066, top1: 0.64760, throughput: 1326.55 | 2022-05-21 12:34:10.466 [rank:7] [train], epoch: 30/50, iter: 700/834, loss: 0.29726, top1: 0.65375, throughput: 1326.63 | 2022-05-21 12:34:10.466 [rank:1] [train], epoch: 30/50, iter: 700/834, loss: 0.29874, top1: 0.65151, throughput: 1326.56 | 2022-05-21 12:34:10.465 [rank:2] [train], epoch: 30/50, iter: 700/834, loss: 0.29918, top1: 0.64891, throughput: 1326.61 | 2022-05-21 12:34:10.466 [rank:6] [train], epoch: 30/50, iter: 700/834, loss: 0.29925, top1: 0.65104, throughput: 1326.42 | 2022-05-21 12:34:10.467 [rank:0] [train], epoch: 30/50, iter: 700/834, loss: 0.30002, top1: 0.65208, throughput: 1326.41 | 2022-05-21 12:34:10.468 [rank:4] [train], epoch: 30/50, iter: 800/834, loss: 0.29987, top1: 0.65036, throughput: 1327.79 | 2022-05-21 12:34:24.926 [rank:6] [train], epoch: 30/50, iter: 800/834, loss: 0.30007, top1: 0.65068, throughput: 1327.87 | 2022-05-21 12:34:24.926 [rank:7] [train], epoch: 30/50, iter: 800/834, loss: 0.29892, top1: 0.65073, throughput: 1327.80 | 2022-05-21 12:34:24.926 [rank:1] [train], epoch: 30/50, iter: 800/834, loss: 0.30213, top1: 0.64229, throughput: 1327.74 | 2022-05-21 12:34:24.926 [rank:2] [train], epoch: 30/50, iter: 800/834, loss: 0.30027, top1: 0.64797, throughput: 1327.83 | 2022-05-21 12:34:24.926 [rank:3] [train], epoch: 30/50, iter: 800/834, loss: 0.29932, top1: 0.64964, throughput: 1327.61 | 2022-05-21 12:34:24.927 [rank:0] [train], epoch: 30/50, iter: 800/834, loss: 0.30030, top1: 0.64979, throughput: 1327.86 | 2022-05-21 12:34:24.927 [rank:5] [train], epoch: 30/50, iter: 800/834, loss: 0.30001, top1: 0.65089, throughput: 1327.57 | 2022-05-21 12:34:24.927 [rank:5] [train], epoch: 30/50, iter: 834/834, loss: 0.30296, top1: 0.64246, throughput: 1328.16 | 2022-05-21 12:34:29.842 [rank:2] [train], epoch: 30/50, iter: 834/834, loss: 0.29849, top1: 0.65365, throughput: 1327.83 | 2022-05-21 12:34:29.842 [rank:7] [train], epoch: 30/50, iter: 834/834, loss: 0.30429, top1: 0.63726, throughput: 1327.71 | 2022-05-21 12:34:29.842 [rank:1] [train], epoch: 30/50, iter: 834/834, loss: 0.30334, top1: 0.64017, throughput: 1327.44 | 2022-05-21 12:34:29.844 [rank:6] [train], epoch: 30/50, iter: 834/834, loss: 0.30489, top1: 0.64292, throughput: 1327.35 | 2022-05-21 12:34:29.844 [rank:3] [train], epoch: 30/50, iter: 834/834, loss: 0.30045, top1: 0.64568, throughput: 1327.37 | 2022-05-21 12:34:29.845 [rank:0] [train], epoch: 30/50, iter: 834/834, loss: 0.29940, top1: 0.65349, throughput: 1327.24 | 2022-05-21 12:34:29.846 [rank:4] [train], epoch: 30/50, iter: 834/834, loss: 0.29996, top1: 0.64691, throughput: 1326.90 | 2022-05-21 12:34:29.845 [rank:0] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.67184, throughput: 559.64 | 2022-05-21 12:34:41.014 [rank:7] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.66368, throughput: 559.25 | 2022-05-21 12:34:41.018 [rank:2] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.65424, throughput: 558.09 | 2022-05-21 12:34:41.041 [rank:1] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.66800, throughput: 557.21 | 2022-05-21 12:34:41.060 [rank:3] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.66416, throughput: 556.58 | 2022-05-21 12:34:41.074 [rank:4] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.66912, throughput: 556.43 | 2022-05-21 12:34:41.078 [rank:6] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.66176, throughput: 553.05 | 2022-05-21 12:34:41.145 [rank:5] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.65360, throughput: 543.73 | 2022-05-21 12:34:41.337 [rank:5] [train], epoch: 31/50, iter: 100/834, loss: 0.29171, top1: 0.66583, throughput: 1333.22 | 2022-05-21 12:34:55.738 [rank:3] [train], epoch: 31/50, iter: 100/834, loss: 0.29298, top1: 0.66328, throughput: 1309.25 | 2022-05-21 12:34:55.739 [rank:4] [train], epoch: 31/50, iter: 100/834, loss: 0.29579, top1: 0.65604, throughput: 1309.46 | 2022-05-21 12:34:55.740 [rank:1] [train], epoch: 31/50, iter: 100/834, loss: 0.29298, top1: 0.66427, throughput: 1308.05 | 2022-05-21 12:34:55.739 [rank:2] [train], epoch: 31/50, iter: 100/834, loss: 0.29325, top1: 0.66687, throughput: 1306.21 | 2022-05-21 12:34:55.740 [rank:0] [train], epoch: 31/50, iter: 100/834, loss: 0.29344, top1: 0.66344, throughput: 1303.74 | 2022-05-21 12:34:55.741 [rank:6] [train], epoch: 31/50, iter: 100/834, loss: 0.29206, top1: 0.66693, throughput: 1315.26 | 2022-05-21 12:34:55.743 [rank:7] [train], epoch: 31/50, iter: 100/834, loss: 0.29261, top1: 0.66271, throughput: 1303.96 | 2022-05-21 12:34:55.742 [rank:0] [train], epoch: 31/50, iter: 200/834, loss: 0.29954, top1: 0.64937, throughput: 1322.42 | 2022-05-21 12:35:10.259 [rank:5] [train], epoch: 31/50, iter: 200/834, loss: 0.29620, top1: 0.65885, throughput: 1322.32 | 2022-05-21 12:35:10.258 [rank:4] [train], epoch: 31/50, iter: 200/834, loss: 0.29331, top1: 0.66089, throughput: 1322.41 | 2022-05-21 12:35:10.259 [rank:7] [train], epoch: 31/50, iter: 200/834, loss: 0.29759, top1: 0.65479, throughput: 1322.64 | 2022-05-21 12:35:10.259 [rank:1] [train], epoch: 31/50, iter: 200/834, loss: 0.29612, top1: 0.65792, throughput: 1322.38 | 2022-05-21 12:35:10.258 [rank:6] [train], epoch: 31/50, iter: 200/834, loss: 0.29469, top1: 0.65885, throughput: 1322.64 | 2022-05-21 12:35:10.260 [rank:3] [train], epoch: 31/50, iter: 200/834, loss: 0.29322, top1: 0.66458, throughput: 1322.20 | 2022-05-21 12:35:10.260 [rank:2] [train], epoch: 31/50, iter: 200/834, loss: 0.29354, top1: 0.66016, throughput: 1322.33 | 2022-05-21 12:35:10.260 [rank:6] [train], epoch: 31/50, iter: 300/834, loss: 0.29791, top1: 0.65729, throughput: 1327.47 | 2022-05-21 12:35:24.723 [rank:5] [train], epoch: 31/50, iter: 300/834, loss: 0.29585, top1: 0.65786, throughput: 1327.35 | 2022-05-21 12:35:24.723 [rank:3] [train], epoch: 31/50, iter: 300/834, loss: 0.29568, top1: 0.65698, throughput: 1327.54 | 2022-05-21 12:35:24.723 [rank:1] [train], epoch: 31/50, iter: 300/834, loss: 0.29590, top1: 0.66005, throughput: 1327.25 | 2022-05-21 12:35:24.724 [rank:2] [train], epoch: 31/50, iter: 300/834, loss: 0.29608, top1: 0.65786, throughput: 1327.48 | 2022-05-21 12:35:24.723 [rank:0] [train], epoch: 31/50, iter: 300/834, loss: 0.29602, top1: 0.65432, throughput: 1327.44 | 2022-05-21 12:35:24.723 [rank:4] [train], epoch: 31/50, iter: 300/834, loss: 0.29824, top1: 0.65375, throughput: 1327.25 | 2022-05-21 12:35:24.725 [rank:7] [train], epoch: 31/50, iter: 300/834, loss: 0.29597, top1: 0.65922, throughput: 1327.22 | 2022-05-21 12:35:24.725 [rank:4] [train], epoch: 31/50, iter: 400/834, loss: 0.29460, top1: 0.66000, throughput: 1330.88 | 2022-05-21 12:35:39.152 [rank:5] [train], epoch: 31/50, iter: 400/834, loss: 0.29806, top1: 0.65365, throughput: 1330.62 | 2022-05-21 12:35:39.152 [rank:0] [train], epoch: 31/50, iter: 400/834, loss: 0.29819, top1: 0.65333, throughput: 1330.69 | 2022-05-21 12:35:39.152 [rank:2] [train], epoch: 31/50, iter: 400/834, loss: 0.29693, top1: 0.65667, throughput: 1330.67 | 2022-05-21 12:35:39.152 [rank:7] [train], epoch: 31/50, iter: 400/834, loss: 0.29619, top1: 0.66068, throughput: 1330.75 | 2022-05-21 12:35:39.153 [rank:6] [train], epoch: 31/50, iter: 400/834, loss: 0.29581, top1: 0.65370, throughput: 1330.53 | 2022-05-21 12:35:39.153 [rank:3] [train], epoch: 31/50, iter: 400/834, loss: 0.29257, top1: 0.67047, throughput: 1330.48 | 2022-05-21 12:35:39.154 [rank:1] [train], epoch: 31/50, iter: 400/834, loss: 0.29921, top1: 0.65167, throughput: 1330.56 | 2022-05-21 12:35:39.154 [rank:0] [train], epoch: 31/50, iter: 500/834, loss: 0.29670, top1: 0.66026, throughput: 1328.88 | 2022-05-21 12:35:53.600 [rank:5] [train], epoch: 31/50, iter: 500/834, loss: 0.29679, top1: 0.65120, throughput: 1328.87 | 2022-05-21 12:35:53.601 [rank:6] [train], epoch: 31/50, iter: 500/834, loss: 0.29700, top1: 0.65437, throughput: 1328.92 | 2022-05-21 12:35:53.601 [rank:3] [train], epoch: 31/50, iter: 500/834, loss: 0.29682, top1: 0.65458, throughput: 1328.87 | 2022-05-21 12:35:53.602 [rank:7] [train], epoch: 31/50, iter: 500/834, loss: 0.29694, top1: 0.65495, throughput: 1328.75 | 2022-05-21 12:35:53.603 [rank:4] [train], epoch: 31/50, iter: 500/834, loss: 0.29406, top1: 0.66120, throughput: 1328.52 | 2022-05-21 12:35:53.604 [rank:1] [train], epoch: 31/50, iter: 500/834, loss: 0.29803, top1: 0.65167, throughput: 1328.81 | 2022-05-21 12:35:53.603 [rank:2] [train], epoch: 31/50, iter: 500/834, loss: 0.29524, top1: 0.65771, throughput: 1328.60 | 2022-05-21 12:35:53.603 [rank:5] [train], epoch: 31/50, iter: 600/834, loss: 0.29860, top1: 0.65219, throughput: 1330.71 | 2022-05-21 12:36:08.029 [rank:7] [train], epoch: 31/50, iter: 600/834, loss: 0.29646, top1: 0.65729, throughput: 1331.01 | 2022-05-21 12:36:08.028 [rank:6] [train], epoch: 31/50, iter: 600/834, loss: 0.29804, top1: 0.65125, throughput: 1330.77 | 2022-05-21 12:36:08.029 [rank:1] [train], epoch: 31/50, iter: 600/834, loss: 0.29507, top1: 0.65781, throughput: 1331.03 | 2022-05-21 12:36:08.028 [rank:0] [train], epoch: 31/50, iter: 600/834, loss: 0.29713, top1: 0.65344, throughput: 1330.70 | 2022-05-21 12:36:08.029 [rank:3] [train], epoch: 31/50, iter: 600/834, loss: 0.29798, top1: 0.65135, throughput: 1330.88 | 2022-05-21 12:36:08.029 [rank:4] [train], epoch: 31/50, iter: 600/834, loss: 0.29659, top1: 0.65776, throughput: 1331.04 | 2022-05-21 12:36:08.029 [rank:2] [train], epoch: 31/50, iter: 600/834, loss: 0.29878, top1: 0.65203, throughput: 1330.89 | 2022-05-21 12:36:08.030 [rank:7] [train], epoch: 31/50, iter: 700/834, loss: 0.29940, top1: 0.65063, throughput: 1327.01 | 2022-05-21 12:36:22.496 [rank:5] [train], epoch: 31/50, iter: 700/834, loss: 0.29717, top1: 0.65615, throughput: 1327.17 | 2022-05-21 12:36:22.496 [rank:1] [train], epoch: 31/50, iter: 700/834, loss: 0.29801, top1: 0.65359, throughput: 1327.11 | 2022-05-21 12:36:22.496 [rank:0] [train], epoch: 31/50, iter: 700/834, loss: 0.29538, top1: 0.65964, throughput: 1327.11 | 2022-05-21 12:36:22.496 [rank:6] [train], epoch: 31/50, iter: 700/834, loss: 0.29407, top1: 0.66224, throughput: 1327.09 | 2022-05-21 12:36:22.497 [rank:4] [train], epoch: 31/50, iter: 700/834, loss: 0.29562, top1: 0.65880, throughput: 1327.06 | 2022-05-21 12:36:22.497 [rank:2] [train], epoch: 31/50, iter: 700/834, loss: 0.29673, top1: 0.65531, throughput: 1327.06 | 2022-05-21 12:36:22.498 [rank:3] [train], epoch: 31/50, iter: 700/834, loss: 0.29665, top1: 0.65484, throughput: 1327.00 | 2022-05-21 12:36:22.498 [rank:0] [train], epoch: 31/50, iter: 800/834, loss: 0.29571, top1: 0.65964, throughput: 1318.89[rank:1] [train], epoch: 31/50, iter: 800/834, loss: 0.29641, top1: 0.65464, throughput: 1318.81 | 2022-05-21 12:36:37.054 | 2022-05-21 12:36:37.054 [rank:6] [train], epoch: 31/50, iter: 800/834, loss: 0.29801, top1: 0.65490, throughput: 1318.92 | 2022-05-21 12:36:37.054 [rank:5] [train], epoch: 31/50, iter: 800/834, loss: 0.29690, top1: 0.65776, throughput: 1318.82 | 2022-05-21 12:36:37.054 [rank:4] [train], epoch: 31/50, iter: 800/834, loss: 0.29687, top1: 0.65573, throughput: 1318.72 | 2022-05-21 12:36:37.056 [rank:3] [train], epoch: 31/50, iter: 800/834, loss: 0.29711, top1: 0.65349, throughput: 1318.79 | 2022-05-21 12:36:37.056 [rank:7] [train], epoch: 31/50, iter: 800/834, loss: 0.29873, top1: 0.65359, throughput: 1318.73 | 2022-05-21 12:36:37.056 [rank:2] [train], epoch: 31/50, iter: 800/834, loss: 0.29624, top1: 0.65255, throughput: 1318.84 | 2022-05-21 12:36:37.056 [rank:7] [train], epoch: 31/50, iter: 834/834, loss: 0.29867, top1: 0.65456, throughput: 1326.16 | 2022-05-21 12:36:41.978 [rank:5] [train], epoch: 31/50, iter: 834/834, loss: 0.30009, top1: 0.65426, throughput: 1325.67 | 2022-05-21 12:36:41.979 [rank:2] [train], epoch: 31/50, iter: 834/834, loss: 0.29990, top1: 0.65334, throughput: 1326.04 | 2022-05-21 12:36:41.979 [rank:1] [train], epoch: 31/50, iter: 834/834, loss: 0.29997, top1: 0.65472, throughput: 1325.31 | 2022-05-21 12:36:41.980 [rank:4] [train], epoch: 31/50, iter: 834/834, loss: 0.30102, top1: 0.65165, throughput: 1325.85 | 2022-05-21 12:36:41.980 [rank:0] [train], epoch: 31/50, iter: 834/834, loss: 0.30123, top1: 0.65211, throughput: 1325.16 | 2022-05-21 12:36:41.980 [rank:6] [train], epoch: 31/50, iter: 834/834, loss: 0.29579, top1: 0.65870, throughput: 1325.00 | 2022-05-21 12:36:41.981 [rank:3] [train], epoch: 31/50, iter: 834/834, loss: 0.29586, top1: 0.65977, throughput: 1325.56 | 2022-05-21 12:36:41.981 [rank:7] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.63984, throughput: 568.52 | 2022-05-21 12:36:52.972 [rank:0] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.63904, throughput: 568.60 | 2022-05-21 12:36:52.972 [rank:2] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.62624, throughput: 565.79 | 2022-05-21 12:36:53.026 [rank:1] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.63600, throughput: 565.48 | 2022-05-21 12:36:53.033 [rank:4] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.62464, throughput: 565.28 | 2022-05-21 12:36:53.036 [rank:3] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.62720, throughput: 562.30 | 2022-05-21 12:36:53.096 [rank:6] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.62320, throughput: 559.10 | 2022-05-21 12:36:53.160 [rank:5] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.62176, throughput: 550.58 | 2022-05-21 12:36:53.330 [rank:5] [train], epoch: 32/50, iter: 100/834, loss: 0.29179, top1: 0.66411, throughput: 1334.30 | 2022-05-21 12:37:07.720 [rank:6] [train], epoch: 32/50, iter: 100/834, loss: 0.29168, top1: 0.66583, throughput: 1318.65 | 2022-05-21 12:37:07.720 [rank:4] [train], epoch: 32/50, iter: 100/834, loss: 0.28946, top1: 0.67193, throughput: 1307.60 | 2022-05-21 12:37:07.720 [rank:3] [train], epoch: 32/50, iter: 100/834, loss: 0.28703, top1: 0.67521, throughput: 1312.88[rank:1] [train], epoch: 32/50, iter: 100/834, loss: 0.29190, top1: 0.66193, throughput: 1307.10 | 2022-05-21 12:37:07.722 | 2022-05-21 12:37:07.721 [rank:0] [train], epoch: 32/50, iter: 100/834, loss: 0.29143, top1: 0.66432, throughput: 1301.73 | 2022-05-21 12:37:07.722 [rank:7] [train], epoch: 32/50, iter: 100/834, loss: 0.29440, top1: 0.66344, throughput: 1301.62 | 2022-05-21 12:37:07.723 [rank:2] [train], epoch: 32/50, iter: 100/834, loss: 0.29389, top1: 0.66115, throughput: 1306.40 | 2022-05-21 12:37:07.722 [rank:3] [train], epoch: 32/50, iter: 200/834, loss: 0.29212, top1: 0.66141, throughput: 1328.26 | 2022-05-21 12:37:22.176 [rank:7] [train], epoch: 32/50, iter: 200/834, loss: 0.29337, top1: 0.66349, throughput: 1328.30 | 2022-05-21 12:37:22.177 [rank:6] [train], epoch: 32/50, iter: 200/834, loss: 0.28926, top1: 0.66885, throughput: 1328.10 | 2022-05-21 12:37:22.177 [rank:5] [train], epoch: 32/50, iter: 200/834, loss: 0.29069, top1: 0.66667, throughput: 1328.18 | 2022-05-21 12:37:22.176 [rank:0] [train], epoch: 32/50, iter: 200/834, loss: 0.28775, top1: 0.67391, throughput: 1328.23 | 2022-05-21 12:37:22.177 [rank:1] [train], epoch: 32/50, iter: 200/834, loss: 0.29124, top1: 0.66302, throughput: 1328.25 | 2022-05-21 12:37:22.177 [rank:4] [train], epoch: 32/50, iter: 200/834, loss: 0.29280, top1: 0.66505, throughput: 1327.94 | 2022-05-21 12:37:22.178 [rank:2] [train], epoch: 32/50, iter: 200/834, loss: 0.29313, top1: 0.66109, throughput: 1328.07 | 2022-05-21 12:37:22.179 [rank:7] [train], epoch: 32/50, iter: 300/834, loss: 0.29169, top1: 0.66229, throughput: 1328.17 | 2022-05-21 12:37:36.633 [rank:0] [train], epoch: 32/50, iter: 300/834, loss: 0.29208, top1: 0.66151, throughput: 1328.17 | 2022-05-21 12:37:36.633 [rank:4] [train], epoch: 32/50, iter: 300/834, loss: 0.29305, top1: 0.66229, throughput: 1328.32 | 2022-05-21 12:37:36.633 [rank:5] [train], epoch: 32/50, iter: 300/834, loss: 0.29181, top1: 0.66354, throughput: 1328.06 | 2022-05-21 12:37:36.633 [rank:3] [train], epoch: 32/50, iter: 300/834, loss: 0.29090, top1: 0.66818, throughput: 1327.92 | 2022-05-21 12:37:36.634 [rank:6] [train], epoch: 32/50, iter: 300/834, loss: 0.29523, top1: 0.65807, throughput: 1327.95 | 2022-05-21 12:37:36.635 [rank:1] [train], epoch: 32/50, iter: 300/834, loss: 0.29450, top1: 0.66115, throughput: 1327.95 | 2022-05-21 12:37:36.635 [rank:2] [train], epoch: 32/50, iter: 300/834, loss: 0.29219, top1: 0.66146, throughput: 1328.23 | 2022-05-21 12:37:36.635 [rank:2] [train], epoch: 32/50, iter: 400/834, loss: 0.29136, top1: 0.66474, throughput: 1328.45 | 2022-05-21 12:37:51.088 [rank:7] [train], epoch: 32/50, iter: 400/834, loss: 0.29470, top1: 0.65849, throughput: 1328.43 | 2022-05-21 12:37:51.086 [rank:3] [train], epoch: 32/50, iter: 400/834, loss: 0.29141, top1: 0.66661, throughput: 1328.50 | 2022-05-21 12:37:51.087 [rank:0] [train], epoch: 32/50, iter: 400/834, loss: 0.29376, top1: 0.66281, throughput: 1328.14 | 2022-05-21 12:37:51.090 [rank:1] [train], epoch: 32/50, iter: 400/834, loss: 0.29381, top1: 0.66276, throughput: 1328.38 | 2022-05-21 12:37:51.089 [rank:5] [train], epoch: 32/50, iter: 400/834, loss: 0.29588, top1: 0.66010, throughput: 1328.17 | 2022-05-21 12:37:51.089 [rank:4] [train], epoch: 32/50, iter: 400/834, loss: 0.28986, top1: 0.67323, throughput: 1328.07 | 2022-05-21 12:37:51.090 [rank:6] [train], epoch: 32/50, iter: 400/834, loss: 0.29077, top1: 0.66969, throughput: 1328.27 | 2022-05-21 12:37:51.090 [rank:7] [train], epoch: 32/50, iter: 500/834, loss: 0.29389, top1: 0.66563, throughput: 1328.59 | 2022-05-21 12:38:05.538 [rank:6] [train], epoch: 32/50, iter: 500/834, loss: 0.29175, top1: 0.66177, throughput: 1328.94 | 2022-05-21 12:38:05.538 [rank:4] [train], epoch: 32/50, iter: 500/834, loss: 0.29339, top1: 0.66385, throughput: 1328.88 | 2022-05-21 12:38:05.538 [rank:3] [train], epoch: 32/50, iter: 500/834, loss: 0.29254, top1: 0.66562, throughput: 1328.60 | 2022-05-21 12:38:05.538 [rank:5] [train], epoch: 32/50, iter: 500/834, loss: 0.29183, top1: 0.66323, throughput: 1328.60 | 2022-05-21 12:38:05.540 [rank:0] [train], epoch: 32/50, iter: 500/834, loss: 0.29356, top1: 0.66224, throughput: 1328.73 | 2022-05-21 12:38:05.539 [rank:2] [train], epoch: 32/50, iter: 500/834, loss: 0.29410, top1: 0.66229, throughput: 1328.68 | 2022-05-21 12:38:05.538 [rank:1] [train], epoch: 32/50, iter: 500/834, loss: 0.29096, top1: 0.66911, throughput: 1328.66 | 2022-05-21 12:38:05.539 [rank:4] [train], epoch: 32/50, iter: 600/834, loss: 0.29446, top1: 0.65953, throughput: 1328.92 | 2022-05-21 12:38:19.986 [rank:6] [train], epoch: 32/50, iter: 600/834, loss: 0.29211, top1: 0.67047, throughput: 1328.89 | 2022-05-21 12:38:19.986 [rank:5] [train], epoch: 32/50, iter: 600/834, loss: 0.29341, top1: 0.66568, throughput: 1329.09 | 2022-05-21 12:38:19.986 [rank:7] [train], epoch: 32/50, iter: 600/834, loss: 0.29157, top1: 0.66807, throughput: 1328.77 | 2022-05-21 12:38:19.987 [rank:1] [train], epoch: 32/50, iter: 600/834, loss: 0.29402, top1: 0.66521, throughput: 1328.87 | 2022-05-21 12:38:19.988[rank:3] [train], epoch: 32/50, iter: 600/834, loss: 0.29334, top1: 0.66224, throughput: 1328.75 | 2022-05-21 12:38:19.988 [rank:0] [train], epoch: 32/50, iter: 600/834, loss: 0.29266, top1: 0.66458, throughput: 1328.87 | 2022-05-21 12:38:19.988 [rank:2] [train], epoch: 32/50, iter: 600/834, loss: 0.29469, top1: 0.66302, throughput: 1328.78 | 2022-05-21 12:38:19.987 [rank:5] [train], epoch: 32/50, iter: 700/834, loss: 0.29513, top1: 0.65318, throughput: 1321.40 | 2022-05-21 12:38:34.516 [rank:4] [train], epoch: 32/50, iter: 700/834, loss: 0.29336, top1: 0.66312, throughput: 1321.40 | 2022-05-21 12:38:34.516 [rank:6] [train], epoch: 32/50, iter: 700/834, loss: 0.29375, top1: 0.66302, throughput: 1321.35 | 2022-05-21 12:38:34.517 [rank:3] [train], epoch: 32/50, iter: 700/834, loss: 0.29266, top1: 0.66380, throughput: 1321.55 | 2022-05-21 12:38:34.516 [rank:1] [train], epoch: 32/50, iter: 700/834, loss: 0.29332, top1: 0.66208, throughput: 1321.54 | 2022-05-21 12:38:34.516 [rank:2] [train], epoch: 32/50, iter: 700/834, loss: 0.29353, top1: 0.66125, throughput: 1321.50 | 2022-05-21 12:38:34.516 [rank:7] [train], epoch: 32/50, iter: 700/834, loss: 0.29098, top1: 0.66375, throughput: 1321.13 | 2022-05-21 12:38:34.520 [rank:0] [train], epoch: 32/50, iter: 700/834, loss: 0.29583, top1: 0.65818, throughput: 1321.32 | 2022-05-21 12:38:34.519 [rank:6] [train], epoch: 32/50, iter: 800/834, loss: 0.29216, top1: 0.66479, throughput: 1329.42 | 2022-05-21 12:38:48.959 [rank:4] [train], epoch: 32/50, iter: 800/834, loss: 0.29330, top1: 0.66406, throughput: 1329.56 | 2022-05-21 12:38:48.957 [rank:5] [train], epoch: 32/50, iter: 800/834, loss: 0.29309, top1: 0.66130, throughput: 1329.54 | 2022-05-21 12:38:48.957 [rank:7] [train], epoch: 32/50, iter: 800/834, loss: 0.29105, top1: 0.66719, throughput: 1329.89 | 2022-05-21 12:38:48.957 [rank:1] [train], epoch: 32/50, iter: 800/834, loss: 0.29103, top1: 0.66943, throughput: 1329.60 | 2022-05-21 12:38:48.957 [rank:0] [train], epoch: 32/50, iter: 800/834, loss: 0.29376, top1: 0.66036, throughput: 1329.78 | 2022-05-21 12:38:48.957 [rank:3] [train], epoch: 32/50, iter: 800/834, loss: 0.29386, top1: 0.66047, throughput: 1329.35 | 2022-05-21 12:38:48.959 [rank:2] [train], epoch: 32/50, iter: 800/834, loss: 0.29092, top1: 0.66557, throughput: 1329.36 | 2022-05-21 12:38:48.959 [rank:3] [train], epoch: 32/50, iter: 834/834, loss: 0.29136, top1: 0.66789, throughput: 1307.22 | 2022-05-21 12:38:53.953 [rank:5] [train], epoch: 32/50, iter: 834/834, loss: 0.29226, top1: 0.66008, throughput: 1306.67 | 2022-05-21 12:38:53.953 [rank:4] [train], epoch: 32/50, iter: 834/834, loss: 0.29660, top1: 0.65196, throughput: 1306.52 | 2022-05-21 12:38:53.953 [rank:1] [train], epoch: 32/50, iter: 834/834, loss: 0.28886, top1: 0.66697, throughput: 1306.48 | 2022-05-21 12:38:53.953 [rank:7] [train], epoch: 32/50, iter: 834/834, loss: 0.29357, top1: 0.66176, throughput: 1306.13 | 2022-05-21 12:38:53.955 [rank:2] [train], epoch: 32/50, iter: 834/834, loss: 0.29240, top1: 0.66238, throughput: 1306.67 | 2022-05-21 12:38:53.955 [rank:6] [train], epoch: 32/50, iter: 834/834, loss: 0.29351, top1: 0.65794, throughput: 1306.37 | 2022-05-21 12:38:53.956 [rank:0] [train], epoch: 32/50, iter: 834/834, loss: 0.29478, top1: 0.66468, throughput: 1305.76 | 2022-05-21 12:38:53.957 [rank:7] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.66736, throughput: 575.85 | 2022-05-21 12:39:04.809 [rank:0] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.67088, throughput: 575.84 | 2022-05-21 12:39:04.810 [rank:2] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.65024, throughput: 574.98 | 2022-05-21 12:39:04.825 [rank:1] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.66576, throughput: 573.64 | 2022-05-21 12:39:04.849 [rank:4] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.66368, throughput: 572.45 | 2022-05-21 12:39:04.871 [rank:6] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.66320, throughput: 571.08 | 2022-05-21 12:39:04.900 [rank:3] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.65488, throughput: 568.40 | 2022-05-21 12:39:04.949 [rank:5] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.65856, throughput: 558.52 | 2022-05-21 12:39:05.143 [rank:7] [train], epoch: 33/50, iter: 100/834, loss: 0.28964, top1: 0.67125, throughput: 1302.52 | 2022-05-21 12:39:19.550 [rank:2] [train], epoch: 33/50, iter: 100/834, loss: 0.28488, top1: 0.68271, throughput: 1303.93 | 2022-05-21 12:39:19.550 [rank:3] [train], epoch: 33/50, iter: 100/834, loss: 0.28521, top1: 0.67776, throughput: 1314.98 | 2022-05-21 12:39:19.550 [rank:4] [train], epoch: 33/50, iter: 100/834, loss: 0.28537, top1: 0.67859, throughput: 1307.99 | 2022-05-21 12:39:19.550 [rank:5] [train], epoch: 33/50, iter: 100/834, loss: 0.28670, top1: 0.67510, throughput: 1332.69 | 2022-05-21 12:39:19.550 [rank:6] [train], epoch: 33/50, iter: 100/834, loss: 0.28570, top1: 0.67682, throughput: 1310.40 | 2022-05-21 12:39:19.552 [rank:0] [train], epoch: 33/50, iter: 100/834, loss: 0.28873, top1: 0.67099, throughput: 1302.40 | 2022-05-21 12:39:19.552 [rank:1] [train], epoch: 33/50, iter: 100/834, loss: 0.28356, top1: 0.68099, throughput: 1305.81 | 2022-05-21 12:39:19.552 [rank:0] [train], epoch: 33/50, iter: 200/834, loss: 0.28872, top1: 0.67203, throughput: 1328.21 | 2022-05-21 12:39:34.008 [rank:1] [train], epoch: 33/50, iter: 200/834, loss: 0.28639, top1: 0.67422, throughput: 1328.31 | 2022-05-21 12:39:34.007 [rank:6] [train], epoch: 33/50, iter: 200/834, loss: 0.28928, top1: 0.67203, throughput: 1328.18 | 2022-05-21 12:39:34.008 [rank:7] [train], epoch: 33/50, iter: 200/834, loss: 0.28502, top1: 0.67651, throughput: 1327.98 | 2022-05-21 12:39:34.008 [rank:3] [train], epoch: 33/50, iter: 200/834, loss: 0.28532, top1: 0.67469, throughput: 1327.75 | 2022-05-21 12:39:34.011 [rank:4] [train], epoch: 33/50, iter: 200/834, loss: 0.28773, top1: 0.67344, throughput: 1327.95 | 2022-05-21 12:39:34.009 [rank:2] [train], epoch: 33/50, iter: 200/834, loss: 0.28859, top1: 0.67099, throughput: 1327.91 | 2022-05-21 12:39:34.009 [rank:5] [train], epoch: 33/50, iter: 200/834, loss: 0.28967, top1: 0.67057, throughput: 1327.92 | 2022-05-21 12:39:34.009 [rank:2] [train], epoch: 33/50, iter: 300/834, loss: 0.29016, top1: 0.66927, throughput: 1327.18 | 2022-05-21 12:39:48.476 [rank:4] [train], epoch: 33/50, iter: 300/834, loss: 0.28825, top1: 0.67568, throughput: 1327.14 | 2022-05-21 12:39:48.476 [rank:3] [train], epoch: 33/50, iter: 300/834, loss: 0.28472, top1: 0.67729, throughput: 1327.33 | 2022-05-21 12:39:48.476 [rank:6] [train], epoch: 33/50, iter: 300/834, loss: 0.28735, top1: 0.67594, throughput: 1327.07 | 2022-05-21 12:39:48.476 [rank:0] [train], epoch: 33/50, iter: 300/834, loss: 0.28742, top1: 0.67609, throughput: 1327.09 | 2022-05-21 12:39:48.476 [rank:1] [train], epoch: 33/50, iter: 300/834, loss: 0.29019, top1: 0.67005, throughput: 1326.77 | 2022-05-21 12:39:48.478 [rank:5] [train], epoch: 33/50, iter: 300/834, loss: 0.28679, top1: 0.67328, throughput: 1326.72 | 2022-05-21 12:39:48.481 [rank:7] [train], epoch: 33/50, iter: 300/834, loss: 0.29175, top1: 0.66786, throughput: 1326.57 | 2022-05-21 12:39:48.481 [rank:5] [train], epoch: 33/50, iter: 400/834, loss: 0.28993, top1: 0.66740, throughput: 1328.39 | 2022-05-21 12:40:02.934 [rank:6] [train], epoch: 33/50, iter: 400/834, loss: 0.28992, top1: 0.66875, throughput: 1327.94 | 2022-05-21 12:40:02.934 [rank:0] [train], epoch: 33/50, iter: 400/834, loss: 0.28794, top1: 0.67078, throughput: 1327.88 | 2022-05-21 12:40:02.935 [rank:4] [train], epoch: 33/50, iter: 400/834, loss: 0.29054, top1: 0.66906, throughput: 1327.91 | 2022-05-21 12:40:02.935 [rank:3] [train], epoch: 33/50, iter: 400/834, loss: 0.29099, top1: 0.66786, throughput: 1327.85 | 2022-05-21 12:40:02.935 [rank:2] [train], epoch: 33/50, iter: 400/834, loss: 0.29007, top1: 0.66792, throughput: 1327.90 | 2022-05-21 12:40:02.934 [rank:7] [train], epoch: 33/50, iter: 400/834, loss: 0.29081, top1: 0.66609, throughput: 1328.24 | 2022-05-21 12:40:02.936 [rank:1] [train], epoch: 33/50, iter: 400/834, loss: 0.28579, top1: 0.68120, throughput: 1327.94 | 2022-05-21 12:40:02.936 [rank:0] [train], epoch: 33/50, iter: 500/834, loss: 0.28863, top1: 0.67307, throughput: 1329.12 | 2022-05-21 12:40:17.380 [rank:7] [train], epoch: 33/50, iter: 500/834, loss: 0.28941, top1: 0.66870, throughput: 1329.39 | 2022-05-21 12:40:17.379 [rank:4] [train], epoch: 33/50, iter: 500/834, loss: 0.29106, top1: 0.66750, throughput: 1329.13 | 2022-05-21 12:40:17.380 [rank:6] [train], epoch: 33/50, iter: 500/834, loss: 0.28697, top1: 0.67255, throughput: 1329.14 | 2022-05-21 12:40:17.380 [rank:5] [train], epoch: 33/50, iter: 500/834, loss: 0.28900, top1: 0.67010, throughput: 1329.14 | 2022-05-21 12:40:17.380 [rank:1] [train], epoch: 33/50, iter: 500/834, loss: 0.29019, top1: 0.67073, throughput: 1329.29 | 2022-05-21 12:40:17.380 [rank:2] [train], epoch: 33/50, iter: 500/834, loss: 0.29057, top1: 0.67047, throughput: 1329.12 | 2022-05-21 12:40:17.380 [rank:3] [train], epoch: 33/50, iter: 500/834, loss: 0.28795, top1: 0.67187, throughput: 1329.03 | 2022-05-21 12:40:17.382 [rank:0] [train], epoch: 33/50, iter: 600/834, loss: 0.28751, top1: 0.67542, throughput: 1327.73 | 2022-05-21 12:40:31.841 [rank:7] [train], epoch: 33/50, iter: 600/834, loss: 0.28694, top1: 0.67245, throughput: 1327.83 | 2022-05-21 12:40:31.839 [rank:5] [train], epoch: 33/50, iter: 600/834, loss: 0.28980, top1: 0.66812, throughput: 1327.84 | 2022-05-21 12:40:31.839 [rank:6] [train], epoch: 33/50, iter: 600/834, loss: 0.28908, top1: 0.67172, throughput: 1327.82 | 2022-05-21 12:40:31.840 [rank:4] [train], epoch: 33/50, iter: 600/834, loss: 0.28875, top1: 0.66906, throughput: 1327.74[rank:2] [train], epoch: 33/50, iter: 600/834, loss: 0.28710, top1: 0.67271, throughput: 1327.85 | 2022-05-21 12:40:31.840 | 2022-05-21 12:40:31.841 [rank:1] [train], epoch: 33/50, iter: 600/834, loss: 0.28669, top1: 0.67703, throughput: 1327.45 [rank:3] [train], epoch: 33/50, iter: 600/834, loss: 0.29077, top1: 0.66630, throughput: 1327.58| 2022-05-21 12:40:31.844 | 2022-05-21 12:40:31.844 [rank:4] [train], epoch: 33/50, iter: 700/834, loss: 0.28877, top1: 0.66990, throughput: 1330.04 | 2022-05-21 12:40:46.276 [rank:6] [train], epoch: 33/50, iter: 700/834, loss: 0.28883, top1: 0.66818, throughput: 1329.94 | 2022-05-21 12:40:46.276 [rank:5] [train], epoch: 33/50, iter: 700/834, loss: 0.28821, top1: 0.67401, throughput: 1329.86 | 2022-05-21 12:40:46.277 [rank:1] [train], epoch: 33/50, iter: 700/834, loss: 0.28981, top1: 0.66792, throughput: 1330.33 | 2022-05-21 12:40:46.276 [rank:0] [train], epoch: 33/50, iter: 700/834, loss: 0.28899, top1: 0.66849, throughput: 1329.94 | 2022-05-21 12:40:46.278 [rank:7] [train], epoch: 33/50, iter: 700/834, loss: 0.29053, top1: 0.66828, throughput: 1329.70 | 2022-05-21 12:40:46.278 [rank:3] [train], epoch: 33/50, iter: 700/834, loss: 0.29044, top1: 0.67047, throughput: 1330.18 | 2022-05-21 12:40:46.278 [rank:2] [train], epoch: 33/50, iter: 700/834, loss: 0.28876, top1: 0.67089, throughput: 1329.82 | 2022-05-21 12:40:46.278 [rank:6] [train], epoch: 33/50, iter: 800/834, loss: 0.28766, top1: 0.67323, throughput: 1330.92 | 2022-05-21 12:41:00.703 [rank:3] [train], epoch: 33/50, iter: 800/834, loss: 0.29085, top1: 0.66693, throughput: 1331.10 | 2022-05-21 12:41:00.702 [rank:7] [train], epoch: 33/50, iter: 800/834, loss: 0.29003, top1: 0.66562, throughput: 1331.05 | 2022-05-21 12:41:00.703 [rank:5] [train], epoch: 33/50, iter: 800/834, loss: 0.29012, top1: 0.66672, throughput: 1330.86 | 2022-05-21 12:41:00.704 [rank:4] [train], epoch: 33/50, iter: 800/834, loss: 0.29056, top1: 0.66354, throughput: 1330.76 | 2022-05-21 12:41:00.704 [rank:0] [train], epoch: 33/50, iter: 800/834, loss: 0.29076, top1: 0.66979, throughput: 1330.76 | 2022-05-21 12:41:00.706 [rank:1] [train], epoch: 33/50, iter: 800/834, loss: 0.29254, top1: 0.66687, throughput: 1330.59 | 2022-05-21 12:41:00.706 [rank:2] [train], epoch: 33/50, iter: 800/834, loss: 0.29184, top1: 0.66651, throughput: 1330.70 | 2022-05-21 12:41:00.706 [rank:7] [train], epoch: 33/50, iter: 834/834, loss: 0.28823, top1: 0.67509, throughput: 1320.53 | 2022-05-21 12:41:05.646 [rank:1] [train], epoch: 33/50, iter: 834/834, loss: 0.28950, top1: 0.66789, throughput: 1321.48 | 2022-05-21 12:41:05.646 [rank:5] [train], epoch: 33/50, iter: 834/834, loss: 0.29203, top1: 0.66054, throughput: 1320.78 | 2022-05-21 12:41:05.646 [rank:2] [train], epoch: 33/50, iter: 834/834, loss: 0.29138, top1: 0.66345, throughput: 1321.14 | 2022-05-21 12:41:05.647 [rank:3] [train], epoch: 33/50, iter: 834/834, loss: 0.28807, top1: 0.67448, throughput: 1319.82 | 2022-05-21 12:41:05.649 [rank:6] [train], epoch: 33/50, iter: 834/834, loss: 0.28895, top1: 0.67264, throughput: 1319.79 | 2022-05-21 12:41:05.649 [rank:0] [train], epoch: 33/50, iter: 834/834, loss: 0.29209, top1: 0.66483, throughput: 1320.53 | 2022-05-21 12:41:05.649 [rank:4] [train], epoch: 33/50, iter: 834/834, loss: 0.28940, top1: 0.67126, throughput: 1319.92 | 2022-05-21 12:41:05.650 [rank:7] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.67712, throughput: 574.10 | 2022-05-21 12:41:16.533 [rank:0] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.67376, throughput: 574.16 | 2022-05-21 12:41:16.535 [rank:2] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.66384, throughput: 573.84 | 2022-05-21 12:41:16.539 [rank:4] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.67504, throughput: 568.29 | 2022-05-21 12:41:16.648 [rank:6] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.67968, throughput: 566.50 | 2022-05-21 12:41:16.681 [rank:3] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.67232, throughput: 564.11 | 2022-05-21 12:41:16.728 [rank:5] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.66592, throughput: 561.04 | 2022-05-21 12:41:16.786 [rank:1] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.67264, throughput: 556.24 | 2022-05-21 12:41:16.882 [rank:7] [train], epoch: 34/50, iter: 100/834, loss: 0.28080, top1: 0.68458, throughput: 1302.92 | 2022-05-21 12:41:31.269 [rank:3] [train], epoch: 34/50, iter: 100/834, loss: 0.28168, top1: 0.68615, throughput: 1320.36 | 2022-05-21 12:41:31.270 [rank:5] [train], epoch: 34/50, iter: 100/834, loss: 0.28390, top1: 0.67927, throughput: 1325.67 | 2022-05-21 12:41:31.270 [rank:6] [train], epoch: 34/50, iter: 100/834, loss: 0.28667, top1: 0.67583, throughput: 1315.95 | 2022-05-21 12:41:31.272 [rank:2] [train], epoch: 34/50, iter: 100/834, loss: 0.28473, top1: 0.67646, throughput: 1303.35 | 2022-05-21 12:41:31.270 [rank:4] [train], epoch: 34/50, iter: 100/834, loss: 0.28586, top1: 0.67391, throughput: 1312.99 | 2022-05-21 12:41:31.271 [rank:1] [train], epoch: 34/50, iter: 100/834, loss: 0.28392, top1: 0.68078, throughput: 1334.13 | 2022-05-21 12:41:31.274 [rank:0] [train], epoch: 34/50, iter: 100/834, loss: 0.28091, top1: 0.68411, throughput: 1302.60 | 2022-05-21 12:41:31.274 [rank:7] [train], epoch: 34/50, iter: 200/834, loss: 0.28044, top1: 0.68802, throughput: 1329.46 | 2022-05-21 12:41:45.711 [rank:4] [train], epoch: 34/50, iter: 200/834, loss: 0.28387, top1: 0.68036, throughput: 1329.67 | 2022-05-21 12:41:45.711 [rank:6] [train], epoch: 34/50, iter: 200/834, loss: 0.28136, top1: 0.68740, throughput: 1329.64 | 2022-05-21 12:41:45.712 [rank:0] [train], epoch: 34/50, iter: 200/834, loss: 0.28289, top1: 0.68380, throughput: 1329.88 | 2022-05-21 12:41:45.712 [rank:5] [train], epoch: 34/50, iter: 200/834, loss: 0.28440, top1: 0.68089, throughput: 1329.44 | 2022-05-21 12:41:45.712 [rank:3] [train], epoch: 34/50, iter: 200/834, loss: 0.28280, top1: 0.68417, throughput: 1329.25 | 2022-05-21 12:41:45.714 [rank:1] [train], epoch: 34/50, iter: 200/834, loss: 0.28386, top1: 0.68115, throughput: 1329.69 | 2022-05-21 12:41:45.713 [rank:2] [train], epoch: 34/50, iter: 200/834, loss: 0.28451, top1: 0.68016, throughput: 1329.27 | 2022-05-21 12:41:45.714 [rank:7] [train], epoch: 34/50, iter: 300/834, loss: 0.28365, top1: 0.68120, throughput: 1329.26 | 2022-05-21 12:42:00.155 [rank:3] [train], epoch: 34/50, iter: 300/834, loss: 0.28437, top1: 0.68240, throughput: 1329.42 | 2022-05-21 12:42:00.156 [rank:4] [train], epoch: 34/50, iter: 300/834, loss: 0.28653, top1: 0.67573, throughput: 1329.32 | 2022-05-21 12:42:00.154 [rank:5] [train], epoch: 34/50, iter: 300/834, loss: 0.28466, top1: 0.68172, throughput: 1329.31 | 2022-05-21 12:42:00.155 [rank:6] [train], epoch: 34/50, iter: 300/834, loss: 0.28486, top1: 0.67740, throughput: 1329.22 | 2022-05-21 12:42:00.156 [rank:2] [train], epoch: 34/50, iter: 300/834, loss: 0.28429, top1: 0.68115, throughput: 1329.53 | 2022-05-21 12:42:00.155 [rank:0] [train], epoch: 34/50, iter: 300/834, loss: 0.28330, top1: 0.68375, throughput: 1329.11 | 2022-05-21 12:42:00.157 [rank:1] [train], epoch: 34/50, iter: 300/834, loss: 0.28552, top1: 0.67776, throughput: 1329.26 | 2022-05-21 12:42:00.157 [rank:7] [train], epoch: 34/50, iter: 400/834, loss: 0.28725, top1: 0.67333, throughput: 1329.20 | 2022-05-21 12:42:14.600 [rank:4] [train], epoch: 34/50, iter: 400/834, loss: 0.28605, top1: 0.67698, throughput: 1329.09 | 2022-05-21 12:42:14.600 [rank:3] [train], epoch: 34/50, iter: 400/834, loss: 0.28337, top1: 0.67776, throughput: 1329.21[rank:5] [train], epoch: 34/50, iter: 400/834, loss: 0.28577, top1: 0.67708, throughput: 1329.19 | 2022-05-21 12:42:14.600 | 2022-05-21 12:42:14.601 [rank:6] [train], epoch: 34/50, iter: 400/834, loss: 0.28378, top1: 0.68266, throughput: 1329.11 | 2022-05-21 12:42:14.602 [rank:1] [train], epoch: 34/50, iter: 400/834, loss: 0.28587, top1: 0.67495, throughput: 1329.22 | 2022-05-21 12:42:14.602 [rank:2] [train], epoch: 34/50, iter: 400/834, loss: 0.28863, top1: 0.67010, throughput: 1329.06 | 2022-05-21 12:42:14.601 [rank:0] [train], epoch: 34/50, iter: 400/834, loss: 0.28661, top1: 0.67703, throughput: 1329.13 | 2022-05-21 12:42:14.603 [rank:2] [train], epoch: 34/50, iter: 500/834, loss: 0.28572, top1: 0.67865, throughput: 1327.45 | 2022-05-21 12:42:29.065 [rank:7] [train], epoch: 34/50, iter: 500/834, loss: 0.28429, top1: 0.68135, throughput: 1327.28 | 2022-05-21 12:42:29.065 [rank:4] [train], epoch: 34/50, iter: 500/834, loss: 0.28415, top1: 0.68333, throughput: 1327.32 | 2022-05-21 12:42:29.066 [rank:0] [train], epoch: 34/50, iter: 500/834, loss: 0.28853, top1: 0.67542, throughput: 1327.52 | 2022-05-21 12:42:29.066 [rank:1] [train], epoch: 34/50, iter: 500/834, loss: 0.28505, top1: 0.67677, throughput: 1327.39 | 2022-05-21 12:42:29.066 [rank:3] [train], epoch: 34/50, iter: 500/834, loss: 0.28292, top1: 0.68599, throughput: 1327.19 | 2022-05-21 12:42:29.068 [rank:6] [train], epoch: 34/50, iter: 500/834, loss: 0.28478, top1: 0.67906, throughput: 1327.04 | 2022-05-21 12:42:29.070 [rank:5] [train], epoch: 34/50, iter: 500/834, loss: 0.28168, top1: 0.68510, throughput: 1326.92 | 2022-05-21 12:42:29.070 [rank:7] [train], epoch: 34/50, iter: 600/834, loss: 0.28566, top1: 0.67516, throughput: 1329.52 | 2022-05-21 12:42:43.507 [rank:6] [train], epoch: 34/50, iter: 600/834, loss: 0.28621, top1: 0.67901, throughput: 1329.89 | 2022-05-21 12:42:43.507 [rank:1] [train], epoch: 34/50, iter: 600/834, loss: 0.28433, top1: 0.67823, throughput: 1329.59 | 2022-05-21 12:42:43.507 [rank:0] [train], epoch: 34/50, iter: 600/834, loss: 0.28711, top1: 0.67406, throughput: 1329.52 | 2022-05-21 12:42:43.507 [rank:4] [train], epoch: 34/50, iter: 600/834, loss: 0.28437, top1: 0.67818, throughput: 1329.41 | 2022-05-21 12:42:43.508 [rank:5] [train], epoch: 34/50, iter: 600/834, loss: 0.28504, top1: 0.68016, throughput: 1329.69 | 2022-05-21 12:42:43.509 [rank:3] [train], epoch: 34/50, iter: 600/834, loss: 0.28413, top1: 0.67865, throughput: 1329.45 | 2022-05-21 12:42:43.510 [rank:2] [train], epoch: 34/50, iter: 600/834, loss: 0.28197, top1: 0.68411, throughput: 1329.26 | 2022-05-21 12:42:43.509 [rank:7] [train], epoch: 34/50, iter: 700/834, loss: 0.28363, top1: 0.67729, throughput: 1328.93 | 2022-05-21 12:42:57.954 [rank:4] [train], epoch: 34/50, iter: 700/834, loss: 0.28725, top1: 0.67057, throughput: 1328.99 | 2022-05-21 12:42:57.955 [rank:3] [train], epoch: 34/50, iter: 700/834, loss: 0.28347, top1: 0.68073, throughput: 1329.02[rank:0] [train], epoch: 34/50, iter: 700/834, loss: 0.28585, top1: 0.67375, throughput: 1328.83 | 2022-05-21 12:42:57.956| 2022-05-21 12:42:57.956 [rank:1] [train], epoch: 34/50, iter: 700/834, loss: 0.28604, top1: 0.67172, throughput: 1328.84 | 2022-05-21 12:42:57.956 [rank:6] [train], epoch: 34/50, iter: 700/834, loss: 0.28400, top1: 0.68151, throughput: 1328.64 | 2022-05-21 12:42:57.958 [rank:2] [train], epoch: 34/50, iter: 700/834, loss: 0.28662, top1: 0.67500, throughput: 1328.92 | 2022-05-21 12:42:57.957 [rank:5] [train], epoch: 34/50, iter: 700/834, loss: 0.28288, top1: 0.68167, throughput: 1328.80 | 2022-05-21 12:42:57.959 [rank:3] [train], epoch: 34/50, iter: 800/834, loss: 0.28430, top1: 0.67813, throughput: 1327.39 | 2022-05-21 12:43:12.421 [rank:7] [train], epoch: 34/50, iter: 800/834, loss: 0.28786, top1: 0.67036, throughput: 1327.30 | 2022-05-21 12:43:12.420 [rank:5] [train], epoch: 34/50, iter: 800/834, loss: 0.28588, top1: 0.67625, throughput: 1327.66 | 2022-05-21 12:43:12.420 [rank:6] [train], epoch: 34/50, iter: 800/834, loss: 0.28596, top1: 0.67630, throughput: 1327.68 | 2022-05-21 12:43:12.420 [rank:4] [train], epoch: 34/50, iter: 800/834, loss: 0.28637, top1: 0.67766, throughput: 1327.35 | 2022-05-21 12:43:12.420 [rank:2] [train], epoch: 34/50, iter: 800/834, loss: 0.28414, top1: 0.68125, throughput: 1327.55 | 2022-05-21 12:43:12.420 [rank:0] [train], epoch: 34/50, iter: 800/834, loss: 0.28345, top1: 0.68323, throughput: 1327.31 | 2022-05-21 12:43:12.422[rank:1] [train], epoch: 34/50, iter: 800/834, loss: 0.28386, top1: 0.68156, throughput: 1327.28 | 2022-05-21 12:43:12.421 [rank:4] [train], epoch: 34/50, iter: 834/834, loss: 0.28501, top1: 0.67586, throughput: 1323.56 | 2022-05-21 12:43:17.352 [rank:5] [train], epoch: 34/50, iter: 834/834, loss: 0.28273, top1: 0.67678, throughput: 1323.48 | 2022-05-21 12:43:17.352[rank:7] [train], epoch: 34/50, iter: 834/834, loss: 0.28642, top1: 0.67325, throughput: 1323.40 | 2022-05-21 12:43:17.353 [rank:1] [train], epoch: 34/50, iter: 834/834, loss: 0.28901, top1: 0.67325, throughput: 1323.62 | 2022-05-21 12:43:17.353 [rank:2] [train], epoch: 34/50, iter: 834/834, loss: 0.28468, top1: 0.67586, throughput: 1322.93 | 2022-05-21 12:43:17.354 [rank:3] [train], epoch: 34/50, iter: 834/834, loss: 0.28556, top1: 0.68275, throughput: 1323.01 | 2022-05-21 12:43:17.355 [rank:0] [train], epoch: 34/50, iter: 834/834, loss: 0.28733, top1: 0.67019, throughput: 1323.11 | 2022-05-21 12:43:17.355 [rank:6] [train], epoch: 34/50, iter: 834/834, loss: 0.28189, top1: 0.67923, throughput: 1322.61 | 2022-05-21 12:43:17.355 [rank:0] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.68880, throughput: 568.43 | 2022-05-21 12:43:28.351 [rank:7] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.68224, throughput: 564.90 | 2022-05-21 12:43:28.416 [rank:6] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.68736, throughput: 562.42 | 2022-05-21 12:43:28.468 [rank:1] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.68864, throughput: 559.02 | 2022-05-21 12:43:28.534 [rank:2] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.67744, throughput: 556.87 | 2022-05-21 12:43:28.578 [rank:3] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.67808, throughput: 555.90 | 2022-05-21 12:43:28.598 [rank:4] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.67760, throughput: 552.89 | 2022-05-21 12:43:28.657 [rank:5] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.67200, throughput: 551.23 | 2022-05-21 12:43:28.691 [rank:3] [train], epoch: 35/50, iter: 100/834, loss: 0.28119, top1: 0.68974, throughput: 1324.20 | 2022-05-21 12:43:43.097 [rank:4] [train], epoch: 35/50, iter: 100/834, loss: 0.27753, top1: 0.69766, throughput: 1329.47 | 2022-05-21 12:43:43.098 [rank:5] [train], epoch: 35/50, iter: 100/834, loss: 0.28248, top1: 0.68406, throughput: 1332.64 | 2022-05-21 12:43:43.098 [rank:7] [train], epoch: 35/50, iter: 100/834, loss: 0.27947, top1: 0.68839, throughput: 1307.75 | 2022-05-21 12:43:43.098 [rank:2] [train], epoch: 35/50, iter: 100/834, loss: 0.28107, top1: 0.68745, throughput: 1322.29 | 2022-05-21 12:43:43.098 [rank:1] [train], epoch: 35/50, iter: 100/834, loss: 0.27912, top1: 0.68755, throughput: 1318.28 | 2022-05-21 12:43:43.098 [rank:6] [train], epoch: 35/50, iter: 100/834, loss: 0.27619, top1: 0.69828, throughput: 1312.32 | 2022-05-21 12:43:43.099 [rank:0] [train], epoch: 35/50, iter: 100/834, loss: 0.27778, top1: 0.69464, throughput: 1301.86 | 2022-05-21 12:43:43.099 [rank:2] [train], epoch: 35/50, iter: 200/834, loss: 0.27987, top1: 0.68823, throughput: 1331.12 | 2022-05-21 12:43:57.522 [rank:3] [train], epoch: 35/50, iter: 200/834, loss: 0.28018, top1: 0.68969, throughput: 1331.03 | 2022-05-21 12:43:57.522 [rank:7] [train], epoch: 35/50, iter: 200/834, loss: 0.27857, top1: 0.68740, throughput: 1331.09 | 2022-05-21 12:43:57.522 [rank:5] [train], epoch: 35/50, iter: 200/834, loss: 0.28121, top1: 0.68734, throughput: 1331.00 | 2022-05-21 12:43:57.524 [rank:4] [train], epoch: 35/50, iter: 200/834, loss: 0.27870, top1: 0.69370, throughput: 1330.92 | 2022-05-21 12:43:57.524 [rank:6] [train], epoch: 35/50, iter: 200/834, loss: 0.27737, top1: 0.69135, throughput: 1330.99 | 2022-05-21 12:43:57.524 [rank:1] [train], epoch: 35/50, iter: 200/834, loss: 0.28057, top1: 0.68495, throughput: 1330.87 | 2022-05-21 12:43:57.525 [rank:0] [train], epoch: 35/50, iter: 200/834, loss: 0.28095, top1: 0.69016, throughput: 1330.93 | 2022-05-21 12:43:57.525 [rank:3] [train], epoch: 35/50, iter: 300/834, loss: 0.28279, top1: 0.68328, throughput: 1328.51 | 2022-05-21 12:44:11.975 [rank:4] [train], epoch: 35/50, iter: 300/834, loss: 0.27861, top1: 0.69036, throughput: 1328.62 | 2022-05-21 12:44:11.976 [rank:6] [train], epoch: 35/50, iter: 300/834, loss: 0.27815, top1: 0.69438, throughput: 1328.56 | 2022-05-21 12:44:11.976 [rank:7] [train], epoch: 35/50, iter: 300/834, loss: 0.28386, top1: 0.67854, throughput: 1328.39 | 2022-05-21 12:44:11.976 [rank:1] [train], epoch: 35/50, iter: 300/834, loss: 0.28108, top1: 0.68599, throughput: 1328.60 | 2022-05-21 12:44:11.976 [rank:5] [train], epoch: 35/50, iter: 300/834, loss: 0.27993, top1: 0.69047, throughput: 1328.54 | 2022-05-21 12:44:11.976 [rank:2] [train], epoch: 35/50, iter: 300/834, loss: 0.27849, top1: 0.69229, throughput: 1328.42 | 2022-05-21 12:44:11.975 [rank:0] [train], epoch: 35/50, iter: 300/834, loss: 0.28119, top1: 0.68578, throughput: 1328.47 | 2022-05-21 12:44:11.977 [rank:4] [train], epoch: 35/50, iter: 400/834, loss: 0.28136, top1: 0.68292, throughput: 1321.57 | 2022-05-21 12:44:26.504 [rank:5] [train], epoch: 35/50, iter: 400/834, loss: 0.28247, top1: 0.68510, throughput: 1321.58 | 2022-05-21 12:44:26.504 [rank:6] [train], epoch: 35/50, iter: 400/834, loss: 0.27939, top1: 0.69135, throughput: 1321.55 | 2022-05-21 12:44:26.504 [rank:1] [train], epoch: 35/50, iter: 400/834, loss: 0.28178, top1: 0.68375, throughput: 1321.57 | 2022-05-21 12:44:26.504 [rank:2] [train], epoch: 35/50, iter: 400/834, loss: 0.28097, top1: 0.68635, throughput: 1321.54[rank:7] [train], epoch: 35/50, iter: 400/834, loss: 0.28042, top1: 0.68964, throughput: 1321.61 | 2022-05-21 12:44:26.504| 2022-05-21 12:44:26.504 [rank:0] [train], epoch: 35/50, iter: 400/834, loss: 0.28364, top1: 0.68089, throughput: 1321.57 | 2022-05-21 12:44:26.506 [rank:3] [train], epoch: 35/50, iter: 400/834, loss: 0.28149, top1: 0.68318, throughput: 1321.29 | 2022-05-21 12:44:26.506 [rank:4] [train], epoch: 35/50, iter: 500/834, loss: 0.28008, top1: 0.68865, throughput: 1329.72 | 2022-05-21 12:44:40.943 [rank:3] [train], epoch: 35/50, iter: 500/834, loss: 0.27902, top1: 0.69214, throughput: 1329.91 | 2022-05-21 12:44:40.943 [rank:2] [train], epoch: 35/50, iter: 500/834, loss: 0.28146, top1: 0.68479, throughput: 1329.81 | 2022-05-21 12:44:40.942 [rank:1] [train], epoch: 35/50, iter: 500/834, loss: 0.28268, top1: 0.68271, throughput: 1329.80 | 2022-05-21 12:44:40.942 [rank:5] [train], epoch: 35/50, iter: 500/834, loss: 0.28310, top1: 0.68422, throughput: 1329.71 | 2022-05-21 12:44:40.943 [rank:7] [train], epoch: 35/50, iter: 500/834, loss: 0.28069, top1: 0.68740, throughput: 1329.73 | 2022-05-21 12:44:40.943 [rank:6] [train], epoch: 35/50, iter: 500/834, loss: 0.28124, top1: 0.68516, throughput: 1329.57 | 2022-05-21 12:44:40.945 [rank:0] [train], epoch: 35/50, iter: 500/834, loss: 0.28089, top1: 0.68516, throughput: 1329.76 | 2022-05-21 12:44:40.944 [rank:5] [train], epoch: 35/50, iter: 600/834, loss: 0.28090, top1: 0.68688, throughput: 1329.05 | 2022-05-21 12:44:55.389 [rank:1] [train], epoch: 35/50, iter: 600/834, loss: 0.28039, top1: 0.68609, throughput: 1329.01 | 2022-05-21 12:44:55.389 [rank:7] [train], epoch: 35/50, iter: 600/834, loss: 0.27855, top1: 0.68682, throughput: 1329.06 | 2022-05-21 12:44:55.389[rank:2] [train], epoch: 35/50, iter: 600/834, loss: 0.28360, top1: 0.67802, throughput: 1328.96 | 2022-05-21 12:44:55.389 [rank:3] [train], epoch: 35/50, iter: 600/834, loss: 0.28075, top1: 0.68714, throughput: 1328.87 | 2022-05-21 12:44:55.391 [rank:0] [train], epoch: 35/50, iter: 600/834, loss: 0.28157, top1: 0.68469, throughput: 1328.96 | 2022-05-21 12:44:55.392 [rank:6] [train], epoch: 35/50, iter: 600/834, loss: 0.28121, top1: 0.68583, throughput: 1328.89 | 2022-05-21 12:44:55.393 [rank:4] [train], epoch: 35/50, iter: 600/834, loss: 0.28130, top1: 0.68370, throughput: 1328.69 | 2022-05-21 12:44:55.393 [rank:5] [train], epoch: 35/50, iter: 700/834, loss: 0.27895, top1: 0.68937, throughput: 1327.28 | 2022-05-21 12:45:09.855 [rank:7] [train], epoch: 35/50, iter: 700/834, loss: 0.28074, top1: 0.68865, throughput: 1327.27 | 2022-05-21 12:45:09.855 [rank:1] [train], epoch: 35/50, iter: 700/834, loss: 0.28098, top1: 0.69005, throughput: 1327.23 | 2022-05-21 12:45:09.855 [rank:4] [train], epoch: 35/50, iter: 700/834, loss: 0.28092, top1: 0.68552, throughput: 1327.64 | 2022-05-21 12:45:09.855 [rank:6] [train], epoch: 35/50, iter: 700/834, loss: 0.27876, top1: 0.69010, throughput: 1327.49 | 2022-05-21 12:45:09.856 [rank:0] [train], epoch: 35/50, iter: 700/834, loss: 0.28072, top1: 0.68682, throughput: 1327.45 | 2022-05-21 12:45:09.856 [rank:2] [train], epoch: 35/50, iter: 700/834, loss: 0.28273, top1: 0.68302, throughput: 1327.11 | 2022-05-21 12:45:09.857 [rank:3] [train], epoch: 35/50, iter: 700/834, loss: 0.28306, top1: 0.68172, throughput: 1327.30 | 2022-05-21 12:45:09.857 [rank:2] [train], epoch: 35/50, iter: 800/834, loss: 0.28072, top1: 0.68563, throughput: 1327.69 | 2022-05-21 12:45:24.318 [rank:6] [train], epoch: 35/50, iter: 800/834, loss: 0.28117, top1: 0.68641, throughput: 1327.63 | 2022-05-21 12:45:24.318 [rank:1] [train], epoch: 35/50, iter: 800/834, loss: 0.28125, top1: 0.68589, throughput: 1327.45 | 2022-05-21 12:45:24.319 [rank:4] [train], epoch: 35/50, iter: 800/834, loss: 0.28195, top1: 0.68708, throughput: 1327.32 | 2022-05-21 12:45:24.320 [rank:0] [train], epoch: 35/50, iter: 800/834, loss: 0.28137, top1: 0.68573, throughput: 1327.42 | 2022-05-21 12:45:24.320 [rank:3] [train], epoch: 35/50, iter: 800/834, loss: 0.28299, top1: 0.68745, throughput: 1327.44 | 2022-05-21 12:45:24.321 [rank:7] [train], epoch: 35/50, iter: 800/834, loss: 0.28114, top1: 0.68693, throughput: 1327.10 | 2022-05-21 12:45:24.323 [rank:5] [train], epoch: 35/50, iter: 800/834, loss: 0.28358, top1: 0.68870, throughput: 1327.07 | 2022-05-21 12:45:24.323 [rank:7] [train], epoch: 35/50, iter: 834/834, loss: 0.28410, top1: 0.68091, throughput: 1326.92 | 2022-05-21 12:45:29.242 [rank:4] [train], epoch: 35/50, iter: 834/834, loss: 0.28126, top1: 0.68244, throughput: 1326.30 | 2022-05-21 12:45:29.242 [rank:6] [train], epoch: 35/50, iter: 834/834, loss: 0.28463, top1: 0.67877, throughput: 1325.68[rank:3] [train], epoch: 35/50, iter: 834/834, loss: 0.27826, top1: 0.69455, throughput: 1326.27 | 2022-05-21 12:45:29.243 | 2022-05-21 12:45:29.242 [rank:1] [train], epoch: 35/50, iter: 834/834, loss: 0.27757, top1: 0.69041, throughput: 1325.95 | 2022-05-21 12:45:29.243 [rank:5] [train], epoch: 35/50, iter: 834/834, loss: 0.28035, top1: 0.68290, throughput: 1326.71 | 2022-05-21 12:45:29.243 [rank:0] [train], epoch: 35/50, iter: 834/834, loss: 0.28443, top1: 0.67938, throughput: 1325.83 | 2022-05-21 12:45:29.243 [rank:2] [train], epoch: 35/50, iter: 834/834, loss: 0.27963, top1: 0.68597, throughput: 1325.00 | 2022-05-21 12:45:29.245 [rank:0] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.69648, throughput: 583.73 | 2022-05-21 12:45:39.950 [rank:7] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.68064, throughput: 579.58 | 2022-05-21 12:45:40.026 [rank:2] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.68080, throughput: 576.53 | 2022-05-21 12:45:40.085 [rank:6] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.68064, throughput: 572.75 | 2022-05-21 12:45:40.155 [rank:3] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.68016, throughput: 569.25 | 2022-05-21 12:45:40.222 [rank:1] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.68400, throughput: 568.92 | 2022-05-21 12:45:40.228 [rank:5] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.67584, throughput: 564.44 | 2022-05-21 12:45:40.316 [rank:4] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.67968, throughput: 563.36 | 2022-05-21 12:45:40.336 [rank:4] [train], epoch: 36/50, iter: 100/834, loss: 0.27441, top1: 0.69849, throughput: 1335.84 | 2022-05-21 12:45:54.709 [rank:5] [train], epoch: 36/50, iter: 100/834, loss: 0.27382, top1: 0.70490, throughput: 1333.86 | 2022-05-21 12:45:54.710 [rank:7] [train], epoch: 36/50, iter: 100/834, loss: 0.27329, top1: 0.70245, throughput: 1307.60 | 2022-05-21 12:45:54.709 [rank:0] [train], epoch: 36/50, iter: 100/834, loss: 0.27413, top1: 0.70109, throughput: 1300.89 | 2022-05-21 12:45:54.710 [rank:1] [train], epoch: 36/50, iter: 100/834, loss: 0.27506, top1: 0.70021, throughput: 1325.77 | 2022-05-21 12:45:54.710 [rank:6] [train], epoch: 36/50, iter: 100/834, loss: 0.27651, top1: 0.69552, throughput: 1319.00 | 2022-05-21 12:45:54.711 [rank:3] [train], epoch: 36/50, iter: 100/834, loss: 0.27751, top1: 0.69453, throughput: 1324.87 | 2022-05-21 12:45:54.714 [rank:2] [train], epoch: 36/50, iter: 100/834, loss: 0.27513, top1: 0.69802, throughput: 1312.51 | 2022-05-21 12:45:54.714 [rank:6] [train], epoch: 36/50, iter: 200/834, loss: 0.27460, top1: 0.70135, throughput: 1327.75 | 2022-05-21 12:46:09.172 [rank:3] [train], epoch: 36/50, iter: 200/834, loss: 0.27396, top1: 0.70005, throughput: 1327.95 | 2022-05-21 12:46:09.173 [rank:2] [train], epoch: 36/50, iter: 200/834, loss: 0.27354, top1: 0.70281, throughput: 1327.84 | 2022-05-21 12:46:09.173 [rank:5] [train], epoch: 36/50, iter: 200/834, loss: 0.27587, top1: 0.69891, throughput: 1327.58 | 2022-05-21 12:46:09.173 [rank:7] [train], epoch: 36/50, iter: 200/834, loss: 0.27394, top1: 0.69870, throughput: 1327.35 | 2022-05-21 12:46:09.174 [rank:4] [train], epoch: 36/50, iter: 200/834, loss: 0.27499, top1: 0.69771, throughput: 1327.32 | 2022-05-21 12:46:09.175 [rank:1] [train], epoch: 36/50, iter: 200/834, loss: 0.27658, top1: 0.69568, throughput: 1327.46 | 2022-05-21 12:46:09.174 [rank:0] [train], epoch: 36/50, iter: 200/834, loss: 0.27601, top1: 0.69672, throughput: 1327.29 | 2022-05-21 12:46:09.175 [rank:4] [train], epoch: 36/50, iter: 300/834, loss: 0.27652, top1: 0.69318, throughput: 1321.74 | 2022-05-21 12:46:23.701 [rank:6] [train], epoch: 36/50, iter: 300/834, loss: 0.27792, top1: 0.69089, throughput: 1321.52 | 2022-05-21 12:46:23.701 [rank:5] [train], epoch: 36/50, iter: 300/834, loss: 0.27665, top1: 0.69505, throughput: 1321.59 | 2022-05-21 12:46:23.701 [rank:7] [train], epoch: 36/50, iter: 300/834, loss: 0.27963, top1: 0.68651, throughput: 1321.71 | 2022-05-21 12:46:23.701 [rank:0] [train], epoch: 36/50, iter: 300/834, loss: 0.27772, top1: 0.69495, throughput: 1321.68 | 2022-05-21 12:46:23.702 [rank:2] [train], epoch: 36/50, iter: 300/834, loss: 0.27512, top1: 0.69714, throughput: 1321.51 | 2022-05-21 12:46:23.702 [rank:3] [train], epoch: 36/50, iter: 300/834, loss: 0.27719, top1: 0.69276, throughput: 1321.32 | 2022-05-21 12:46:23.703 [rank:1] [train], epoch: 36/50, iter: 300/834, loss: 0.27543, top1: 0.69667, throughput: 1321.43 | 2022-05-21 12:46:23.704 [rank:4] [train], epoch: 36/50, iter: 400/834, loss: 0.27893, top1: 0.68906, throughput: 1328.34 | 2022-05-21 12:46:38.155 [rank:5] [train], epoch: 36/50, iter: 400/834, loss: 0.27615, top1: 0.69474, throughput: 1328.33 | 2022-05-21 12:46:38.155 [rank:6] [train], epoch: 36/50, iter: 400/834, loss: 0.27716, top1: 0.69266, throughput: 1328.15 | 2022-05-21 12:46:38.157 [rank:2] [train], epoch: 36/50, iter: 400/834, loss: 0.27768, top1: 0.69620, throughput: 1328.39 | 2022-05-21 12:46:38.156 [rank:0] [train], epoch: 36/50, iter: 400/834, loss: 0.27854, top1: 0.69380, throughput: 1328.36 | 2022-05-21 12:46:38.156 [rank:7] [train], epoch: 36/50, iter: 400/834, loss: 0.27503, top1: 0.69589, throughput: 1328.18 | 2022-05-21 12:46:38.157 [rank:3] [train], epoch: 36/50, iter: 400/834, loss: 0.27644, top1: 0.69896, throughput: 1328.42 | 2022-05-21 12:46:38.157 [rank:1] [train], epoch: 36/50, iter: 400/834, loss: 0.27590, top1: 0.69474, throughput: 1328.45 | 2022-05-21 12:46:38.157 [rank:5] [train], epoch: 36/50, iter: 500/834, loss: 0.27644, top1: 0.69547, throughput: 1328.97 | 2022-05-21 12:46:52.602 [rank:3] [train], epoch: 36/50, iter: 500/834, loss: 0.28046, top1: 0.68839, throughput: 1329.09 | 2022-05-21 12:46:52.603 [rank:6] [train], epoch: 36/50, iter: 500/834, loss: 0.27665, top1: 0.69318, throughput: 1329.05 | 2022-05-21 12:46:52.603 [rank:7] [train], epoch: 36/50, iter: 500/834, loss: 0.27847, top1: 0.68724, throughput: 1328.95 | 2022-05-21 12:46:52.604 [rank:1] [train], epoch: 36/50, iter: 500/834, loss: 0.27618, top1: 0.69464, throughput: 1328.98 | 2022-05-21 12:46:52.604 [rank:0] [train], epoch: 36/50, iter: 500/834, loss: 0.27592, top1: 0.69594, throughput: 1328.91 | 2022-05-21 12:46:52.604 [rank:2] [train], epoch: 36/50, iter: 500/834, loss: 0.27744, top1: 0.69385, throughput: 1328.90 | 2022-05-21 12:46:52.604 [rank:4] [train], epoch: 36/50, iter: 500/834, loss: 0.27587, top1: 0.69531, throughput: 1328.77 | 2022-05-21 12:46:52.604 [rank:7] [train], epoch: 36/50, iter: 600/834, loss: 0.27876, top1: 0.69286, throughput: 1327.05 | 2022-05-21 12:47:07.072 [rank:5] [train], epoch: 36/50, iter: 600/834, loss: 0.27780, top1: 0.69635, throughput: 1326.89 | 2022-05-21 12:47:07.072 [rank:3] [train], epoch: 36/50, iter: 600/834, loss: 0.27604, top1: 0.69875, throughput: 1326.88 | 2022-05-21 12:47:07.073 [rank:4] [train], epoch: 36/50, iter: 600/834, loss: 0.27678, top1: 0.69734, throughput: 1326.88 | 2022-05-21 12:47:07.074 [rank:0] [train], epoch: 36/50, iter: 600/834, loss: 0.27651, top1: 0.69219, throughput: 1326.96 | 2022-05-21 12:47:07.073 [rank:6] [train], epoch: 36/50, iter: 600/834, loss: 0.27821, top1: 0.69010, throughput: 1326.74 | 2022-05-21 12:47:07.075 [rank:1] [train], epoch: 36/50, iter: 600/834, loss: 0.27744, top1: 0.69000, throughput: 1326.51 | 2022-05-21 12:47:07.078 [rank:2] [train], epoch: 36/50, iter: 600/834, loss: 0.27881, top1: 0.69042, throughput: 1326.48 | 2022-05-21 12:47:07.078 [rank:6] [train], epoch: 36/50, iter: 700/834, loss: 0.27694, top1: 0.69510, throughput: 1329.11 | 2022-05-21 12:47:21.521 [rank:5] [train], epoch: 36/50, iter: 700/834, loss: 0.27634, top1: 0.69422, throughput: 1328.71 | 2022-05-21 12:47:21.522 [rank:4] [train], epoch: 36/50, iter: 700/834, loss: 0.27948, top1: 0.68771, throughput: 1329.03 | 2022-05-21 12:47:21.521 [rank:7] [train], epoch: 36/50, iter: 700/834, loss: 0.27819, top1: 0.69104, throughput: 1328.82 | 2022-05-21 12:47:21.521 [rank:2] [train], epoch: 36/50, iter: 700/834, loss: 0.27721, top1: 0.69260, throughput: 1329.31 | 2022-05-21 12:47:21.522 [rank:1] [train], epoch: 36/50, iter: 700/834, loss: 0.27708, top1: 0.69328, throughput: 1329.36 | 2022-05-21 12:47:21.521 [rank:0] [train], epoch: 36/50, iter: 700/834, loss: 0.28004, top1: 0.68818, throughput: 1328.73 | 2022-05-21 12:47:21.523 [rank:3] [train], epoch: 36/50, iter: 700/834, loss: 0.27855, top1: 0.69104, throughput: 1328.68 | 2022-05-21 12:47:21.523 [rank:4] [train], epoch: 36/50, iter: 800/834, loss: 0.27484, top1: 0.69938, throughput: 1326.40 | 2022-05-21 12:47:35.996 [rank:7] [train], epoch: 36/50, iter: 800/834, loss: 0.27661, top1: 0.69469, throughput: 1326.41 | 2022-05-21 12:47:35.996 [rank:5] [train], epoch: 36/50, iter: 800/834, loss: 0.27628, top1: 0.69839, throughput: 1326.49 | 2022-05-21 12:47:35.997 [rank:6] [train], epoch: 36/50, iter: 800/834, loss: 0.27676, top1: 0.69729, throughput: 1326.19 | 2022-05-21 12:47:35.998 [rank:1] [train], epoch: 36/50, iter: 800/834, loss: 0.27768, top1: 0.69495, throughput: 1326.19 | 2022-05-21 12:47:35.999 [rank:0] [train], epoch: 36/50, iter: 800/834, loss: 0.27728, top1: 0.69521, throughput: 1326.32 | 2022-05-21 12:47:35.999 [rank:2] [train], epoch: 36/50, iter: 800/834, loss: 0.27874, top1: 0.68911, throughput: 1326.29 | 2022-05-21 12:47:35.998 [rank:3] [train], epoch: 36/50, iter: 800/834, loss: 0.27983, top1: 0.68911, throughput: 1326.31 | 2022-05-21 12:47:36.000 [rank:7] [train], epoch: 36/50, iter: 834/834, loss: 0.27625, top1: 0.69516, throughput: 1323.87 | 2022-05-21 12:47:40.927 [rank:4] [train], epoch: 36/50, iter: 834/834, loss: 0.27780, top1: 0.68658, throughput: 1323.84 | 2022-05-21 12:47:40.928 [rank:2] [train], epoch: 36/50, iter: 834/834, loss: 0.27707, top1: 0.69072, throughput: 1324.36 | 2022-05-21 12:47:40.927 [rank:6] [train], epoch: 36/50, iter: 834/834, loss: 0.27916, top1: 0.69133, throughput: 1324.21 | 2022-05-21 12:47:40.928 [rank:5] [train], epoch: 36/50, iter: 834/834, loss: 0.27773, top1: 0.68765, throughput: 1323.79 | 2022-05-21 12:47:40.928 [rank:3] [train], epoch: 36/50, iter: 834/834, loss: 0.27432, top1: 0.69746, throughput: 1324.06 | 2022-05-21 12:47:40.930 [rank:0] [train], epoch: 36/50, iter: 834/834, loss: 0.27535, top1: 0.69562, throughput: 1323.98 | 2022-05-21 12:47:40.930 [rank:1] [train], epoch: 36/50, iter: 834/834, loss: 0.27534, top1: 0.69914, throughput: 1323.90 | 2022-05-21 12:47:40.930 [rank:0] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.71120, throughput: 571.62 | 2022-05-21 12:47:51.864 [rank:7] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.70656, throughput: 571.26 | 2022-05-21 12:47:51.868 [rank:2] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.70016, throughput: 571.18 | 2022-05-21 12:47:51.870 [rank:4] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.70560, throughput: 567.84 | 2022-05-21 12:47:51.934 [rank:3] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.70496, throughput: 567.83 | 2022-05-21 12:47:51.937 [rank:6] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.70064, throughput: 561.78 | 2022-05-21 12:47:52.053 [rank:1] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.70480, throughput: 557.27 | 2022-05-21 12:47:52.145 [rank:5] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.69696, throughput: 553.43 | 2022-05-21 12:47:52.221 [rank:3] [train], epoch: 37/50, iter: 100/834, loss: 0.26912, top1: 0.71510, throughput: 1307.66 | 2022-05-21 12:48:06.619 [rank:0] [train], epoch: 37/50, iter: 100/834, loss: 0.27267, top1: 0.70562, throughput: 1301.15 | 2022-05-21 12:48:06.620 [rank:6] [train], epoch: 37/50, iter: 100/834, loss: 0.27233, top1: 0.70260, throughput: 1317.99 | 2022-05-21 12:48:06.621 [rank:5] [train], epoch: 37/50, iter: 100/834, loss: 0.26863, top1: 0.71078, throughput: 1333.36 | 2022-05-21 12:48:06.621 [rank:7] [train], epoch: 37/50, iter: 100/834, loss: 0.27467, top1: 0.70052, throughput: 1301.48 | 2022-05-21 12:48:06.621 [rank:2] [train], epoch: 37/50, iter: 100/834, loss: 0.26860, top1: 0.71104, throughput: 1301.63 | 2022-05-21 12:48:06.620 [rank:4] [train], epoch: 37/50, iter: 100/834, loss: 0.27190, top1: 0.70391, throughput: 1307.27 | 2022-05-21 12:48:06.621 [rank:1] [train], epoch: 37/50, iter: 100/834, loss: 0.27353, top1: 0.70396, throughput: 1326.30 | 2022-05-21 12:48:06.621 [rank:7] [train], epoch: 37/50, iter: 200/834, loss: 0.27419, top1: 0.69906, throughput: 1327.06 | 2022-05-21 12:48:21.089 [rank:3] [train], epoch: 37/50, iter: 200/834, loss: 0.27337, top1: 0.70052, throughput: 1326.98 | 2022-05-21 12:48:21.088 [rank:0] [train], epoch: 37/50, iter: 200/834, loss: 0.27158, top1: 0.70328, throughput: 1326.77 | 2022-05-21 12:48:21.091 [rank:5] [train], epoch: 37/50, iter: 200/834, loss: 0.27070, top1: 0.70792, throughput: 1326.90 | 2022-05-21 12:48:21.091 [rank:1] [train], epoch: 37/50, iter: 200/834, loss: 0.27168, top1: 0.70542, throughput: 1327.00 | 2022-05-21 12:48:21.090 [rank:4] [train], epoch: 37/50, iter: 200/834, loss: 0.27266, top1: 0.70339, throughput: 1326.91 | 2022-05-21 12:48:21.091 [rank:2] [train], epoch: 37/50, iter: 200/834, loss: 0.26981, top1: 0.70943, throughput: 1327.01 | 2022-05-21 12:48:21.089 [rank:6] [train], epoch: 37/50, iter: 200/834, loss: 0.27262, top1: 0.70406, throughput: 1326.85 | 2022-05-21 12:48:21.091 [rank:3] [train], epoch: 37/50, iter: 300/834, loss: 0.27388, top1: 0.69964, throughput: 1322.51 | 2022-05-21 12:48:35.606 [rank:5] [train], epoch: 37/50, iter: 300/834, loss: 0.27206, top1: 0.70370, throughput: 1322.67 | 2022-05-21 12:48:35.607 [rank:0] [train], epoch: 37/50, iter: 300/834, loss: 0.27381, top1: 0.69937, throughput: 1322.74 | 2022-05-21 12:48:35.606 [rank:6] [train], epoch: 37/50, iter: 300/834, loss: 0.27107, top1: 0.70469, throughput: 1322.71 | 2022-05-21 12:48:35.607 [rank:4] [train], epoch: 37/50, iter: 300/834, loss: 0.27167, top1: 0.70505, throughput: 1322.66 | 2022-05-21 12:48:35.607 [rank:1] [train], epoch: 37/50, iter: 300/834, loss: 0.27210, top1: 0.70651, throughput: 1322.52 | 2022-05-21 12:48:35.608 [rank:7] [train], epoch: 37/50, iter: 300/834, loss: 0.26955, top1: 0.71276, throughput: 1322.51 | 2022-05-21 12:48:35.607 [rank:2] [train], epoch: 37/50, iter: 300/834, loss: 0.27227, top1: 0.70437, throughput: 1322.38 | 2022-05-21 12:48:35.608 [rank:5] [train], epoch: 37/50, iter: 400/834, loss: 0.27018, top1: 0.70833, throughput: 1327.32 | 2022-05-21 12:48:50.072 [rank:2] [train], epoch: 37/50, iter: 400/834, loss: 0.27335, top1: 0.70026, throughput: 1327.53 | 2022-05-21 12:48:50.071 [rank:4] [train], epoch: 37/50, iter: 400/834, loss: 0.26812, top1: 0.71354, throughput: 1327.39 | 2022-05-21 12:48:50.072 [rank:1] [train], epoch: 37/50, iter: 400/834, loss: 0.27195, top1: 0.70490, throughput: 1327.46 | 2022-05-21 12:48:50.071 [rank:7] [train], epoch: 37/50, iter: 400/834, loss: 0.27315, top1: 0.69750, throughput: 1327.38 | 2022-05-21 12:48:50.071 [rank:3] [train], epoch: 37/50, iter: 400/834, loss: 0.27225, top1: 0.70708, throughput: 1327.18 | 2022-05-21 12:48:50.073 [rank:6] [train], epoch: 37/50, iter: 400/834, loss: 0.27185, top1: 0.70536, throughput: 1327.23 | 2022-05-21 12:48:50.073 [rank:0] [train], epoch: 37/50, iter: 400/834, loss: 0.27090, top1: 0.70667, throughput: 1327.18 | 2022-05-21 12:48:50.073 [rank:7] [train], epoch: 37/50, iter: 500/834, loss: 0.27092, top1: 0.70250, throughput: 1324.94 | 2022-05-21 12:49:04.562 [rank:5] [train], epoch: 37/50, iter: 500/834, loss: 0.27175, top1: 0.70286, throughput: 1325.15 | 2022-05-21 12:49:04.561 [rank:1] [train], epoch: 37/50, iter: 500/834, loss: 0.27385, top1: 0.70531, throughput: 1325.13 | 2022-05-21 12:49:04.561 [rank:2] [train], epoch: 37/50, iter: 500/834, loss: 0.27124, top1: 0.70688, throughput: 1325.13 | 2022-05-21 12:49:04.561 [rank:4] [train], epoch: 37/50, iter: 500/834, loss: 0.27355, top1: 0.70234, throughput: 1325.10 [rank:3] [train], epoch: 37/50, iter: 500/834, loss: 0.27354, top1: 0.70036, throughput: 1325.23 | 2022-05-21 12:49:04.561 | 2022-05-21 12:49:04.561 [rank:6] [train], epoch: 37/50, iter: 500/834, loss: 0.27475, top1: 0.70005, throughput: 1325.21 | 2022-05-21 12:49:04.561 [rank:0] [train], epoch: 37/50, iter: 500/834, loss: 0.26900, top1: 0.71167, throughput: 1325.08 | 2022-05-21 12:49:04.563 [rank:4] [train], epoch: 37/50, iter: 600/834, loss: 0.27126, top1: 0.70786, throughput: 1320.74 | 2022-05-21 12:49:19.099 [rank:6] [train], epoch: 37/50, iter: 600/834, loss: 0.27210, top1: 0.70557, throughput: 1320.79 | 2022-05-21 12:49:19.098 [rank:7] [train], epoch: 37/50, iter: 600/834, loss: 0.26933, top1: 0.70823, throughput: 1320.73 | 2022-05-21 12:49:19.100 [rank:5] [train], epoch: 37/50, iter: 600/834, loss: 0.27370, top1: 0.70010, throughput: 1320.39 | 2022-05-21 12:49:19.102 [rank:0] [train], epoch: 37/50, iter: 600/834, loss: 0.27400, top1: 0.69771, throughput: 1320.67 | 2022-05-21 12:49:19.101 [rank:1] [train], epoch: 37/50, iter: 600/834, loss: 0.27208, top1: 0.70281, throughput: 1320.46 | 2022-05-21 12:49:19.101 [rank:3] [train], epoch: 37/50, iter: 600/834, loss: 0.27212, top1: 0.70464, throughput: 1320.29 | 2022-05-21 12:49:19.103 [rank:2] [train], epoch: 37/50, iter: 600/834, loss: 0.27461, top1: 0.69906, throughput: 1320.25 | 2022-05-21 12:49:19.103 [rank:4] [train], epoch: 37/50, iter: 700/834, loss: 0.27289, top1: 0.70354, throughput: 1328.02 | 2022-05-21 12:49:33.556 [rank:5] [train], epoch: 37/50, iter: 700/834, loss: 0.27199, top1: 0.70396, throughput: 1328.33 | 2022-05-21 12:49:33.556 [rank:3] [train], epoch: 37/50, iter: 700/834, loss: 0.27470, top1: 0.70094, throughput: 1328.41 | 2022-05-21 12:49:33.556 [rank:7] [train], epoch: 37/50, iter: 700/834, loss: 0.27379, top1: 0.70120, throughput: 1328.05 | 2022-05-21 12:49:33.557 [rank:0] [train], epoch: 37/50, iter: 700/834, loss: 0.27373, top1: 0.70036, throughput: 1328.09 | 2022-05-21 12:49:33.558 [rank:6] [train], epoch: 37/50, iter: 700/834, loss: 0.27458, top1: 0.69953, throughput: 1327.83 | 2022-05-21 12:49:33.558 [rank:2] [train], epoch: 37/50, iter: 700/834, loss: 0.27090, top1: 0.70464, throughput: 1328.38 | 2022-05-21 12:49:33.557 [rank:1] [train], epoch: 37/50, iter: 700/834, loss: 0.27183, top1: 0.70292, throughput: 1328.06 | 2022-05-21 12:49:33.558 [rank:6] [train], epoch: 37/50, iter: 800/834, loss: 0.27560, top1: 0.69505, throughput: 1325.69 | 2022-05-21 12:49:48.041 [rank:4] [train], epoch: 37/50, iter: 800/834, loss: 0.27425, top1: 0.69802, throughput: 1325.60 | 2022-05-21 12:49:48.040 [rank:7] [train], epoch: 37/50, iter: 800/834, loss: 0.27304, top1: 0.69880, throughput: 1325.67 | 2022-05-21 12:49:48.040 [rank:2] [train], epoch: 37/50, iter: 800/834, loss: 0.27169, top1: 0.70427, throughput: 1325.65 | 2022-05-21 12:49:48.040 [rank:0] [train], epoch: 37/50, iter: 800/834, loss: 0.27151, top1: 0.70193, throughput: 1325.63 | 2022-05-21 12:49:48.041 [rank:3] [train], epoch: 37/50, iter: 800/834, loss: 0.27395, top1: 0.70120, throughput: 1325.42 | 2022-05-21 12:49:48.042 [rank:5] [train], epoch: 37/50, iter: 800/834, loss: 0.27321, top1: 0.70172, throughput: 1325.39 | 2022-05-21 12:49:48.043 [rank:1] [train], epoch: 37/50, iter: 800/834, loss: 0.27559, top1: 0.69922, throughput: 1325.53 | 2022-05-21 12:49:48.043 [rank:1] [train], epoch: 37/50, iter: 834/834, loss: 0.27097, top1: 0.70542, throughput: 1326.60 | 2022-05-21 12:49:52.964 [rank:3] [train], epoch: 37/50, iter: 834/834, loss: 0.27191, top1: 0.70205, throughput: 1326.31 | 2022-05-21 12:49:52.964 [rank:5] [train], epoch: 37/50, iter: 834/834, loss: 0.27265, top1: 0.70680, throughput: 1326.41 | 2022-05-21 12:49:52.964 [rank:4] [train], epoch: 37/50, iter: 834/834, loss: 0.27760, top1: 0.69317, throughput: 1325.74 | 2022-05-21 12:49:52.964 [rank:6] [train], epoch: 37/50, iter: 834/834, loss: 0.27296, top1: 0.70741, throughput: 1325.88 | 2022-05-21 12:49:52.964 [rank:0] [train], epoch: 37/50, iter: 834/834, loss: 0.27200, top1: 0.70282, throughput: 1325.75 | 2022-05-21 12:49:52.965 [rank:7] [train], epoch: 37/50, iter: 834/834, loss: 0.27077, top1: 0.70481, throughput: 1325.53 | 2022-05-21 12:49:52.965 [rank:2] [train], epoch: 37/50, iter: 834/834, loss: 0.27118, top1: 0.70619, throughput: 1325.07 | 2022-05-21 12:49:52.967 [rank:7] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.71424, throughput: 578.69 | 2022-05-21 12:50:03.766 [rank:0] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.71472, throughput: 578.04 | 2022-05-21 12:50:03.778 [rank:2] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.70480, throughput: 575.22 | 2022-05-21 12:50:03.832 [rank:6] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.71552, throughput: 570.83 | 2022-05-21 12:50:03.913 [rank:1] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.71712, throughput: 570.10 | 2022-05-21 12:50:03.927 [rank:3] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.70672, throughput: 567.21 | 2022-05-21 12:50:03.983 [rank:4] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.70416, throughput: 562.31 | 2022-05-21 12:50:04.079 [rank:5] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.70176, throughput: 560.83 | 2022-05-21 12:50:04.108 [rank:5] [train], epoch: 38/50, iter: 100/834, loss: 0.26733, top1: 0.71516, throughput: 1330.19 | 2022-05-21 12:50:18.542 [rank:7] [train], epoch: 38/50, iter: 100/834, loss: 0.26555, top1: 0.71870, throughput: 1299.16 | 2022-05-21 12:50:18.544 [rank:4] [train], epoch: 38/50, iter: 100/834, loss: 0.26652, top1: 0.71776, throughput: 1327.36 | 2022-05-21 12:50:18.544 [rank:6] [train], epoch: 38/50, iter: 100/834, loss: 0.26826, top1: 0.71219, throughput: 1312.35[rank:0] [train], epoch: 38/50, iter: 100/834, loss: 0.26871, top1: 0.71161, throughput: 1300.26 | 2022-05-21 12:50:18.544 | 2022-05-21 12:50:18.544 [rank:3] [train], epoch: 38/50, iter: 100/834, loss: 0.26688, top1: 0.71729, throughput: 1318.63 | 2022-05-21 12:50:18.544 [rank:1] [train], epoch: 38/50, iter: 100/834, loss: 0.26630, top1: 0.71771, throughput: 1313.54 | 2022-05-21 12:50:18.544 [rank:2] [train], epoch: 38/50, iter: 100/834, loss: 0.26990, top1: 0.70818, throughput: 1305.09 | 2022-05-21 12:50:18.544 [rank:7] [train], epoch: 38/50, iter: 200/834, loss: 0.26741, top1: 0.71255, throughput: 1329.78 | 2022-05-21 12:50:32.983 [rank:5] [train], epoch: 38/50, iter: 200/834, loss: 0.26745, top1: 0.71068, throughput: 1329.51 | 2022-05-21 12:50:32.984 [rank:0] [train], epoch: 38/50, iter: 200/834, loss: 0.26847, top1: 0.71328, throughput: 1329.61 | 2022-05-21 12:50:32.984 [rank:6] [train], epoch: 38/50, iter: 200/834, loss: 0.26775, top1: 0.71536, throughput: 1329.63 | 2022-05-21 12:50:32.984 [rank:4] [train], epoch: 38/50, iter: 200/834, loss: 0.26787, top1: 0.71312, throughput: 1329.49 | 2022-05-21 12:50:32.986 [rank:1] [train], epoch: 38/50, iter: 200/834, loss: 0.26727, top1: 0.71609, throughput: 1329.45 | 2022-05-21 12:50:32.986 [rank:2] [train], epoch: 38/50, iter: 200/834, loss: 0.26864, top1: 0.71000, throughput: 1329.21 | 2022-05-21 12:50:32.989 [rank:3] [train], epoch: 38/50, iter: 200/834, loss: 0.26938, top1: 0.70859, throughput: 1329.11 | 2022-05-21 12:50:32.989 [rank:7] [train], epoch: 38/50, iter: 300/834, loss: 0.26974, top1: 0.70818, throughput: 1328.73 | 2022-05-21 12:50:47.433 [rank:0] [train], epoch: 38/50, iter: 300/834, loss: 0.26851, top1: 0.71172, throughput: 1329.03 | 2022-05-21 12:50:47.431 [rank:2] [train], epoch: 38/50, iter: 300/834, loss: 0.26719, top1: 0.71396, throughput: 1329.49 | 2022-05-21 12:50:47.430 [rank:5] [train], epoch: 38/50, iter: 300/834, loss: 0.26734, top1: 0.71344, throughput: 1328.82 | 2022-05-21 12:50:47.433 [rank:4] [train], epoch: 38/50, iter: 300/834, loss: 0.26868, top1: 0.71062, throughput: 1329.01 | 2022-05-21 12:50:47.433 [rank:6] [train], epoch: 38/50, iter: 300/834, loss: 0.26779, top1: 0.71490, throughput: 1328.83 | 2022-05-21 12:50:47.433 [rank:1] [train], epoch: 38/50, iter: 300/834, loss: 0.26935, top1: 0.70620, throughput: 1329.03 | 2022-05-21 12:50:47.432 [rank:3] [train], epoch: 38/50, iter: 300/834, loss: 0.26802, top1: 0.71323, throughput: 1329.45 | 2022-05-21 12:50:47.432 [rank:6] [train], epoch: 38/50, iter: 400/834, loss: 0.26726, top1: 0.71453, throughput: 1329.78 | 2022-05-21 12:51:01.871 [rank:5] [train], epoch: 38/50, iter: 400/834, loss: 0.26686, top1: 0.71505, throughput: 1329.86 | 2022-05-21 12:51:01.870 [rank:4] [train], epoch: 38/50, iter: 400/834, loss: 0.26950, top1: 0.70797, throughput: 1329.81 | 2022-05-21 12:51:01.871 [rank:7] [train], epoch: 38/50, iter: 400/834, loss: 0.26649, top1: 0.71505, throughput: 1329.88 | 2022-05-21 12:51:01.870 [rank:0] [train], epoch: 38/50, iter: 400/834, loss: 0.27041, top1: 0.70417, throughput: 1329.68 | 2022-05-21 12:51:01.871 [rank:1] [train], epoch: 38/50, iter: 400/834, loss: 0.27004, top1: 0.71089, throughput: 1329.80 | 2022-05-21 12:51:01.871 [rank:3] [train], epoch: 38/50, iter: 400/834, loss: 0.26894, top1: 0.71047, throughput: 1329.13 | 2022-05-21 12:51:01.877 [rank:2] [train], epoch: 38/50, iter: 400/834, loss: 0.26741, top1: 0.71484, throughput: 1329.09 | 2022-05-21 12:51:01.876 [rank:0] [train], epoch: 38/50, iter: 500/834, loss: 0.26684, top1: 0.71188, throughput: 1327.12 | 2022-05-21 12:51:16.338 [rank:5] [train], epoch: 38/50, iter: 500/834, loss: 0.26774, top1: 0.71641, throughput: 1326.96 | 2022-05-21 12:51:16.339 [rank:2] [train], epoch: 38/50, iter: 500/834, loss: 0.27291, top1: 0.70104, throughput: 1327.55 | 2022-05-21 12:51:16.339 [rank:7] [train], epoch: 38/50, iter: 500/834, loss: 0.27030, top1: 0.70760, throughput: 1326.92 | 2022-05-21 12:51:16.340 [rank:4] [train], epoch: 38/50, iter: 500/834, loss: 0.26654, top1: 0.71146, throughput: 1326.93[rank:3] [train], epoch: 38/50, iter: 500/834, loss: 0.27059, top1: 0.70703, throughput: 1327.57 | 2022-05-21 12:51:16.340 | 2022-05-21 12:51:16.340 [rank:1] [train], epoch: 38/50, iter: 500/834, loss: 0.26728, top1: 0.71521, throughput: 1327.00 | 2022-05-21 12:51:16.339 [rank:6] [train], epoch: 38/50, iter: 500/834, loss: 0.26904, top1: 0.70812, throughput: 1326.99 | 2022-05-21 12:51:16.340 [rank:1] [train], epoch: 38/50, iter: 600/834, loss: 0.26888, top1: 0.71193, throughput: 1327.23[rank:7] [train], epoch: 38/50, iter: 600/834, loss: 0.27009, top1: 0.70865, throughput: 1327.15 | 2022-05-21 12:51:30.806| 2022-05-21 12:51:30.807 [rank:2] [train], epoch: 38/50, iter: 600/834, loss: 0.26883, top1: 0.71057, throughput: 1327.19 | 2022-05-21 12:51:30.806 [rank:6] [train], epoch: 38/50, iter: 600/834, loss: 0.26796, top1: 0.71651, throughput: 1327.09[rank:4] [train], epoch: 38/50, iter: 600/834, loss: 0.26860, top1: 0.71188, throughput: 1327.12 | 2022-05-21 12:51:30.808 | 2022-05-21 12:51:30.808 [rank:5] [train], epoch: 38/50, iter: 600/834, loss: 0.26857, top1: 0.71604, throughput: 1327.07 | 2022-05-21 12:51:30.807 [rank:0] [train], epoch: 38/50, iter: 600/834, loss: 0.26798, top1: 0.71318, throughput: 1326.97 | 2022-05-21 12:51:30.807 [rank:3] [train], epoch: 38/50, iter: 600/834, loss: 0.26709, top1: 0.71219, throughput: 1327.06 | 2022-05-21 12:51:30.808 [rank:4] [train], epoch: 38/50, iter: 700/834, loss: 0.26753, top1: 0.71396, throughput: 1328.13 | 2022-05-21 12:51:45.264 [rank:5] [train], epoch: 38/50, iter: 700/834, loss: 0.27168, top1: 0.70526, throughput: 1328.21 | 2022-05-21 12:51:45.263 [rank:7] [train], epoch: 38/50, iter: 700/834, loss: 0.27024, top1: 0.70427, throughput: 1328.05 [rank:1] [train], epoch: 38/50, iter: 700/834, loss: 0.26623, top1: 0.71448, throughput: 1328.07| 2022-05-21 12:51:45.264 | 2022-05-21 12:51:45.263 [rank:6] [train], epoch: 38/50, iter: 700/834, loss: 0.27085, top1: 0.70802, throughput: 1328.00 | 2022-05-21 12:51:45.265 [rank:0] [train], epoch: 38/50, iter: 700/834, loss: 0.26885, top1: 0.71266, throughput: 1328.03 | 2022-05-21 12:51:45.264 [rank:3] [train], epoch: 38/50, iter: 700/834, loss: 0.26735, top1: 0.71370, throughput: 1328.15 | 2022-05-21 12:51:45.264 [rank:2] [train], epoch: 38/50, iter: 700/834, loss: 0.27045, top1: 0.70651, throughput: 1327.79 | 2022-05-21 12:51:45.266 [rank:5] [train], epoch: 38/50, iter: 800/834, loss: 0.26746, top1: 0.71661, throughput: 1325.12 | 2022-05-21 12:51:59.752 [rank:7] [train], epoch: 38/50, iter: 800/834, loss: 0.26537, top1: 0.71677, throughput: 1325.28 | 2022-05-21 12:51:59.752 [rank:6] [train], epoch: 38/50, iter: 800/834, loss: 0.26621, top1: 0.71432, throughput: 1325.26 | 2022-05-21 12:51:59.753 [rank:3] [train], epoch: 38/50, iter: 800/834, loss: 0.26983, top1: 0.70927, throughput: 1325.19 | 2022-05-21 12:51:59.752 [rank:1] [train], epoch: 38/50, iter: 800/834, loss: 0.26719, top1: 0.71083, throughput: 1325.11 | 2022-05-21 12:51:59.752 [rank:4] [train], epoch: 38/50, iter: 800/834, loss: 0.26783, top1: 0.71208, throughput: 1325.03 | 2022-05-21 12:51:59.754 [rank:2] [train], epoch: 38/50, iter: 800/834, loss: 0.26828, top1: 0.71391, throughput: 1325.19 | 2022-05-21 12:51:59.754 [rank:0] [train], epoch: 38/50, iter: 800/834, loss: 0.26616, top1: 0.71755, throughput: 1325.01 | 2022-05-21 12:51:59.755 [rank:7] [train], epoch: 38/50, iter: 834/834, loss: 0.26943, top1: 0.70665, throughput: 1326.04 | 2022-05-21 12:52:04.675 [rank:0] [train], epoch: 38/50, iter: 834/834, loss: 0.26124, top1: 0.72518, throughput: 1326.70 | 2022-05-21 12:52:04.675 [rank:2] [train], epoch: 38/50, iter: 834/834, loss: 0.27141, top1: 0.70358, throughput: 1326.69 | 2022-05-21 12:52:04.675 [rank:1] [train], epoch: 38/50, iter: 834/834, loss: 0.26741, top1: 0.71599, throughput: 1326.04 | 2022-05-21 12:52:04.675 [rank:3] [train], epoch: 38/50, iter: 834/834, loss: 0.26577, top1: 0.72013, throughput: 1325.63 | 2022-05-21 12:52:04.677 [rank:5] [train], epoch: 38/50, iter: 834/834, loss: 0.26703, top1: 0.70956, throughput: 1325.54 | 2022-05-21 12:52:04.677 [rank:4] [train], epoch: 38/50, iter: 834/834, loss: 0.26557, top1: 0.71569, throughput: 1326.06 | 2022-05-21 12:52:04.677 [rank:6] [train], epoch: 38/50, iter: 834/834, loss: 0.27003, top1: 0.70787, throughput: 1325.71 | 2022-05-21 12:52:04.677 [rank:0] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.72608, throughput: 573.40 | 2022-05-21 12:52:15.575 [rank:2] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.71216, throughput: 573.33 | 2022-05-21 12:52:15.576 [rank:7] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.72448, throughput: 572.98 | 2022-05-21 12:52:15.582 [rank:4] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.71232, throughput: 570.92 | 2022-05-21 12:52:15.624 [rank:3] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.71344, throughput: 569.23 | 2022-05-21 12:52:15.657 [rank:5] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.71040, throughput: 567.86 | 2022-05-21 12:52:15.683 [rank:6] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.72096, throughput: 566.89 | 2022-05-21 12:52:15.702 [rank:1] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.72080, throughput: 557.73 | 2022-05-21 12:52:15.881 [rank:3] [train], epoch: 39/50, iter: 100/834, loss: 0.26233, top1: 0.72505, throughput: 1310.52 | 2022-05-21 12:52:30.307 [rank:6] [train], epoch: 39/50, iter: 100/834, loss: 0.26361, top1: 0.71969, throughput: 1314.62 | 2022-05-21 12:52:30.307 [rank:1] [train], epoch: 39/50, iter: 100/834, loss: 0.26214, top1: 0.72583, throughput: 1330.96 | 2022-05-21 12:52:30.307 [rank:7] [train], epoch: 39/50, iter: 100/834, loss: 0.25984, top1: 0.73391, throughput: 1303.90 | 2022-05-21 12:52:30.308 [rank:4] [train], epoch: 39/50, iter: 100/834, loss: 0.26387, top1: 0.72057, throughput: 1307.49 | 2022-05-21 12:52:30.309 [rank:5] [train], epoch: 39/50, iter: 100/834, loss: 0.26434, top1: 0.71984, throughput: 1312.75 | 2022-05-21 12:52:30.309 [rank:0] [train], epoch: 39/50, iter: 100/834, loss: 0.26307, top1: 0.72396, throughput: 1303.07 | 2022-05-21 12:52:30.310 [rank:2] [train], epoch: 39/50, iter: 100/834, loss: 0.25967, top1: 0.72927, throughput: 1303.16 | 2022-05-21 12:52:30.310 [rank:6] [train], epoch: 39/50, iter: 200/834, loss: 0.26384, top1: 0.72047, throughput: 1327.56 | 2022-05-21 12:52:44.770 [rank:5] [train], epoch: 39/50, iter: 200/834, loss: 0.26402, top1: 0.72240, throughput: 1327.79 | 2022-05-21 12:52:44.769 [rank:0] [train], epoch: 39/50, iter: 200/834, loss: 0.26456, top1: 0.72141, throughput: 1327.58 | 2022-05-21 12:52:44.772 [rank:2] [train], epoch: 39/50, iter: 200/834, loss: 0.26156, top1: 0.72448, throughput: 1327.65 | 2022-05-21 12:52:44.771 [rank:1] [train], epoch: 39/50, iter: 200/834, loss: 0.26262, top1: 0.72531, throughput: 1327.45 | 2022-05-21 12:52:44.771 [rank:3] [train], epoch: 39/50, iter: 200/834, loss: 0.26340, top1: 0.72349, throughput: 1327.40 | 2022-05-21 12:52:44.772 [rank:7] [train], epoch: 39/50, iter: 200/834, loss: 0.26375, top1: 0.72167, throughput: 1327.43 | 2022-05-21 12:52:44.772 [rank:4] [train], epoch: 39/50, iter: 200/834, loss: 0.26139, top1: 0.72250, throughput: 1327.33 | 2022-05-21 12:52:44.774 [rank:7] [train], epoch: 39/50, iter: 300/834, loss: 0.26257, top1: 0.72453, throughput: 1327.28 | 2022-05-21 12:52:59.237 [rank:3] [train], epoch: 39/50, iter: 300/834, loss: 0.26362, top1: 0.72000, throughput: 1327.17 | 2022-05-21 12:52:59.239 [rank:1] [train], epoch: 39/50, iter: 300/834, loss: 0.26555, top1: 0.71729, throughput: 1327.00 | 2022-05-21 12:52:59.239 [rank:6] [train], epoch: 39/50, iter: 300/834, loss: 0.26137, top1: 0.72573, throughput: 1326.95 | 2022-05-21 12:52:59.239 [rank:0] [train], epoch: 39/50, iter: 300/834, loss: 0.26412, top1: 0.72344, throughput: 1327.12 | 2022-05-21 12:52:59.239 [rank:2] [train], epoch: 39/50, iter: 300/834, loss: 0.26400, top1: 0.72203, throughput: 1326.95 | 2022-05-21 12:52:59.240 [rank:4] [train], epoch: 39/50, iter: 300/834, loss: 0.26378, top1: 0.72281, throughput: 1327.13 | 2022-05-21 12:52:59.241 [rank:5] [train], epoch: 39/50, iter: 300/834, loss: 0.26277, top1: 0.72505, throughput: 1326.69 | 2022-05-21 12:52:59.241 [rank:4] [train], epoch: 39/50, iter: 400/834, loss: 0.26491, top1: 0.71932, throughput: 1329.86 | 2022-05-21 12:53:13.679 [rank:7] [train], epoch: 39/50, iter: 400/834, loss: 0.26236, top1: 0.72594, throughput: 1329.45 | 2022-05-21 12:53:13.679 [rank:6] [train], epoch: 39/50, iter: 400/834, loss: 0.26410, top1: 0.72010, throughput: 1329.62 | 2022-05-21 12:53:13.679 [rank:5] [train], epoch: 39/50, iter: 400/834, loss: 0.26375, top1: 0.72552, throughput: 1329.83 | 2022-05-21 12:53:13.679 [rank:2] [train], epoch: 39/50, iter: 400/834, loss: 0.26442, top1: 0.72203, throughput: 1329.77 | 2022-05-21 12:53:13.679 [rank:0] [train], epoch: 39/50, iter: 400/834, loss: 0.26277, top1: 0.72250, throughput: 1329.59 | 2022-05-21 12:53:13.680 [rank:3] [train], epoch: 39/50, iter: 400/834, loss: 0.26454, top1: 0.72260, throughput: 1329.46 | 2022-05-21 12:53:13.680 [rank:1] [train], epoch: 39/50, iter: 400/834, loss: 0.26206, top1: 0.72214, throughput: 1329.50 | 2022-05-21 12:53:13.681 [rank:6] [train], epoch: 39/50, iter: 500/834, loss: 0.26335, top1: 0.72406, throughput: 1328.84 | 2022-05-21 12:53:28.128 [rank:4] [train], epoch: 39/50, iter: 500/834, loss: 0.26321, top1: 0.72323, throughput: 1328.84[rank:3] [train], epoch: 39/50, iter: 500/834, loss: 0.26355, top1: 0.71958, throughput: 1328.92 | 2022-05-21 12:53:28.128 | 2022-05-21 12:53:28.128 [rank:0] [train], epoch: 39/50, iter: 500/834, loss: 0.26541, top1: 0.71589, throughput: 1328.88 | 2022-05-21 12:53:28.128 [rank:1] [train], epoch: 39/50, iter: 500/834, loss: 0.26762, top1: 0.71089, throughput: 1328.95 | 2022-05-21 12:53:28.128 [rank:2] [train], epoch: 39/50, iter: 500/834, loss: 0.26240, top1: 0.72151, throughput: 1328.62 | 2022-05-21 12:53:28.130 [rank:5] [train], epoch: 39/50, iter: 500/834, loss: 0.26235, top1: 0.72359, throughput: 1328.43 | 2022-05-21 12:53:28.132 [rank:7] [train], epoch: 39/50, iter: 500/834, loss: 0.26530, top1: 0.71448, throughput: 1328.44 | 2022-05-21 12:53:28.132 [rank:5] [train], epoch: 39/50, iter: 600/834, loss: 0.26463, top1: 0.71974, throughput: 1327.24 | 2022-05-21 12:53:42.598 [rank:4] [train], epoch: 39/50, iter: 600/834, loss: 0.26144, top1: 0.72411, throughput: 1326.82 | 2022-05-21 12:53:42.598 [rank:7] [train], epoch: 39/50, iter: 600/834, loss: 0.26620, top1: 0.71380, throughput: 1327.22 | 2022-05-21 12:53:42.599 [rank:2] [train], epoch: 39/50, iter: 600/834, loss: 0.26478, top1: 0.71521, throughput: 1327.03 | 2022-05-21 12:53:42.598 [rank:6] [train], epoch: 39/50, iter: 600/834, loss: 0.26816, top1: 0.71479, throughput: 1326.77 | 2022-05-21 12:53:42.599 [rank:1] [train], epoch: 39/50, iter: 600/834, loss: 0.26423, top1: 0.72052, throughput: 1326.92 | 2022-05-21 12:53:42.598 [rank:0] [train], epoch: 39/50, iter: 600/834, loss: 0.26778, top1: 0.71260, throughput: 1326.77 | 2022-05-21 12:53:42.599 [rank:3] [train], epoch: 39/50, iter: 600/834, loss: 0.26540, top1: 0.71557, throughput: 1326.86 | 2022-05-21 12:53:42.599 [rank:6] [train], epoch: 39/50, iter: 700/834, loss: 0.26387, top1: 0.72156, throughput: 1327.82 | 2022-05-21 12:53:57.059 [rank:4] [train], epoch: 39/50, iter: 700/834, loss: 0.26094, top1: 0.72719, throughput: 1327.76 | 2022-05-21 12:53:57.059 [rank:5] [train], epoch: 39/50, iter: 700/834, loss: 0.26346, top1: 0.72323, throughput: 1327.64 | 2022-05-21 12:53:57.060 [rank:7] [train], epoch: 39/50, iter: 700/834, loss: 0.26273, top1: 0.72167, throughput: 1327.70 | 2022-05-21 12:53:57.060 [rank:0] [train], epoch: 39/50, iter: 700/834, loss: 0.26581, top1: 0.71526, throughput: 1327.75 | 2022-05-21 12:53:57.060 [rank:3] [train], epoch: 39/50, iter: 700/834, loss: 0.26327, top1: 0.72354, throughput: 1327.53 | 2022-05-21 12:53:57.061 [rank:1] [train], epoch: 39/50, iter: 700/834, loss: 0.26213, top1: 0.72245, throughput: 1327.27 | 2022-05-21 12:53:57.064 [rank:2] [train], epoch: 39/50, iter: 700/834, loss: 0.26255, top1: 0.72104, throughput: 1327.26 | 2022-05-21 12:53:57.064 [rank:5] [train], epoch: 39/50, iter: 800/834, loss: 0.26260, top1: 0.72469, throughput: 1329.49 | 2022-05-21 12:54:11.502 [rank:4] [train], epoch: 39/50, iter: 800/834, loss: 0.26047, top1: 0.72693, throughput: 1329.27 | 2022-05-21 12:54:11.503 [rank:6] [train], epoch: 39/50, iter: 800/834, loss: 0.26095, top1: 0.72781, throughput: 1329.24 | 2022-05-21 12:54:11.503 [rank:1] [train], epoch: 39/50, iter: 800/834, loss: 0.26296, top1: 0.72177, throughput: 1329.64 | 2022-05-21 12:54:11.504 [rank:2] [train], epoch: 39/50, iter: 800/834, loss: 0.26365, top1: 0.71943, throughput: 1329.85 | 2022-05-21 12:54:11.502 [rank:0] [train], epoch: 39/50, iter: 800/834, loss: 0.26395, top1: 0.72229, throughput: 1329.35 | 2022-05-21 12:54:11.503 [rank:3] [train], epoch: 39/50, iter: 800/834, loss: 0.26225, top1: 0.72432, throughput: 1329.43 | 2022-05-21 12:54:11.504 [rank:7] [train], epoch: 39/50, iter: 800/834, loss: 0.26288, top1: 0.72354, throughput: 1329.41 | 2022-05-21 12:54:11.502 [rank:5] [train], epoch: 39/50, iter: 834/834, loss: 0.26602, top1: 0.71584, throughput: 1324.23 | 2022-05-21 12:54:16.431 [rank:4] [train], epoch: 39/50, iter: 834/834, loss: 0.26768, top1: 0.71293, throughput: 1324.48 | 2022-05-21 12:54:16.432 [rank:6] [train], epoch: 39/50, iter: 834/834, loss: 0.26063, top1: 0.72457, throughput: 1324.58 | 2022-05-21 12:54:16.432 [rank:3] [train], epoch: 39/50, iter: 834/834, loss: 0.26470, top1: 0.71921, throughput: 1324.69 | 2022-05-21 12:54:16.432 [rank:7] [train], epoch: 39/50, iter: 834/834, loss: 0.26153, top1: 0.72135, throughput: 1324.16 | 2022-05-21 12:54:16.432 [rank:2] [train], epoch: 39/50, iter: 834/834, loss: 0.26453, top1: 0.72580, throughput: 1324.22 | 2022-05-21 12:54:16.432 [rank:1] [train], epoch: 39/50, iter: 834/834, loss: 0.26876, top1: 0.70818, throughput: 1324.38 | 2022-05-21 12:54:16.433 [rank:0] [train], epoch: 39/50, iter: 834/834, loss: 0.26604, top1: 0.71201, throughput: 1323.87 | 2022-05-21 12:54:16.434 [rank:0] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.72864, throughput: 566.85 | 2022-05-21 12:54:27.460 [rank:4] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.72032, throughput: 566.68 | 2022-05-21 12:54:27.461 [rank:2] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.71792, throughput: 566.57 | 2022-05-21 12:54:27.463 [rank:7] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.72352, throughput: 566.55 | 2022-05-21 12:54:27.464 [rank:3] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.70864, throughput: 563.52 | 2022-05-21 12:54:27.523 [rank:6] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.72112, throughput: 562.19 | 2022-05-21 12:54:27.549 [rank:1] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.72128, throughput: 559.74 | 2022-05-21 12:54:27.599 [rank:5] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.71488, throughput: 551.36 | 2022-05-21 12:54:27.767 [rank:5] [train], epoch: 40/50, iter: 100/834, loss: 0.25719, top1: 0.73385, throughput: 1330.26 | 2022-05-21 12:54:42.200 [rank:1] [train], epoch: 40/50, iter: 100/834, loss: 0.25750, top1: 0.73266, throughput: 1314.92 | 2022-05-21 12:54:42.200 [rank:4] [train], epoch: 40/50, iter: 100/834, loss: 0.25855, top1: 0.72974, throughput: 1302.53 | 2022-05-21 12:54:42.201 [rank:0] [train], epoch: 40/50, iter: 100/834, loss: 0.25614, top1: 0.73677, throughput: 1302.52 | 2022-05-21 12:54:42.201 [rank:6] [train], epoch: 40/50, iter: 100/834, loss: 0.25471, top1: 0.73875, throughput: 1310.28 | 2022-05-21 12:54:42.202 [rank:2] [train], epoch: 40/50, iter: 100/834, loss: 0.25934, top1: 0.72698, throughput: 1302.84 | 2022-05-21 12:54:42.200 [rank:3] [train], epoch: 40/50, iter: 100/834, loss: 0.25694, top1: 0.73542, throughput: 1307.91 | 2022-05-21 12:54:42.203 [rank:7] [train], epoch: 40/50, iter: 100/834, loss: 0.25802, top1: 0.73422, throughput: 1302.76 | 2022-05-21 12:54:42.202 [rank:0] [train], epoch: 40/50, iter: 200/834, loss: 0.26014, top1: 0.72828, throughput: 1330.70 | 2022-05-21 12:54:56.629 [rank:7] [train], epoch: 40/50, iter: 200/834, loss: 0.25843, top1: 0.73219, throughput: 1330.81 | 2022-05-21 12:54:56.629 [rank:2] [train], epoch: 40/50, iter: 200/834, loss: 0.26113, top1: 0.72641, throughput: 1330.68 | 2022-05-21 12:54:56.629 [rank:6] [train], epoch: 40/50, iter: 200/834, loss: 0.25804, top1: 0.73203, throughput: 1330.81 | 2022-05-21 12:54:56.630 [rank:3] [train], epoch: 40/50, iter: 200/834, loss: 0.25833, top1: 0.73344, throughput: 1330.39 | 2022-05-21 12:54:56.634 [rank:4] [train], epoch: 40/50, iter: 200/834, loss: 0.25865, top1: 0.73240, throughput: 1330.55 | 2022-05-21 12:54:56.631 [rank:5] [train], epoch: 40/50, iter: 200/834, loss: 0.26003, top1: 0.72943, throughput: 1330.46 | 2022-05-21 12:54:56.631 [rank:1] [train], epoch: 40/50, iter: 200/834, loss: 0.25789, top1: 0.73536, throughput: 1330.42 | 2022-05-21 12:54:56.632 [rank:7] [train], epoch: 40/50, iter: 300/834, loss: 0.25998, top1: 0.72948, throughput: 1328.24 | 2022-05-21 12:55:11.084 [rank:5] [train], epoch: 40/50, iter: 300/834, loss: 0.25926, top1: 0.73047, throughput: 1328.47 | 2022-05-21 12:55:11.084 [rank:4] [train], epoch: 40/50, iter: 300/834, loss: 0.26028, top1: 0.72698, throughput: 1328.47 | 2022-05-21 12:55:11.084 [rank:1] [train], epoch: 40/50, iter: 300/834, loss: 0.25980, top1: 0.73047, throughput: 1328.51 | 2022-05-21 12:55:11.084 [rank:6] [train], epoch: 40/50, iter: 300/834, loss: 0.25762, top1: 0.73406, throughput: 1328.16 | 2022-05-21 12:55:11.086 [rank:2] [train], epoch: 40/50, iter: 300/834, loss: 0.26088, top1: 0.72307, throughput: 1328.07 | 2022-05-21 12:55:11.086 [rank:3] [train], epoch: 40/50, iter: 300/834, loss: 0.25900, top1: 0.73167, throughput: 1328.66 | 2022-05-21 12:55:11.085 [rank:0] [train], epoch: 40/50, iter: 300/834, loss: 0.25980, top1: 0.72938, throughput: 1327.92 | 2022-05-21 12:55:11.088 [rank:7] [train], epoch: 40/50, iter: 400/834, loss: 0.25781, top1: 0.73214, throughput: 1326.30 | 2022-05-21 12:55:25.561 [rank:5] [train], epoch: 40/50, iter: 400/834, loss: 0.25942, top1: 0.73010, throughput: 1326.27 | 2022-05-21 12:55:25.561 [rank:3] [train], epoch: 40/50, iter: 400/834, loss: 0.25711, top1: 0.73536, throughput: 1326.30[rank:2] [train], epoch: 40/50, iter: 400/834, loss: 0.26016, top1: 0.72672, throughput: 1326.44 | 2022-05-21 12:55:25.561 | 2022-05-21 12:55:25.561 [rank:1] [train], epoch: 40/50, iter: 400/834, loss: 0.25993, top1: 0.72948, throughput: 1326.24 | 2022-05-21 12:55:25.561 [rank:6] [train], epoch: 40/50, iter: 400/834, loss: 0.25747, top1: 0.73146, throughput: 1326.18 | 2022-05-21 12:55:25.563 [rank:0] [train], epoch: 40/50, iter: 400/834, loss: 0.25944, top1: 0.72906, throughput: 1326.41 | 2022-05-21 12:55:25.563 [rank:4] [train], epoch: 40/50, iter: 400/834, loss: 0.25900, top1: 0.73224, throughput: 1326.03 | 2022-05-21 12:55:25.563 [rank:6] [train], epoch: 40/50, iter: 500/834, loss: 0.25892, top1: 0.72995, throughput: 1326.71 | 2022-05-21 12:55:40.035 [rank:4] [train], epoch: 40/50, iter: 500/834, loss: 0.25890, top1: 0.72974, throughput: 1326.67 | 2022-05-21 12:55:40.036 [rank:0] [train], epoch: 40/50, iter: 500/834, loss: 0.25948, top1: 0.72786, throughput: 1326.61 | 2022-05-21 12:55:40.036 [rank:7] [train], epoch: 40/50, iter: 500/834, loss: 0.25946, top1: 0.72792, throughput: 1326.41 | 2022-05-21 12:55:40.036 [rank:3] [train], epoch: 40/50, iter: 500/834, loss: 0.26069, top1: 0.73245, throughput: 1326.45 | 2022-05-21 12:55:40.036 [rank:5] [train], epoch: 40/50, iter: 500/834, loss: 0.25750, top1: 0.73359, throughput: 1326.42 | 2022-05-21 12:55:40.036 [rank:2] [train], epoch: 40/50, iter: 500/834, loss: 0.26061, top1: 0.72625, throughput: 1326.41 | 2022-05-21 12:55:40.036 [rank:1] [train], epoch: 40/50, iter: 500/834, loss: 0.26014, top1: 0.72745, throughput: 1326.27 | 2022-05-21 12:55:40.038 [rank:6] [train], epoch: 40/50, iter: 600/834, loss: 0.25946, top1: 0.72823, throughput: 1326.77 | 2022-05-21 12:55:54.507 [rank:1] [train], epoch: 40/50, iter: 600/834, loss: 0.25796, top1: 0.73391, throughput: 1327.05 | 2022-05-21 12:55:54.506 [rank:5] [train], epoch: 40/50, iter: 600/834, loss: 0.26017, top1: 0.72906, throughput: 1326.87[rank:7] [train], epoch: 40/50, iter: 600/834, loss: 0.26014, top1: 0.73057, throughput: 1326.65 | 2022-05-21 12:55:54.508 | 2022-05-21 12:55:54.506 [rank:2] [train], epoch: 40/50, iter: 600/834, loss: 0.26025, top1: 0.72786, throughput: 1326.61 | 2022-05-21 12:55:54.509 [rank:0] [train], epoch: 40/50, iter: 600/834, loss: 0.25951, top1: 0.72849, throughput: 1326.83 | 2022-05-21 12:55:54.507 [rank:3] [train], epoch: 40/50, iter: 600/834, loss: 0.25735, top1: 0.73401, throughput: 1326.61 | 2022-05-21 12:55:54.509 [rank:4] [train], epoch: 40/50, iter: 600/834, loss: 0.25711, top1: 0.73344, throughput: 1326.59 | 2022-05-21 12:55:54.509 [rank:6] [train], epoch: 40/50, iter: 700/834, loss: 0.25847, top1: 0.72984, throughput: 1330.33 | 2022-05-21 12:56:08.939 [rank:5] [train], epoch: 40/50, iter: 700/834, loss: 0.25780, top1: 0.73495, throughput: 1330.25 | 2022-05-21 12:56:08.940 [rank:4] [train], epoch: 40/50, iter: 700/834, loss: 0.25878, top1: 0.72802, throughput: 1330.35 | 2022-05-21 12:56:08.941 [rank:0] [train], epoch: 40/50, iter: 700/834, loss: 0.25723, top1: 0.73255, throughput: 1330.21 | 2022-05-21 12:56:08.941 [rank:3] [train], epoch: 40/50, iter: 700/834, loss: 0.26012, top1: 0.72995, throughput: 1330.34 | 2022-05-21 12:56:08.942 [rank:7] [train], epoch: 40/50, iter: 700/834, loss: 0.25853, top1: 0.73245, throughput: 1330.32 | 2022-05-21 12:56:08.941 [rank:2] [train], epoch: 40/50, iter: 700/834, loss: 0.26204, top1: 0.72641, throughput: 1330.34 | 2022-05-21 12:56:08.941 [rank:1] [train], epoch: 40/50, iter: 700/834, loss: 0.25979, top1: 0.72839, throughput: 1330.08 | 2022-05-21 12:56:08.941 [rank:3] [train], epoch: 40/50, iter: 800/834, loss: 0.25580, top1: 0.73693, throughput: 1328.24[rank:7] [train], epoch: 40/50, iter: 800/834, loss: 0.25791, top1: 0.73021, throughput: 1328.22 | 2022-05-21 12:56:23.397| 2022-05-21 12:56:23.396 [rank:4] [train], epoch: 40/50, iter: 800/834, loss: 0.26035, top1: 0.73354, throughput: 1328.23 | 2022-05-21 12:56:23.397 [rank:0] [train], epoch: 40/50, iter: 800/834, loss: 0.25984, top1: 0.72932, throughput: 1328.12 | 2022-05-21 12:56:23.397 [rank:6] [train], epoch: 40/50, iter: 800/834, loss: 0.26243, top1: 0.72448, throughput: 1327.97 | 2022-05-21 12:56:23.397 [rank:2] [train], epoch: 40/50, iter: 800/834, loss: 0.25886, top1: 0.72891, throughput: 1328.02 | 2022-05-21 12:56:23.399 [rank:1] [train], epoch: 40/50, iter: 800/834, loss: 0.25955, top1: 0.72802, throughput: 1328.01 | 2022-05-21 12:56:23.399 [rank:5] [train], epoch: 40/50, iter: 800/834, loss: 0.25946, top1: 0.72865, throughput: 1327.86 | 2022-05-21 12:56:23.399 [rank:0] [train], epoch: 40/50, iter: 834/834, loss: 0.25825, top1: 0.72626, throughput: 1322.99 | 2022-05-21 12:56:28.331 [rank:4] [train], epoch: 40/50, iter: 834/834, loss: 0.25925, top1: 0.72702, throughput: 1322.74 | 2022-05-21 12:56:28.332 [rank:5] [train], epoch: 40/50, iter: 834/834, loss: 0.25765, top1: 0.73300, throughput: 1323.36 | 2022-05-21 12:56:28.332 [rank:7] [train], epoch: 40/50, iter: 834/834, loss: 0.25793, top1: 0.72840, throughput: 1322.65 | 2022-05-21 12:56:28.332 [rank:6] [train], epoch: 40/50, iter: 834/834, loss: 0.26117, top1: 0.72641, throughput: 1322.82 | 2022-05-21 12:56:28.332 [rank:2] [train], epoch: 40/50, iter: 834/834, loss: 0.26112, top1: 0.72595, throughput: 1322.96 | 2022-05-21 12:56:28.333 [rank:1] [train], epoch: 40/50, iter: 834/834, loss: 0.25708, top1: 0.73667, throughput: 1322.22 | 2022-05-21 12:56:28.336 [rank:3] [train], epoch: 40/50, iter: 834/834, loss: 0.26115, top1: 0.73223, throughput: 1321.53 | 2022-05-21 12:56:28.336 [rank:0] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.73264, throughput: 572.20 | 2022-05-21 12:56:39.254 [rank:7] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.72992, throughput: 571.92 | 2022-05-21 12:56:39.260 [rank:2] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.72112, throughput: 571.23 | 2022-05-21 12:56:39.275 [rank:4] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.72352, throughput: 565.03 | 2022-05-21 12:56:39.393 [rank:6] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.73104, throughput: 564.78 | 2022-05-21 12:56:39.398 [rank:3] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.72816, throughput: 564.69 | 2022-05-21 12:56:39.405 [rank:5] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.72384, throughput: 561.40 | 2022-05-21 12:56:39.465 [rank:1] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.72976, throughput: 558.04 | 2022-05-21 12:56:39.536 [rank:7] [train], epoch: 41/50, iter: 100/834, loss: 0.25349, top1: 0.74281, throughput: 1307.57 | 2022-05-21 12:56:53.944 [rank:1] [train], epoch: 41/50, iter: 100/834, loss: 0.25529, top1: 0.73719, throughput: 1332.43 | 2022-05-21 12:56:53.946 [rank:2] [train], epoch: 41/50, iter: 100/834, loss: 0.25276, top1: 0.74526, throughput: 1308.80 | 2022-05-21 12:56:53.945 [rank:0] [train], epoch: 41/50, iter: 100/834, loss: 0.25634, top1: 0.73677, throughput: 1306.73 | 2022-05-21 12:56:53.947 [rank:4] [train], epoch: 41/50, iter: 100/834, loss: 0.25355, top1: 0.74057, throughput: 1319.13 | 2022-05-21 12:56:53.948 [rank:3] [train], epoch: 41/50, iter: 100/834, loss: 0.25351, top1: 0.74531, throughput: 1320.28[rank:5] [train], epoch: 41/50, iter: 100/834, loss: 0.25557, top1: 0.73870, throughput: 1325.62 | 2022-05-21 12:56:53.947| 2022-05-21 12:56:53.948 [rank:6] [train], epoch: 41/50, iter: 100/834, loss: 0.25726, top1: 0.73021, throughput: 1319.54 | 2022-05-21 12:56:53.949 [rank:7] [train], epoch: 41/50, iter: 200/834, loss: 0.25581, top1: 0.73828, throughput: 1327.44 | 2022-05-21 12:57:08.408 [rank:4] [train], epoch: 41/50, iter: 200/834, loss: 0.25326, top1: 0.74417, throughput: 1327.80 | 2022-05-21 12:57:08.408 [rank:6] [train], epoch: 41/50, iter: 200/834, loss: 0.25172, top1: 0.74568, throughput: 1327.79 | 2022-05-21 12:57:08.409 [rank:2] [train], epoch: 41/50, iter: 200/834, loss: 0.25554, top1: 0.73917, throughput: 1327.41 | 2022-05-21 12:57:08.409 [rank:3] [train], epoch: 41/50, iter: 200/834, loss: 0.25293, top1: 0.74406, throughput: 1327.41 | 2022-05-21 12:57:08.411 [rank:0] [train], epoch: 41/50, iter: 200/834, loss: 0.25655, top1: 0.73557, throughput: 1327.44[rank:5] [train], epoch: 41/50, iter: 200/834, loss: 0.25541, top1: 0.73917, throughput: 1327.54 | 2022-05-21 12:57:08.411| 2022-05-21 12:57:08.411 [rank:1] [train], epoch: 41/50, iter: 200/834, loss: 0.25378, top1: 0.73896, throughput: 1327.37 | 2022-05-21 12:57:08.410 [rank:7] [train], epoch: 41/50, iter: 300/834, loss: 0.25491, top1: 0.74120, throughput: 1323.47 | 2022-05-21 12:57:22.915 [rank:5] [train], epoch: 41/50, iter: 300/834, loss: 0.25651, top1: 0.73516, throughput: 1323.85 | 2022-05-21 12:57:22.914 [rank:0] [train], epoch: 41/50, iter: 300/834, loss: 0.25480, top1: 0.74063, throughput: 1323.80 | 2022-05-21 12:57:22.915 [rank:4] [train], epoch: 41/50, iter: 300/834, loss: 0.25463, top1: 0.73927, throughput: 1323.48 | 2022-05-21 12:57:22.915 [rank:6] [train], epoch: 41/50, iter: 300/834, loss: 0.25462, top1: 0.73958, throughput: 1323.45 | 2022-05-21 12:57:22.917 [rank:1] [train], epoch: 41/50, iter: 300/834, loss: 0.25676, top1: 0.73505, throughput: 1323.62 | 2022-05-21 12:57:22.916 [rank:3] [train], epoch: 41/50, iter: 300/834, loss: 0.25318, top1: 0.74219, throughput: 1323.60 | 2022-05-21 12:57:22.917 [rank:2] [train], epoch: 41/50, iter: 300/834, loss: 0.25314, top1: 0.74771, throughput: 1323.40 | 2022-05-21 12:57:22.917 [rank:6] [train], epoch: 41/50, iter: 400/834, loss: 0.25651, top1: 0.73672, throughput: 1328.18 | 2022-05-21 12:57:37.372 [rank:1] [train], epoch: 41/50, iter: 400/834, loss: 0.25585, top1: 0.73891, throughput: 1328.17 | 2022-05-21 12:57:37.372 [rank:2] [train], epoch: 41/50, iter: 400/834, loss: 0.25498, top1: 0.73927, throughput: 1327.95 | 2022-05-21 12:57:37.375 [rank:3] [train], epoch: 41/50, iter: 400/834, loss: 0.25433, top1: 0.74276, throughput: 1328.01 | 2022-05-21 12:57:37.375 [rank:4] [train], epoch: 41/50, iter: 400/834, loss: 0.25395, top1: 0.74193, throughput: 1327.93 | 2022-05-21 12:57:37.374 [rank:7] [train], epoch: 41/50, iter: 400/834, loss: 0.25489, top1: 0.73995, throughput: 1327.92 | 2022-05-21 12:57:37.374 [rank:0] [train], epoch: 41/50, iter: 400/834, loss: 0.25509, top1: 0.73823, throughput: 1327.90 | 2022-05-21 12:57:37.374 [rank:5] [train], epoch: 41/50, iter: 400/834, loss: 0.25254, top1: 0.74406, throughput: 1327.76 | 2022-05-21 12:57:37.375 [rank:7] [train], epoch: 41/50, iter: 500/834, loss: 0.25518, top1: 0.73917, throughput: 1328.69 | 2022-05-21 12:57:51.824 [rank:5] [train], epoch: 41/50, iter: 500/834, loss: 0.25341, top1: 0.74318, throughput: 1328.82 | 2022-05-21 12:57:51.824 [rank:0] [train], epoch: 41/50, iter: 500/834, loss: 0.25619, top1: 0.73604, throughput: 1328.56 | 2022-05-21 12:57:51.826 [rank:3] [train], epoch: 41/50, iter: 500/834, loss: 0.25485, top1: 0.74052, throughput: 1328.87[rank:6] [train], epoch: 41/50, iter: 500/834, loss: 0.25383, top1: 0.74203, throughput: 1328.58 | 2022-05-21 12:57:51.824| 2022-05-21 12:57:51.823 [rank:4] [train], epoch: 41/50, iter: 500/834, loss: 0.25320, top1: 0.74187, throughput: 1328.60 | 2022-05-21 12:57:51.825 [rank:1] [train], epoch: 41/50, iter: 500/834, loss: 0.25466, top1: 0.73839, throughput: 1328.60 | 2022-05-21 12:57:51.823 [rank:2] [train], epoch: 41/50, iter: 500/834, loss: 0.25530, top1: 0.73786, throughput: 1328.66 | 2022-05-21 12:57:51.826 [rank:4] [train], epoch: 41/50, iter: 600/834, loss: 0.25441, top1: 0.73974, throughput: 1329.51 | 2022-05-21 12:58:06.267 [rank:2] [train], epoch: 41/50, iter: 600/834, loss: 0.25292, top1: 0.74073, throughput: 1329.76 [rank:3] [train], epoch: 41/50, iter: 600/834, loss: 0.25295, top1: 0.74182, throughput: 1329.47| 2022-05-21 12:58:06.265 | 2022-05-21 12:58:06.265 [rank:7] [train], epoch: 41/50, iter: 600/834, loss: 0.25661, top1: 0.73630, throughput: 1329.40 | 2022-05-21 12:58:06.266 [rank:5] [train], epoch: 41/50, iter: 600/834, loss: 0.25520, top1: 0.73943, throughput: 1329.40 | 2022-05-21 12:58:06.266 [rank:6] [train], epoch: 41/50, iter: 600/834, loss: 0.25415, top1: 0.74229, throughput: 1329.35 | 2022-05-21 12:58:06.267 [rank:0] [train], epoch: 41/50, iter: 600/834, loss: 0.25547, top1: 0.73906, throughput: 1329.26 | 2022-05-21 12:58:06.270 [rank:1] [train], epoch: 41/50, iter: 600/834, loss: 0.25426, top1: 0.73896, throughput: 1329.07 | 2022-05-21 12:58:06.270 [rank:3] [train], epoch: 41/50, iter: 700/834, loss: 0.25304, top1: 0.74062, throughput: 1329.03 | 2022-05-21 12:58:20.712 [rank:7] [train], epoch: 41/50, iter: 700/834, loss: 0.25470, top1: 0.74016, throughput: 1329.16 | 2022-05-21 12:58:20.712 [rank:0] [train], epoch: 41/50, iter: 700/834, loss: 0.25555, top1: 0.74083, throughput: 1329.45 | 2022-05-21 12:58:20.712 [rank:6] [train], epoch: 41/50, iter: 700/834, loss: 0.25515, top1: 0.73953, throughput: 1329.10 | 2022-05-21 12:58:20.713 [rank:2] [train], epoch: 41/50, iter: 700/834, loss: 0.25157, top1: 0.74885, throughput: 1328.97 | 2022-05-21 12:58:20.712 [rank:5] [train], epoch: 41/50, iter: 700/834, loss: 0.25302, top1: 0.74167, throughput: 1329.05 | 2022-05-21 12:58:20.713 [rank:1] [train], epoch: 41/50, iter: 700/834, loss: 0.25530, top1: 0.73906, throughput: 1329.39 | 2022-05-21 12:58:20.712 [rank:4] [train], epoch: 41/50, iter: 700/834, loss: 0.25333, top1: 0.73922, throughput: 1329.01 | 2022-05-21 12:58:20.714 [rank:1] [train], epoch: 41/50, iter: 800/834, loss: 0.25587, top1: 0.73573, throughput: 1328.47 | 2022-05-21 12:58:35.165 [rank:7] [train], epoch: 41/50, iter: 800/834, loss: 0.25486, top1: 0.74297, throughput: 1328.43 | 2022-05-21 12:58:35.165 [rank:5] [train], epoch: 41/50, iter: 800/834, loss: 0.25465, top1: 0.74021, throughput: 1328.47 | 2022-05-21 12:58:35.165 [rank:0] [train], epoch: 41/50, iter: 800/834, loss: 0.25453, top1: 0.73979, throughput: 1328.32 | 2022-05-21 12:58:35.166 [rank:6] [train], epoch: 41/50, iter: 800/834, loss: 0.25373, top1: 0.73995, throughput: 1328.44 | 2022-05-21 12:58:35.166 [rank:2] [train], epoch: 41/50, iter: 800/834, loss: 0.25364, top1: 0.73885, throughput: 1328.25 | 2022-05-21 12:58:35.167 [rank:3] [train], epoch: 41/50, iter: 800/834, loss: 0.25410, top1: 0.74042, throughput: 1328.28 | 2022-05-21 12:58:35.167 [rank:4] [train], epoch: 41/50, iter: 800/834, loss: 0.25404, top1: 0.74047, throughput: 1328.35 | 2022-05-21 12:58:35.168 [rank:0] [train], epoch: 41/50, iter: 834/834, loss: 0.25626, top1: 0.73805, throughput: 1322.09 | 2022-05-21 12:58:40.104 [rank:1] [train], epoch: 41/50, iter: 834/834, loss: 0.25257, top1: 0.74831, throughput: 1321.66 | 2022-05-21 12:58:40.104 [rank:4] [train], epoch: 41/50, iter: 834/834, loss: 0.25547, top1: 0.73637, throughput: 1322.07 | 2022-05-21 12:58:40.105 [rank:7] [train], epoch: 41/50, iter: 834/834, loss: 0.25367, top1: 0.74173, throughput: 1321.34 | 2022-05-21 12:58:40.105 [rank:5] [train], epoch: 41/50, iter: 834/834, loss: 0.25819, top1: 0.73146, throughput: 1321.42 | 2022-05-21 12:58:40.106 [rank:2] [train], epoch: 41/50, iter: 834/834, loss: 0.25257, top1: 0.73866, throughput: 1321.90 | 2022-05-21 12:58:40.105 [rank:6] [train], epoch: 41/50, iter: 834/834, loss: 0.25818, top1: 0.73545, throughput: 1321.40[rank:3] [train], epoch: 41/50, iter: 834/834, loss: 0.25801, top1: 0.73652, throughput: 1321.63 | 2022-05-21 12:58:40.106| 2022-05-21 12:58:40.106 [rank:0] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.73408, throughput: 571.92 | 2022-05-21 12:58:51.032 [rank:2] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.73328, throughput: 571.71 | 2022-05-21 12:58:51.037 [rank:7] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.74176, throughput: 571.70 | 2022-05-21 12:58:51.038 [rank:6] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.73536, throughput: 568.15 | 2022-05-21 12:58:51.107 [rank:3] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.73024, throughput: 565.25 | 2022-05-21 12:58:51.163 [rank:4] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.72736, throughput: 565.03 | 2022-05-21 12:58:51.167 [rank:1] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.73744, throughput: 557.94 | 2022-05-21 12:58:51.306 [rank:5] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.73232, throughput: 555.03 | 2022-05-21 12:58:51.366 [rank:2] [train], epoch: 42/50, iter: 100/834, loss: 0.24790, top1: 0.75625, throughput: 1302.56 | 2022-05-21 12:59:05.778 [rank:7] [train], epoch: 42/50, iter: 100/834, loss: 0.24835, top1: 0.75333, throughput: 1302.53 | 2022-05-21 12:59:05.778 [rank:1] [train], epoch: 42/50, iter: 100/834, loss: 0.25142, top1: 0.74958, throughput: 1326.69 | 2022-05-21 12:59:05.778 [rank:0] [train], epoch: 42/50, iter: 100/834, loss: 0.25219, top1: 0.74505, throughput: 1302.01 | 2022-05-21 12:59:05.778 [rank:3] [train], epoch: 42/50, iter: 100/834, loss: 0.24831, top1: 0.75141, throughput: 1313.47 | 2022-05-21 12:59:05.781 [rank:5] [train], epoch: 42/50, iter: 100/834, loss: 0.24861, top1: 0.75193, throughput: 1332.05 | 2022-05-21 12:59:05.780 [rank:4] [train], epoch: 42/50, iter: 100/834, loss: 0.25073, top1: 0.74990, throughput: 1313.56 | 2022-05-21 12:59:05.784 [rank:6] [train], epoch: 42/50, iter: 100/834, loss: 0.25202, top1: 0.74510, throughput: 1308.08 | 2022-05-21 12:59:05.785 [rank:5] [train], epoch: 42/50, iter: 200/834, loss: 0.25016, top1: 0.74891, throughput: 1329.45 | 2022-05-21 12:59:20.222 [rank:4] [train], epoch: 42/50, iter: 200/834, loss: 0.25033, top1: 0.74880, throughput: 1329.76 | 2022-05-21 12:59:20.222 [rank:6] [train], epoch: 42/50, iter: 200/834, loss: 0.25010, top1: 0.74927, throughput: 1329.73 | 2022-05-21 12:59:20.224 [rank:7] [train], epoch: 42/50, iter: 200/834, loss: 0.24904, top1: 0.75208, throughput: 1329.12 | 2022-05-21 12:59:20.224 [rank:3] [train], epoch: 42/50, iter: 200/834, loss: 0.24996, top1: 0.75182, throughput: 1329.37 | 2022-05-21 12:59:20.224 [rank:1] [train], epoch: 42/50, iter: 200/834, loss: 0.25264, top1: 0.73932, throughput: 1329.11 | 2022-05-21 12:59:20.224 [rank:2] [train], epoch: 42/50, iter: 200/834, loss: 0.24907, top1: 0.75109, throughput: 1329.07 | 2022-05-21 12:59:20.224 [rank:0] [train], epoch: 42/50, iter: 200/834, loss: 0.24826, top1: 0.75370, throughput: 1329.02 | 2022-05-21 12:59:20.225 [rank:7] [train], epoch: 42/50, iter: 300/834, loss: 0.24792, top1: 0.75521, throughput: 1329.10 | 2022-05-21 12:59:34.670 [rank:4] [train], epoch: 42/50, iter: 300/834, loss: 0.24946, top1: 0.74896, throughput: 1328.98 | 2022-05-21 12:59:34.669 [rank:5] [train], epoch: 42/50, iter: 300/834, loss: 0.25015, top1: 0.75151, throughput: 1328.84 | 2022-05-21 12:59:34.671 [rank:3] [train], epoch: 42/50, iter: 300/834, loss: 0.25188, top1: 0.74776, throughput: 1329.10 | 2022-05-21 12:59:34.669 [rank:6] [train], epoch: 42/50, iter: 300/834, loss: 0.24925, top1: 0.75203, throughput: 1329.00 | 2022-05-21 12:59:34.671 [rank:0] [train], epoch: 42/50, iter: 300/834, loss: 0.25170, top1: 0.74693, throughput: 1329.09 | 2022-05-21 12:59:34.671 [rank:2] [train], epoch: 42/50, iter: 300/834, loss: 0.25019, top1: 0.74797, throughput: 1328.93 | 2022-05-21 12:59:34.672 [rank:1] [train], epoch: 42/50, iter: 300/834, loss: 0.25120, top1: 0.74844, throughput: 1328.90 | 2022-05-21 12:59:34.672 [rank:6] [train], epoch: 42/50, iter: 400/834, loss: 0.25079, top1: 0.74823, throughput: 1329.72 | 2022-05-21 12:59:49.110 [rank:0] [train], epoch: 42/50, iter: 400/834, loss: 0.24994, top1: 0.74953, throughput: 1329.70[rank:3] [train], epoch: 42/50, iter: 400/834, loss: 0.24823, top1: 0.75271, throughput: 1329.58 | 2022-05-21 12:59:49.110 | 2022-05-21 12:59:49.110 [rank:7] [train], epoch: 42/50, iter: 400/834, loss: 0.25115, top1: 0.74583, throughput: 1329.46 | 2022-05-21 12:59:49.112 [rank:2] [train], epoch: 42/50, iter: 400/834, loss: 0.24899, top1: 0.75036, throughput: 1329.66 | 2022-05-21 12:59:49.111 [rank:4] [train], epoch: 42/50, iter: 400/834, loss: 0.25084, top1: 0.74536, throughput: 1329.26 | 2022-05-21 12:59:49.113 [rank:1] [train], epoch: 42/50, iter: 400/834, loss: 0.24902, top1: 0.75464, throughput: 1329.70 | 2022-05-21 12:59:49.111 [rank:5] [train], epoch: 42/50, iter: 400/834, loss: 0.24983, top1: 0.74937, throughput: 1329.37 | 2022-05-21 12:59:49.114 [rank:4] [train], epoch: 42/50, iter: 500/834, loss: 0.25038, top1: 0.74802, throughput: 1328.22 | 2022-05-21 13:00:03.569 [rank:6] [train], epoch: 42/50, iter: 500/834, loss: 0.24914, top1: 0.74844, throughput: 1327.85 | 2022-05-21 13:00:03.569 [rank:2] [train], epoch: 42/50, iter: 500/834, loss: 0.25157, top1: 0.74693, throughput: 1328.07 | 2022-05-21 13:00:03.569 [rank:5] [train], epoch: 42/50, iter: 500/834, loss: 0.25002, top1: 0.74984, throughput: 1328.24 | 2022-05-21 13:00:03.569 [rank:7] [train], epoch: 42/50, iter: 500/834, loss: 0.24926, top1: 0.74854, throughput: 1328.04 | 2022-05-21 13:00:03.569 [rank:0] [train], epoch: 42/50, iter: 500/834, loss: 0.25204, top1: 0.74573, throughput: 1327.74 | 2022-05-21 13:00:03.571 [rank:3] [train], epoch: 42/50, iter: 500/834, loss: 0.24932, top1: 0.75234, throughput: 1327.72 | 2022-05-21 13:00:03.571 [rank:1] [train], epoch: 42/50, iter: 500/834, loss: 0.25142, top1: 0.74526, throughput: 1327.83 | 2022-05-21 13:00:03.571 [rank:5] [train], epoch: 42/50, iter: 600/834, loss: 0.25163, top1: 0.74594, throughput: 1329.42 | 2022-05-21 13:00:18.011 [rank:4] [train], epoch: 42/50, iter: 600/834, loss: 0.24996, top1: 0.75568, throughput: 1329.28 | 2022-05-21 13:00:18.013 [rank:3] [train], epoch: 42/50, iter: 600/834, loss: 0.25189, top1: 0.74729, throughput: 1329.56 | 2022-05-21 13:00:18.012 [rank:2] [train], epoch: 42/50, iter: 600/834, loss: 0.25268, top1: 0.74344, throughput: 1329.33 | 2022-05-21 13:00:18.012 [rank:6] [train], epoch: 42/50, iter: 600/834, loss: 0.24897, top1: 0.75156, throughput: 1329.37 | 2022-05-21 13:00:18.012 [rank:7] [train], epoch: 42/50, iter: 600/834, loss: 0.25141, top1: 0.74729, throughput: 1329.19 | 2022-05-21 13:00:18.014 [rank:0] [train], epoch: 42/50, iter: 600/834, loss: 0.24932, top1: 0.74974, throughput: 1329.29 | 2022-05-21 13:00:18.015 [rank:1] [train], epoch: 42/50, iter: 600/834, loss: 0.25070, top1: 0.74563, throughput: 1329.32 | 2022-05-21 13:00:18.015 [rank:7] [train], epoch: 42/50, iter: 700/834, loss: 0.24821, top1: 0.75276, throughput: 1321.15 | 2022-05-21 13:00:32.547 [rank:4] [train], epoch: 42/50, iter: 700/834, loss: 0.25184, top1: 0.74422, throughput: 1321.02 | 2022-05-21 13:00:32.547 [rank:5] [train], epoch: 42/50, iter: 700/834, loss: 0.24894, top1: 0.75380, throughput: 1320.84 | 2022-05-21 13:00:32.548 [rank:1] [train], epoch: 42/50, iter: 700/834, loss: 0.25122, top1: 0.74630, throughput: 1321.18 | 2022-05-21 13:00:32.547 [rank:6] [train], epoch: 42/50, iter: 700/834, loss: 0.25172, top1: 0.74464, throughput: 1320.87 | 2022-05-21 13:00:32.548 [rank:2] [train], epoch: 42/50, iter: 700/834, loss: 0.25084, top1: 0.75068, throughput: 1320.94 | 2022-05-21 13:00:32.547 [rank:0] [train], epoch: 42/50, iter: 700/834, loss: 0.25144, top1: 0.74583, throughput: 1320.97 | 2022-05-21 13:00:32.549 [rank:3] [train], epoch: 42/50, iter: 700/834, loss: 0.25174, top1: 0.74333, throughput: 1320.70 | 2022-05-21 13:00:32.550 [rank:3] [train], epoch: 42/50, iter: 800/834, loss: 0.25100, top1: 0.74474, throughput: 1330.23 | 2022-05-21 13:00:46.983 [rank:7] [train], epoch: 42/50, iter: 800/834, loss: 0.24868, top1: 0.75500, throughput: 1330.12 | 2022-05-21 13:00:46.981 [rank:5] [train], epoch: 42/50, iter: 800/834, loss: 0.25165, top1: 0.74740, throughput: 1330.16 | 2022-05-21 13:00:46.982 [rank:4] [train], epoch: 42/50, iter: 800/834, loss: 0.25094, top1: 0.74766, throughput: 1330.05 | 2022-05-21 13:00:46.983 [rank:6] [train], epoch: 42/50, iter: 800/834, loss: 0.24949, top1: 0.74990, throughput: 1330.16 | 2022-05-21 13:00:46.982 [rank:1] [train], epoch: 42/50, iter: 800/834, loss: 0.25073, top1: 0.74490, throughput: 1330.05 | 2022-05-21 13:00:46.983 [rank:2] [train], epoch: 42/50, iter: 800/834, loss: 0.25249, top1: 0.74589, throughput: 1330.06 | 2022-05-21 13:00:46.982 [rank:0] [train], epoch: 42/50, iter: 800/834, loss: 0.25108, top1: 0.74349, throughput: 1330.09 | 2022-05-21 13:00:46.985 [rank:5] [train], epoch: 42/50, iter: 834/834, loss: 0.24993, top1: 0.75107, throughput: 1325.25 | 2022-05-21 13:00:51.908 [rank:4] [train], epoch: 42/50, iter: 834/834, loss: 0.25226, top1: 0.75138, throughput: 1325.31 | 2022-05-21 13:00:51.908 [rank:7] [train], epoch: 42/50, iter: 834/834, loss: 0.24738, top1: 0.75352, throughput: 1324.83 | 2022-05-21 13:00:51.909 [rank:6] [train], epoch: 42/50, iter: 834/834, loss: 0.24537, top1: 0.75812, throughput: 1325.06 | 2022-05-21 13:00:51.909 [rank:2] [train], epoch: 42/50, iter: 834/834, loss: 0.24763, top1: 0.75797, throughput: 1325.06 | 2022-05-21 13:00:51.909 [rank:1] [train], epoch: 42/50, iter: 834/834, loss: 0.24516, top1: 0.75337, throughput: 1325.16 | 2022-05-21 13:00:51.909 [rank:3] [train], epoch: 42/50, iter: 834/834, loss: 0.24826, top1: 0.76149, throughput: 1325.17 | 2022-05-21 13:00:51.909 [rank:0] [train], epoch: 42/50, iter: 834/834, loss: 0.25183, top1: 0.73866, throughput: 1325.32 | 2022-05-21 13:00:51.910 [rank:0] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.74496, throughput: 569.88 | 2022-05-21 13:01:02.877 [rank:7] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.74032, throughput: 569.17 | 2022-05-21 13:01:02.890 [rank:1] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.73776, throughput: 568.26 | 2022-05-21 13:01:02.907 [rank:6] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.73936, throughput: 564.07 | 2022-05-21 13:01:02.989 [rank:2] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.73296, throughput: 563.22 | 2022-05-21 13:01:03.006 [rank:3] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.73008, throughput: 560.59 | 2022-05-21 13:01:03.058 [rank:4] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.73152, throughput: 557.97 | 2022-05-21 13:01:03.110 [rank:5] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.73056, throughput: 549.46 | 2022-05-21 13:01:03.283 [rank:5] [train], epoch: 43/50, iter: 100/834, loss: 0.24424, top1: 0.76042, throughput: 1334.10 | 2022-05-21 13:01:17.675 [rank:2] [train], epoch: 43/50, iter: 100/834, loss: 0.24591, top1: 0.76026, throughput: 1308.93 | 2022-05-21 13:01:17.674 [rank:7] [train], epoch: 43/50, iter: 100/834, loss: 0.24507, top1: 0.75880, throughput: 1298.59 | 2022-05-21 13:01:17.675 [rank:4] [train], epoch: 43/50, iter: 100/834, loss: 0.24706, top1: 0.75781, throughput: 1318.14 | 2022-05-21 13:01:17.675 [rank:1] [train], epoch: 43/50, iter: 100/834, loss: 0.24638, top1: 0.76000, throughput: 1300.18 | 2022-05-21 13:01:17.674 [rank:6] [train], epoch: 43/50, iter: 100/834, loss: 0.24382, top1: 0.76313, throughput: 1307.30 | 2022-05-21 13:01:17.676 [rank:0] [train], epoch: 43/50, iter: 100/834, loss: 0.24784, top1: 0.75458, throughput: 1297.32 | 2022-05-21 13:01:17.677 [rank:3] [train], epoch: 43/50, iter: 100/834, loss: 0.24700, top1: 0.75823, throughput: 1313.41 | 2022-05-21 13:01:17.677 [rank:7] [train], epoch: 43/50, iter: 200/834, loss: 0.24382, top1: 0.76313, throughput: 1328.41 | 2022-05-21 13:01:32.128 [rank:1] [train], epoch: 43/50, iter: 200/834, loss: 0.24405, top1: 0.76266, throughput: 1328.30 | 2022-05-21 13:01:32.129 [rank:4] [train], epoch: 43/50, iter: 200/834, loss: 0.24540, top1: 0.76094, throughput: 1328.30 | 2022-05-21 13:01:32.130 [rank:2] [train], epoch: 43/50, iter: 200/834, loss: 0.24468, top1: 0.75917, throughput: 1328.30 | 2022-05-21 13:01:32.129 [rank:6] [train], epoch: 43/50, iter: 200/834, loss: 0.24532, top1: 0.75719, throughput: 1328.39 | 2022-05-21 13:01:32.129 [rank:5] [train], epoch: 43/50, iter: 200/834, loss: 0.24500, top1: 0.76078, throughput: 1328.21 | 2022-05-21 13:01:32.130 [rank:3] [train], epoch: 43/50, iter: 200/834, loss: 0.24660, top1: 0.75818, throughput: 1328.08 | 2022-05-21 13:01:32.134 [rank:0] [train], epoch: 43/50, iter: 200/834, loss: 0.24622, top1: 0.75646, throughput: 1328.48 | 2022-05-21 13:01:32.130 [rank:5] [train], epoch: 43/50, iter: 300/834, loss: 0.24478, top1: 0.76042, throughput: 1329.43 | 2022-05-21 13:01:46.572 [rank:0] [train], epoch: 43/50, iter: 300/834, loss: 0.24677, top1: 0.75536, throughput: 1329.37 | 2022-05-21 13:01:46.573 [rank:2] [train], epoch: 43/50, iter: 300/834, loss: 0.24606, top1: 0.75786, throughput: 1329.02 | 2022-05-21 13:01:46.575 [rank:1] [train], epoch: 43/50, iter: 300/834, loss: 0.24491, top1: 0.76391, throughput: 1329.19 | 2022-05-21 13:01:46.574 [rank:3] [train], epoch: 43/50, iter: 300/834, loss: 0.24533, top1: 0.76031, throughput: 1329.61 | 2022-05-21 13:01:46.574 [rank:7] [train], epoch: 43/50, iter: 300/834, loss: 0.24511, top1: 0.75953, throughput: 1329.06[rank:6] [train], epoch: 43/50, iter: 300/834, loss: 0.24751, top1: 0.75380, throughput: 1329.09 | 2022-05-21 13:01:46.575| 2022-05-21 13:01:46.575 [rank:4] [train], epoch: 43/50, iter: 300/834, loss: 0.24908, top1: 0.75359, throughput: 1329.11 | 2022-05-21 13:01:46.576 [rank:6] [train], epoch: 43/50, iter: 400/834, loss: 0.24580, top1: 0.76094, throughput: 1325.78 | 2022-05-21 13:02:01.058 [rank:0] [train], epoch: 43/50, iter: 400/834, loss: 0.24573, top1: 0.75911, throughput: 1325.35 | 2022-05-21 13:02:01.059 [rank:3] [train], epoch: 43/50, iter: 400/834, loss: 0.24498, top1: 0.76469, throughput: 1325.60 | 2022-05-21 13:02:01.058 [rank:5] [train], epoch: 43/50, iter: 400/834, loss: 0.24619, top1: 0.75896, throughput: 1325.38 | 2022-05-21 13:02:01.059 [rank:4] [train], epoch: 43/50, iter: 400/834, loss: 0.24196, top1: 0.76885, throughput: 1325.60 | 2022-05-21 13:02:01.060 [rank:1] [train], epoch: 43/50, iter: 400/834, loss: 0.24944, top1: 0.75151, throughput: 1325.55 | 2022-05-21 13:02:01.058 [rank:7] [train], epoch: 43/50, iter: 400/834, loss: 0.24438, top1: 0.76411, throughput: 1325.50 | 2022-05-21 13:02:01.060 [rank:2] [train], epoch: 43/50, iter: 400/834, loss: 0.24685, top1: 0.75703, throughput: 1325.50 | 2022-05-21 13:02:01.061 [rank:4] [train], epoch: 43/50, iter: 500/834, loss: 0.24591, top1: 0.75568, throughput: 1330.77 | 2022-05-21 13:02:15.488 [rank:6] [train], epoch: 43/50, iter: 500/834, loss: 0.24646, top1: 0.75786, throughput: 1330.47 | 2022-05-21 13:02:15.489 [rank:7] [train], epoch: 43/50, iter: 500/834, loss: 0.24443, top1: 0.76141, throughput: 1330.77 | 2022-05-21 13:02:15.488 [rank:5] [train], epoch: 43/50, iter: 500/834, loss: 0.24736, top1: 0.75781, throughput: 1330.63 | 2022-05-21 13:02:15.488 [rank:0] [train], epoch: 43/50, iter: 500/834, loss: 0.24388, top1: 0.76083, throughput: 1330.56 | 2022-05-21 13:02:15.489 [rank:1] [train], epoch: 43/50, iter: 500/834, loss: 0.24514, top1: 0.75818, throughput: 1330.55 | 2022-05-21 13:02:15.488 [rank:3] [train], epoch: 43/50, iter: 500/834, loss: 0.24604, top1: 0.75792, throughput: 1330.42 | 2022-05-21 13:02:15.490 [rank:2] [train], epoch: 43/50, iter: 500/834, loss: 0.24599, top1: 0.76188, throughput: 1330.64 | 2022-05-21 13:02:15.490 [rank:0] [train], epoch: 43/50, iter: 600/834, loss: 0.24465, top1: 0.76172, throughput: 1327.28 | 2022-05-21 13:02:29.955 [rank:5] [train], epoch: 43/50, iter: 600/834, loss: 0.24597, top1: 0.75578, throughput: 1327.20 | 2022-05-21 13:02:29.955 [rank:4] [train], epoch: 43/50, iter: 600/834, loss: 0.24470, top1: 0.76203, throughput: 1327.01 | 2022-05-21 13:02:29.956 [rank:7] [train], epoch: 43/50, iter: 600/834, loss: 0.24557, top1: 0.75729, throughput: 1326.75 | 2022-05-21 13:02:29.959 [rank:2] [train], epoch: 43/50, iter: 600/834, loss: 0.24828, top1: 0.75458, throughput: 1327.15 | 2022-05-21 13:02:29.957 [rank:1] [train], epoch: 43/50, iter: 600/834, loss: 0.24727, top1: 0.75557, throughput: 1327.13 | 2022-05-21 13:02:29.956 [rank:6] [train], epoch: 43/50, iter: 600/834, loss: 0.24789, top1: 0.75385, throughput: 1326.80 | 2022-05-21 13:02:29.959 [rank:3] [train], epoch: 43/50, iter: 600/834, loss: 0.24634, top1: 0.75807, throughput: 1327.09 | 2022-05-21 13:02:29.958 [rank:6] [train], epoch: 43/50, iter: 700/834, loss: 0.24553, top1: 0.75865, throughput: 1327.43 | 2022-05-21 13:02:44.424 [rank:5] [train], epoch: 43/50, iter: 700/834, loss: 0.24626, top1: 0.75719, throughput: 1327.05 | 2022-05-21 13:02:44.423 [rank:1] [train], epoch: 43/50, iter: 700/834, loss: 0.24589, top1: 0.75937, throughput: 1327.16 | 2022-05-21 13:02:44.423 [rank:3] [train], epoch: 43/50, iter: 700/834, loss: 0.24661, top1: 0.75458, throughput: 1327.14 | 2022-05-21 13:02:44.425 [rank:0] [train], epoch: 43/50, iter: 700/834, loss: 0.24545, top1: 0.75953, throughput: 1327.08 | 2022-05-21 13:02:44.423 [rank:4] [train], epoch: 43/50, iter: 700/834, loss: 0.24724, top1: 0.75510, throughput: 1327.01 | 2022-05-21 13:02:44.425 [rank:2] [train], epoch: 43/50, iter: 700/834, loss: 0.24606, top1: 0.76031, throughput: 1327.20 | 2022-05-21 13:02:44.423 [rank:7] [train], epoch: 43/50, iter: 700/834, loss: 0.24645, top1: 0.75677, throughput: 1327.30 | 2022-05-21 13:02:44.424 [rank:5] [train], epoch: 43/50, iter: 800/834, loss: 0.24672, top1: 0.75609, throughput: 1324.03 | 2022-05-21 13:02:58.924 [rank:7] [train], epoch: 43/50, iter: 800/834, loss: 0.24489, top1: 0.75719, throughput: 1324.19 | 2022-05-21 13:02:58.924 [rank:2] [train], epoch: 43/50, iter: 800/834, loss: 0.24703, top1: 0.75760, throughput: 1324.09 | 2022-05-21 13:02:58.924 [rank:6] [train], epoch: 43/50, iter: 800/834, loss: 0.24903, top1: 0.75000, throughput: 1323.94 | 2022-05-21 13:02:58.926 [rank:4] [train], epoch: 43/50, iter: 800/834, loss: 0.24734, top1: 0.75453, throughput: 1324.02 | 2022-05-21 13:02:58.926 [rank:1] [train], epoch: 43/50, iter: 800/834, loss: 0.24668, top1: 0.75792, throughput: 1323.84 | 2022-05-21 13:02:58.926 [rank:3] [train], epoch: 43/50, iter: 800/834, loss: 0.24550, top1: 0.75578, throughput: 1323.65 | 2022-05-21 13:02:58.930 [rank:0] [train], epoch: 43/50, iter: 800/834, loss: 0.24382, top1: 0.76318, throughput: 1323.52 | 2022-05-21 13:02:58.930 [rank:4] [train], epoch: 43/50, iter: 834/834, loss: 0.24786, top1: 0.75383, throughput: 1319.41 | 2022-05-21 13:03:03.874 [rank:5] [train], epoch: 43/50, iter: 834/834, loss: 0.24344, top1: 0.76057, throughput: 1318.81 | 2022-05-21 13:03:03.874 [rank:7] [train], epoch: 43/50, iter: 834/834, loss: 0.24605, top1: 0.75980, throughput: 1318.75 | 2022-05-21 13:03:03.874 [rank:1] [train], epoch: 43/50, iter: 834/834, loss: 0.24127, top1: 0.76838, throughput: 1319.25 | 2022-05-21 13:03:03.874 [rank:0] [train], epoch: 43/50, iter: 834/834, loss: 0.24526, top1: 0.75781, throughput: 1319.93 | 2022-05-21 13:03:03.875 [rank:6] [train], epoch: 43/50, iter: 834/834, loss: 0.24658, top1: 0.75980, throughput: 1318.69[rank:2] [train], epoch: 43/50, iter: 834/834, loss: 0.24574, top1: 0.76195, throughput: 1318.33 | 2022-05-21 13:03:03.876 | 2022-05-21 13:03:03.876 [rank:3] [train], epoch: 43/50, iter: 834/834, loss: 0.24684, top1: 0.75720, throughput: 1319.82 | 2022-05-21 13:03:03.876 [rank:7] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.75328, throughput: 574.54 | 2022-05-21 13:03:14.752 [rank:0] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.75056, throughput: 574.58 | 2022-05-21 13:03:14.753 [rank:2] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.73872, throughput: 574.39 | 2022-05-21 13:03:14.757 [rank:4] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.74016, throughput: 571.39 | 2022-05-21 13:03:14.812 [rank:3] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.74016, throughput: 568.15 | 2022-05-21 13:03:14.877 [rank:6] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.74624, throughput: 567.01 | 2022-05-21 13:03:14.899 [rank:1] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.75344, throughput: 559.39 | 2022-05-21 13:03:15.047 [rank:5] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.73952, throughput: 557.57 | 2022-05-21 13:03:15.083 [rank:4] [train], epoch: 44/50, iter: 100/834, loss: 0.24248, top1: 0.77057, throughput: 1297.42 | 2022-05-21 13:03:29.610 [rank:7] [train], epoch: 44/50, iter: 100/834, loss: 0.24321, top1: 0.76302, throughput: 1292.22 | 2022-05-21 13:03:29.611 [rank:6] [train], epoch: 44/50, iter: 100/834, loss: 0.24573, top1: 0.75771, throughput: 1304.94 | 2022-05-21 13:03:29.612 [rank:3] [train], epoch: 44/50, iter: 100/834, loss: 0.24330, top1: 0.76078, throughput: 1303.02 | 2022-05-21 13:03:29.612 [rank:2] [train], epoch: 44/50, iter: 100/834, loss: 0.24290, top1: 0.76448, throughput: 1292.54 | 2022-05-21 13:03:29.611 [rank:5] [train], epoch: 44/50, iter: 100/834, loss: 0.24084, top1: 0.76542, throughput: 1321.48 | 2022-05-21 13:03:29.612 [rank:0] [train], epoch: 44/50, iter: 100/834, loss: 0.24064, top1: 0.76833, throughput: 1292.22 | 2022-05-21 13:03:29.611 [rank:1] [train], epoch: 44/50, iter: 100/834, loss: 0.24309, top1: 0.76641, throughput: 1318.24 | 2022-05-21 13:03:29.612 [rank:2] [train], epoch: 44/50, iter: 200/834, loss: 0.24276, top1: 0.76599, throughput: 1329.52 | 2022-05-21 13:03:44.052 [rank:6] [train], epoch: 44/50, iter: 200/834, loss: 0.24323, top1: 0.76615, throughput: 1329.46 | 2022-05-21 13:03:44.054 [rank:3] [train], epoch: 44/50, iter: 200/834, loss: 0.24120, top1: 0.76719, throughput: 1329.38 | 2022-05-21 13:03:44.055 [rank:5] [train], epoch: 44/50, iter: 200/834, loss: 0.24118, top1: 0.76828, throughput: 1329.42 | 2022-05-21 13:03:44.055 [rank:4] [train], epoch: 44/50, iter: 200/834, loss: 0.24191, top1: 0.76724, throughput: 1329.27 | 2022-05-21 13:03:44.054 [rank:7] [train], epoch: 44/50, iter: 200/834, loss: 0.24090, top1: 0.76500, throughput: 1329.27 | 2022-05-21 13:03:44.055 [rank:0] [train], epoch: 44/50, iter: 200/834, loss: 0.24202, top1: 0.76859, throughput: 1329.21 | 2022-05-21 13:03:44.056 [rank:1] [train], epoch: 44/50, iter: 200/834, loss: 0.23932, top1: 0.77422, throughput: 1329.48 | 2022-05-21 13:03:44.054 [rank:4] [train], epoch: 44/50, iter: 300/834, loss: 0.24283, top1: 0.76609, throughput: 1327.60 | 2022-05-21 13:03:58.517 [rank:0] [train], epoch: 44/50, iter: 300/834, loss: 0.24132, top1: 0.76854, throughput: 1327.71 | 2022-05-21 13:03:58.517 [rank:5] [train], epoch: 44/50, iter: 300/834, loss: 0.24124, top1: 0.76661, throughput: 1327.66 | 2022-05-21 13:03:58.516 [rank:7] [train], epoch: 44/50, iter: 300/834, loss: 0.24368, top1: 0.76005, throughput: 1327.60 | 2022-05-21 13:03:58.517 [rank:1] [train], epoch: 44/50, iter: 300/834, loss: 0.24304, top1: 0.76745, throughput: 1327.33 | 2022-05-21 13:03:58.519 [rank:2] [train], epoch: 44/50, iter: 300/834, loss: 0.24398, top1: 0.75990, throughput: 1327.36 | 2022-05-21 13:03:58.517 [rank:6] [train], epoch: 44/50, iter: 300/834, loss: 0.24198, top1: 0.76651, throughput: 1327.31 | 2022-05-21 13:03:58.519 [rank:3] [train], epoch: 44/50, iter: 300/834, loss: 0.24309, top1: 0.76417, throughput: 1327.32 | 2022-05-21 13:03:58.520 [rank:6] [train], epoch: 44/50, iter: 400/834, loss: 0.24273, top1: 0.76792, throughput: 1327.68 | 2022-05-21 13:04:12.981 [rank:3] [train], epoch: 44/50, iter: 400/834, loss: 0.24171, top1: 0.76521, throughput: 1327.89 | 2022-05-21 13:04:12.979 [rank:7] [train], epoch: 44/50, iter: 400/834, loss: 0.24086, top1: 0.76823, throughput: 1327.62 | 2022-05-21 13:04:12.979 [rank:1] [train], epoch: 44/50, iter: 400/834, loss: 0.24393, top1: 0.75813, throughput: 1327.84 | 2022-05-21 13:04:12.979 [rank:5] [train], epoch: 44/50, iter: 400/834, loss: 0.24193, top1: 0.76792, throughput: 1327.54 | 2022-05-21 13:04:12.979 [rank:4] [train], epoch: 44/50, iter: 400/834, loss: 0.24217, top1: 0.76729, throughput: 1327.41 | 2022-05-21 13:04:12.981 [rank:0] [train], epoch: 44/50, iter: 400/834, loss: 0.24187, top1: 0.76719, throughput: 1327.43 | 2022-05-21 13:04:12.981 [rank:2] [train], epoch: 44/50, iter: 400/834, loss: 0.24165, top1: 0.76859, throughput: 1327.53 | 2022-05-21 13:04:12.980 [rank:0] [train], epoch: 44/50, iter: 500/834, loss: 0.24165, top1: 0.76740, throughput: 1327.50 | 2022-05-21 13:04:27.444 [rank:2] [train], epoch: 44/50, iter: 500/834, loss: 0.24242, top1: 0.76760, throughput: 1327.39 | 2022-05-21 13:04:27.445 [rank:5] [train], epoch: 44/50, iter: 500/834, loss: 0.24501, top1: 0.75661, throughput: 1327.39 | 2022-05-21 13:04:27.444 [rank:6] [train], epoch: 44/50, iter: 500/834, loss: 0.24278, top1: 0.76729, throughput: 1327.50 | 2022-05-21 13:04:27.444 [rank:7] [train], epoch: 44/50, iter: 500/834, loss: 0.24369, top1: 0.76333, throughput: 1327.18 | 2022-05-21 13:04:27.446 [rank:4] [train], epoch: 44/50, iter: 500/834, loss: 0.24282, top1: 0.76266, throughput: 1327.34 | 2022-05-21 13:04:27.446 [rank:1] [train], epoch: 44/50, iter: 500/834, loss: 0.24335, top1: 0.76568, throughput: 1327.23 | 2022-05-21 13:04:27.445 [rank:3] [train], epoch: 44/50, iter: 500/834, loss: 0.24051, top1: 0.76839, throughput: 1327.10 | 2022-05-21 13:04:27.446 [rank:5] [train], epoch: 44/50, iter: 600/834, loss: 0.24274, top1: 0.76578, throughput: 1327.27 | 2022-05-21 13:04:41.909 [rank:7] [train], epoch: 44/50, iter: 600/834, loss: 0.24153, top1: 0.76745, throughput: 1327.50 | 2022-05-21 13:04:41.909 [rank:2] [train], epoch: 44/50, iter: 600/834, loss: 0.24042, top1: 0.76979, throughput: 1327.45 | 2022-05-21 13:04:41.908 [rank:6] [train], epoch: 44/50, iter: 600/834, loss: 0.24161, top1: 0.76776, throughput: 1327.37 | 2022-05-21 13:04:41.909 [rank:1] [train], epoch: 44/50, iter: 600/834, loss: 0.24127, top1: 0.76917, throughput: 1327.34 | 2022-05-21 13:04:41.910 [rank:0] [train], epoch: 44/50, iter: 600/834, loss: 0.24164, top1: 0.76703, throughput: 1327.24 | 2022-05-21 13:04:41.910 [rank:3] [train], epoch: 44/50, iter: 600/834, loss: 0.24162, top1: 0.76698, throughput: 1327.48 | 2022-05-21 13:04:41.910 [rank:4] [train], epoch: 44/50, iter: 600/834, loss: 0.24370, top1: 0.76328, throughput: 1327.30 | 2022-05-21 13:04:41.911 [rank:5] [train], epoch: 44/50, iter: 700/834, loss: 0.24309, top1: 0.76557, throughput: 1326.11 | 2022-05-21 13:04:56.388 [rank:7] [train], epoch: 44/50, iter: 700/834, loss: 0.24163, top1: 0.76995, throughput: 1326.04 | 2022-05-21 13:04:56.388 [rank:3] [train], epoch: 44/50, iter: 700/834, loss: 0.24094, top1: 0.77172, throughput: 1326.13 | 2022-05-21 13:04:56.388 [rank:1] [train], epoch: 44/50, iter: 700/834, loss: 0.24307, top1: 0.76505, throughput: 1326.12 | 2022-05-21 13:04:56.388 [rank:0] [train], epoch: 44/50, iter: 700/834, loss: 0.24224, top1: 0.76536, throughput: 1326.11 | 2022-05-21 13:04:56.389 [rank:6] [train], epoch: 44/50, iter: 700/834, loss: 0.24322, top1: 0.76667, throughput: 1325.85 | 2022-05-21 13:04:56.390 [rank:2] [train], epoch: 44/50, iter: 700/834, loss: 0.24084, top1: 0.77000, throughput: 1325.79 | 2022-05-21 13:04:56.390 [rank:4] [train], epoch: 44/50, iter: 700/834, loss: 0.24345, top1: 0.76411, throughput: 1325.75 | 2022-05-21 13:04:56.394 [rank:7] [train], epoch: 44/50, iter: 800/834, loss: 0.24193, top1: 0.76870, throughput: 1329.09 | 2022-05-21 13:05:10.834 [rank:5] [train], epoch: 44/50, iter: 800/834, loss: 0.24085, top1: 0.76990, throughput: 1329.09 | 2022-05-21 13:05:10.834 [rank:4] [train], epoch: 44/50, iter: 800/834, loss: 0.24090, top1: 0.76807, throughput: 1329.69 | 2022-05-21 13:05:10.833 [rank:6] [train], epoch: 44/50, iter: 800/834, loss: 0.24263, top1: 0.76635, throughput: 1329.25 | 2022-05-21 13:05:10.834 [rank:3] [train], epoch: 44/50, iter: 800/834, loss: 0.24227, top1: 0.76328, throughput: 1329.10[rank:2] [train], epoch: 44/50, iter: 800/834, loss: 0.24167, top1: 0.76609, throughput: 1329.32 | 2022-05-21 13:05:10.834 | 2022-05-21 13:05:10.834 [rank:0] [train], epoch: 44/50, iter: 800/834, loss: 0.23901, top1: 0.77182, throughput: 1328.88 | 2022-05-21 13:05:10.837 [rank:1] [train], epoch: 44/50, iter: 800/834, loss: 0.24192, top1: 0.76333, throughput: 1329.03 | 2022-05-21 13:05:10.835 [rank:0] [train], epoch: 44/50, iter: 834/834, loss: 0.23913, top1: 0.77895, throughput: 1327.58 | 2022-05-21 13:05:15.754 [rank:4] [train], epoch: 44/50, iter: 834/834, loss: 0.23891, top1: 0.77083, throughput: 1326.52 | 2022-05-21 13:05:15.754 [rank:5] [train], epoch: 44/50, iter: 834/834, loss: 0.24203, top1: 0.75934, throughput: 1326.40 | 2022-05-21 13:05:15.755 [rank:1] [train], epoch: 44/50, iter: 834/834, loss: 0.24084, top1: 0.77099, throughput: 1326.57 | 2022-05-21 13:05:15.756 [rank:3] [train], epoch: 44/50, iter: 834/834, loss: 0.24174, top1: 0.76624, throughput: 1326.27 | 2022-05-21 13:05:15.756 [rank:7] [train], epoch: 44/50, iter: 834/834, loss: 0.24126, top1: 0.76455, throughput: 1325.92 | 2022-05-21 13:05:15.757 [rank:2] [train], epoch: 44/50, iter: 834/834, loss: 0.24155, top1: 0.76440, throughput: 1326.11 | 2022-05-21 13:05:15.756 [rank:6] [train], epoch: 44/50, iter: 834/834, loss: 0.24413, top1: 0.76654, throughput: 1325.89 | 2022-05-21 13:05:15.758 [rank:7] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.76016, throughput: 579.08 | 2022-05-21 13:05:26.550 [rank:0] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.75264, throughput: 578.91 | 2022-05-21 13:05:26.550 [rank:6] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.75248, throughput: 572.24 | 2022-05-21 13:05:26.680 [rank:2] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.73968, throughput: 571.60 | 2022-05-21 13:05:26.691 [rank:4] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.75040, throughput: 566.11 | 2022-05-21 13:05:26.795 [rank:1] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.75472, throughput: 565.68 | 2022-05-21 13:05:26.804 [rank:3] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.74752, throughput: 565.50 | 2022-05-21 13:05:26.808 [rank:5] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.74528, throughput: 556.37 | 2022-05-21 13:05:26.989 [rank:3] [train], epoch: 45/50, iter: 100/834, loss: 0.23965, top1: 0.76964, throughput: 1314.46 | 2022-05-21 13:05:41.415 [rank:5] [train], epoch: 45/50, iter: 100/834, loss: 0.23960, top1: 0.77250, throughput: 1330.69 | 2022-05-21 13:05:41.417 [rank:1] [train], epoch: 45/50, iter: 100/834, loss: 0.23872, top1: 0.77099, throughput: 1314.06 | 2022-05-21 13:05:41.416 [rank:0] [train], epoch: 45/50, iter: 100/834, loss: 0.23798, top1: 0.77219, throughput: 1291.39 | 2022-05-21 13:05:41.418 [rank:7] [train], epoch: 45/50, iter: 100/834, loss: 0.23837, top1: 0.77427, throughput: 1291.42 | 2022-05-21 13:05:41.418 [rank:6] [train], epoch: 45/50, iter: 100/834, loss: 0.24014, top1: 0.77115, throughput: 1302.67 | 2022-05-21 13:05:41.419 [rank:4] [train], epoch: 45/50, iter: 100/834, loss: 0.24141, top1: 0.76958, throughput: 1312.93 | 2022-05-21 13:05:41.418 [rank:2] [train], epoch: 45/50, iter: 100/834, loss: 0.23865, top1: 0.77224, throughput: 1303.73 | 2022-05-21 13:05:41.418 [rank:3] [train], epoch: 45/50, iter: 200/834, loss: 0.23863, top1: 0.77724, throughput: 1326.37 | 2022-05-21 13:05:55.891 [rank:7] [train], epoch: 45/50, iter: 200/834, loss: 0.23914, top1: 0.77339, throughput: 1326.79 | 2022-05-21 13:05:55.889 [rank:0] [train], epoch: 45/50, iter: 200/834, loss: 0.23990, top1: 0.77266, throughput: 1326.77 | 2022-05-21 13:05:55.889 [rank:5] [train], epoch: 45/50, iter: 200/834, loss: 0.23912, top1: 0.77432, throughput: 1326.80 | 2022-05-21 13:05:55.888 [rank:4] [train], epoch: 45/50, iter: 200/834, loss: 0.24104, top1: 0.76932, throughput: 1326.86 | 2022-05-21 13:05:55.889 [rank:6] [train], epoch: 45/50, iter: 200/834, loss: 0.23781, top1: 0.77740, throughput: 1326.75 | 2022-05-21 13:05:55.890 [rank:2] [train], epoch: 45/50, iter: 200/834, loss: 0.23949, top1: 0.77266, throughput: 1326.39 | 2022-05-21 13:05:55.893 [rank:1] [train], epoch: 45/50, iter: 200/834, loss: 0.23624, top1: 0.77708, throughput: 1326.20 | 2022-05-21 13:05:55.893 [rank:7] [train], epoch: 45/50, iter: 300/834, loss: 0.23782, top1: 0.77563, throughput: 1328.95 | 2022-05-21 13:06:10.336 [rank:6] [train], epoch: 45/50, iter: 300/834, loss: 0.23697, top1: 0.77401, throughput: 1328.98 | 2022-05-21 13:06:10.337 [rank:3] [train], epoch: 45/50, iter: 300/834, loss: 0.23864, top1: 0.77411, throughput: 1329.05 | 2022-05-21 13:06:10.337 [rank:0] [train], epoch: 45/50, iter: 300/834, loss: 0.23756, top1: 0.77562, throughput: 1328.89 | 2022-05-21 13:06:10.337 [rank:5] [train], epoch: 45/50, iter: 300/834, loss: 0.23854, top1: 0.77240, throughput: 1328.68 | 2022-05-21 13:06:10.339 [rank:1] [train], epoch: 45/50, iter: 300/834, loss: 0.23769, top1: 0.77745, throughput: 1329.15 | 2022-05-21 13:06:10.338 [rank:4] [train], epoch: 45/50, iter: 300/834, loss: 0.23862, top1: 0.77526, throughput: 1328.72 | 2022-05-21 13:06:10.339 [rank:2] [train], epoch: 45/50, iter: 300/834, loss: 0.23867, top1: 0.77594, throughput: 1329.20 | 2022-05-21 13:06:10.338 [rank:6] [train], epoch: 45/50, iter: 400/834, loss: 0.24098, top1: 0.76948, throughput: 1328.31 | 2022-05-21 13:06:24.792 [rank:4] [train], epoch: 45/50, iter: 400/834, loss: 0.23855, top1: 0.77375, throughput: 1328.36 | 2022-05-21 13:06:24.792 [rank:2] [train], epoch: 45/50, iter: 400/834, loss: 0.23934, top1: 0.77297, throughput: 1328.28 | 2022-05-21 13:06:24.792 [rank:1] [train], epoch: 45/50, iter: 400/834, loss: 0.23799, top1: 0.77479, throughput: 1328.35 | 2022-05-21 13:06:24.792 [rank:7] [train], epoch: 45/50, iter: 400/834, loss: 0.23805, top1: 0.77766, throughput: 1328.05 | 2022-05-21 13:06:24.793 [rank:5] [train], epoch: 45/50, iter: 400/834, loss: 0.23899, top1: 0.77583, throughput: 1328.23 | 2022-05-21 13:06:24.794 [rank:3] [train], epoch: 45/50, iter: 400/834, loss: 0.24015, top1: 0.76917, throughput: 1328.00[rank:0] [train], epoch: 45/50, iter: 400/834, loss: 0.24149, top1: 0.77057, throughput: 1328.05 | 2022-05-21 13:06:24.795 | 2022-05-21 13:06:24.795 [rank:7] [train], epoch: 45/50, iter: 500/834, loss: 0.24198, top1: 0.76531, throughput: 1328.29 | 2022-05-21 13:06:39.248 [rank:3] [train], epoch: 45/50, iter: 500/834, loss: 0.24094, top1: 0.76495, throughput: 1328.46 | 2022-05-21 13:06:39.248 [rank:2] [train], epoch: 45/50, iter: 500/834, loss: 0.23972, top1: 0.77474, throughput: 1328.23 | 2022-05-21 13:06:39.248 [rank:5] [train], epoch: 45/50, iter: 500/834, loss: 0.23744, top1: 0.77734, throughput: 1328.37 | 2022-05-21 13:06:39.248 [rank:4] [train], epoch: 45/50, iter: 500/834, loss: 0.23977, top1: 0.77031, throughput: 1328.12 | 2022-05-21 13:06:39.249 [rank:0] [train], epoch: 45/50, iter: 500/834, loss: 0.23815, top1: 0.77443, throughput: 1328.34 | 2022-05-21 13:06:39.249 [rank:6] [train], epoch: 45/50, iter: 500/834, loss: 0.24017, top1: 0.76917, throughput: 1327.95 | 2022-05-21 13:06:39.250 [rank:1] [train], epoch: 45/50, iter: 500/834, loss: 0.24015, top1: 0.77094, throughput: 1327.97 | 2022-05-21 13:06:39.251 [rank:6] [train], epoch: 45/50, iter: 600/834, loss: 0.24009, top1: 0.77250, throughput: 1330.73 | 2022-05-21 13:06:53.678 [rank:0] [train], epoch: 45/50, iter: 600/834, loss: 0.23827, top1: 0.77583, throughput: 1330.62 | 2022-05-21 13:06:53.678 [rank:3] [train], epoch: 45/50, iter: 600/834, loss: 0.23669, top1: 0.77729, throughput: 1330.32 | 2022-05-21 13:06:53.680 [rank:1] [train], epoch: 45/50, iter: 600/834, loss: 0.23834, top1: 0.77708, throughput: 1330.59 | 2022-05-21 13:06:53.680 [rank:7] [train], epoch: 45/50, iter: 600/834, loss: 0.23922, top1: 0.77552, throughput: 1330.43 | 2022-05-21 13:06:53.679 [rank:5] [train], epoch: 45/50, iter: 600/834, loss: 0.23717, top1: 0.77635, throughput: 1330.13 | 2022-05-21 13:06:53.683 [rank:2] [train], epoch: 45/50, iter: 600/834, loss: 0.23980, top1: 0.77109, throughput: 1330.35 | 2022-05-21 13:06:53.680 [rank:4] [train], epoch: 45/50, iter: 600/834, loss: 0.23746, top1: 0.77578, throughput: 1330.21 | 2022-05-21 13:06:53.683 [rank:5] [train], epoch: 45/50, iter: 700/834, loss: 0.23882, top1: 0.77120, throughput: 1328.64 | 2022-05-21 13:07:08.133 [rank:7] [train], epoch: 45/50, iter: 700/834, loss: 0.23856, top1: 0.77172, throughput: 1328.29 | 2022-05-21 13:07:08.134 [rank:4] [train], epoch: 45/50, iter: 700/834, loss: 0.23929, top1: 0.77255, throughput: 1328.54 | 2022-05-21 13:07:08.135 [rank:1] [train], epoch: 45/50, iter: 700/834, loss: 0.23912, top1: 0.77229, throughput: 1328.40[rank:3] [train], epoch: 45/50, iter: 700/834, loss: 0.23874, top1: 0.77240, throughput: 1328.39 | 2022-05-21 13:07:08.134 [rank:0] [train], epoch: 45/50, iter: 700/834, loss: 0.23964, top1: 0.76969, throughput: 1328.17 | 2022-05-21 13:07:08.134 | 2022-05-21 13:07:08.134 [rank:2] [train], epoch: 45/50, iter: 700/834, loss: 0.23737, top1: 0.77495, throughput: 1328.41 | 2022-05-21 13:07:08.133 [rank:6] [train], epoch: 45/50, iter: 700/834, loss: 0.23858, top1: 0.77240, throughput: 1328.11 | 2022-05-21 13:07:08.135 [rank:5] [train], epoch: 45/50, iter: 800/834, loss: 0.23832, top1: 0.77651, throughput: 1327.49 | 2022-05-21 13:07:22.597 [rank:4] [train], epoch: 45/50, iter: 800/834, loss: 0.23808, top1: 0.77333, throughput: 1327.61 | 2022-05-21 13:07:22.597 [rank:6] [train], epoch: 45/50, iter: 800/834, loss: 0.24072, top1: 0.76849, throughput: 1327.53 | 2022-05-21 13:07:22.598 [rank:0] [train], epoch: 45/50, iter: 800/834, loss: 0.24109, top1: 0.77031, throughput: 1327.35 | 2022-05-21 13:07:22.599 [rank:1] [train], epoch: 45/50, iter: 800/834, loss: 0.23806, top1: 0.77479, throughput: 1327.32 | 2022-05-21 13:07:22.599 [rank:3] [train], epoch: 45/50, iter: 800/834, loss: 0.24042, top1: 0.77010, throughput: 1327.41 | 2022-05-21 13:07:22.598 [rank:7] [train], epoch: 45/50, iter: 800/834, loss: 0.23961, top1: 0.77172, throughput: 1327.39 | 2022-05-21 13:07:22.599 [rank:2] [train], epoch: 45/50, iter: 800/834, loss: 0.23809, top1: 0.77333, throughput: 1327.30 | 2022-05-21 13:07:22.599 [rank:3] [train], epoch: 45/50, iter: 834/834, loss: 0.23743, top1: 0.77466, throughput: 1324.12 | 2022-05-21 13:07:27.528 [rank:4] [train], epoch: 45/50, iter: 834/834, loss: 0.23777, top1: 0.77987, throughput: 1323.73 | 2022-05-21 13:07:27.528 [rank:7] [train], epoch: 45/50, iter: 834/834, loss: 0.24080, top1: 0.76915, throughput: 1324.07 | 2022-05-21 13:07:27.529 [rank:1] [train], epoch: 45/50, iter: 834/834, loss: 0.24039, top1: 0.77267, throughput: 1324.16 | 2022-05-21 13:07:27.529 [rank:6] [train], epoch: 45/50, iter: 834/834, loss: 0.23787, top1: 0.77237, throughput: 1323.79 | 2022-05-21 13:07:27.529 [rank:5] [train], epoch: 45/50, iter: 834/834, loss: 0.23765, top1: 0.77405, throughput: 1323.29 | 2022-05-21 13:07:27.530 [rank:2] [train], epoch: 45/50, iter: 834/834, loss: 0.24397, top1: 0.76256, throughput: 1323.09 | 2022-05-21 13:07:27.533 [rank:0] [train], epoch: 45/50, iter: 834/834, loss: 0.23789, top1: 0.77083, throughput: 1322.96 | 2022-05-21 13:07:27.533 [rank:0] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.75664, throughput: 565.41 | 2022-05-21 13:07:38.587 [rank:4] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.75360, throughput: 565.10 | 2022-05-21 13:07:38.588 [rank:7] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.75936, throughput: 564.92 | 2022-05-21 13:07:38.592 [rank:6] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.76064, throughput: 561.31 | 2022-05-21 13:07:38.664 [rank:2] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.75056, throughput: 560.28 | 2022-05-21 13:07:38.688 [rank:3] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.75248, throughput: 556.24 | 2022-05-21 13:07:38.764 [rank:1] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.75712, throughput: 552.75 | 2022-05-21 13:07:38.836 [rank:5] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.74928, throughput: 546.19 | 2022-05-21 13:07:38.973 [rank:1] [train], epoch: 46/50, iter: 100/834, loss: 0.23545, top1: 0.78057, throughput: 1311.29 | 2022-05-21 13:07:53.478 [rank:4] [train], epoch: 46/50, iter: 100/834, loss: 0.23761, top1: 0.77807, throughput: 1289.39 | 2022-05-21 13:07:53.479 [rank:2] [train], epoch: 46/50, iter: 100/834, loss: 0.23615, top1: 0.78109, throughput: 1298.14 | 2022-05-21 13:07:53.478 [rank:6] [train], epoch: 46/50, iter: 100/834, loss: 0.23842, top1: 0.77510, throughput: 1295.96 | 2022-05-21 13:07:53.479 [rank:0] [train], epoch: 46/50, iter: 100/834, loss: 0.23503, top1: 0.78068, throughput: 1289.33 | 2022-05-21 13:07:53.479 [rank:7] [train], epoch: 46/50, iter: 100/834, loss: 0.23570, top1: 0.77880, throughput: 1289.77 | 2022-05-21 13:07:53.479 [rank:5] [train], epoch: 46/50, iter: 100/834, loss: 0.23696, top1: 0.77896, throughput: 1323.51[rank:3] [train], epoch: 46/50, iter: 100/834, loss: 0.23611, top1: 0.77719, throughput: 1304.58 | 2022-05-21 13:07:53.482 | 2022-05-21 13:07:53.480 [rank:0] [train], epoch: 46/50, iter: 200/834, loss: 0.23858, top1: 0.77219, throughput: 1330.88 | 2022-05-21 13:08:07.905 [rank:4] [train], epoch: 46/50, iter: 200/834, loss: 0.23603, top1: 0.77839, throughput: 1331.13 | 2022-05-21 13:08:07.903 [rank:1] [train], epoch: 46/50, iter: 200/834, loss: 0.23540, top1: 0.78161, throughput: 1331.00 | 2022-05-21 13:08:07.903 [rank:5] [train], epoch: 46/50, iter: 200/834, loss: 0.23754, top1: 0.77724, throughput: 1331.04 | 2022-05-21 13:08:07.905 [rank:7] [train], epoch: 46/50, iter: 200/834, loss: 0.23640, top1: 0.77880, throughput: 1330.96 | 2022-05-21 13:08:07.904 [rank:3] [train], epoch: 46/50, iter: 200/834, loss: 0.23576, top1: 0.77911, throughput: 1331.15 | 2022-05-21 13:08:07.905 [rank:6] [train], epoch: 46/50, iter: 200/834, loss: 0.23811, top1: 0.77578, throughput: 1330.90 | 2022-05-21 13:08:07.906 [rank:2] [train], epoch: 46/50, iter: 200/834, loss: 0.23591, top1: 0.78250, throughput: 1330.82 | 2022-05-21 13:08:07.905 [rank:2] [train], epoch: 46/50, iter: 300/834, loss: 0.23707, top1: 0.77547, throughput: 1328.35 | 2022-05-21 13:08:22.360 [rank:5] [train], epoch: 46/50, iter: 300/834, loss: 0.23667, top1: 0.77948, throughput: 1328.16 | 2022-05-21 13:08:22.361 [rank:4] [train], epoch: 46/50, iter: 300/834, loss: 0.23740, top1: 0.77547, throughput: 1328.00 | 2022-05-21 13:08:22.361 [rank:3] [train], epoch: 46/50, iter: 300/834, loss: 0.23449, top1: 0.78307, throughput: 1328.05 | 2022-05-21 13:08:22.362 [rank:7] [train], epoch: 46/50, iter: 300/834, loss: 0.23655, top1: 0.77885, throughput: 1327.99 | 2022-05-21 13:08:22.362 [rank:6] [train], epoch: 46/50, iter: 300/834, loss: 0.23742, top1: 0.77792, throughput: 1327.98 | 2022-05-21 13:08:22.364 [rank:0] [train], epoch: 46/50, iter: 300/834, loss: 0.23617, top1: 0.78505, throughput: 1328.02 [rank:1] [train], epoch: 46/50, iter: 300/834, loss: 0.23478, top1: 0.78177, throughput: 1327.88| 2022-05-21 13:08:22.363 | 2022-05-21 13:08:22.362 [rank:6] [train], epoch: 46/50, iter: 400/834, loss: 0.23566, top1: 0.77880, throughput: 1330.27 | 2022-05-21 13:08:36.797 [rank:7] [train], epoch: 46/50, iter: 400/834, loss: 0.23543, top1: 0.77932, throughput: 1329.99 | 2022-05-21 13:08:36.799 [rank:0] [train], epoch: 46/50, iter: 400/834, loss: 0.23654, top1: 0.78016, throughput: 1330.15[rank:5] [train], epoch: 46/50, iter: 400/834, loss: 0.23494, top1: 0.77964, throughput: 1329.91 | 2022-05-21 13:08:36.798| 2022-05-21 13:08:36.798 [rank:1] [train], epoch: 46/50, iter: 400/834, loss: 0.23528, top1: 0.78161, throughput: 1330.09 | 2022-05-21 13:08:36.798 [rank:3] [train], epoch: 46/50, iter: 400/834, loss: 0.23691, top1: 0.77859, throughput: 1329.98 | 2022-05-21 13:08:36.799 [rank:2] [train], epoch: 46/50, iter: 400/834, loss: 0.23763, top1: 0.77911, throughput: 1329.68 | 2022-05-21 13:08:36.799 [rank:4] [train], epoch: 46/50, iter: 400/834, loss: 0.23720, top1: 0.77526, throughput: 1329.80 | 2022-05-21 13:08:36.799 [rank:3] [train], epoch: 46/50, iter: 500/834, loss: 0.23435, top1: 0.78099, throughput: 1330.09 | 2022-05-21 13:08:51.234 [rank:7] [train], epoch: 46/50, iter: 500/834, loss: 0.23613, top1: 0.77677, throughput: 1330.08 | 2022-05-21 13:08:51.234 [rank:0] [train], epoch: 46/50, iter: 500/834, loss: 0.23456, top1: 0.78276, throughput: 1329.98 | 2022-05-21 13:08:51.234 [rank:1] [train], epoch: 46/50, iter: 500/834, loss: 0.23738, top1: 0.77615, throughput: 1329.92 | 2022-05-21 13:08:51.234 [rank:5] [train], epoch: 46/50, iter: 500/834, loss: 0.23680, top1: 0.77599, throughput: 1329.95 | 2022-05-21 13:08:51.234 [rank:4] [train], epoch: 46/50, iter: 500/834, loss: 0.23533, top1: 0.78432, throughput: 1329.92 | 2022-05-21 13:08:51.236 [rank:6] [train], epoch: 46/50, iter: 500/834, loss: 0.23648, top1: 0.77990, throughput: 1329.78 | 2022-05-21 13:08:51.235 [rank:2] [train], epoch: 46/50, iter: 500/834, loss: 0.23651, top1: 0.77708, throughput: 1329.96 | 2022-05-21 13:08:51.236 [rank:2] [train], epoch: 46/50, iter: 600/834, loss: 0.23498, top1: 0.78135, throughput: 1326.33 | 2022-05-21 13:09:05.712 [rank:7] [train], epoch: 46/50, iter: 600/834, loss: 0.23531, top1: 0.78286, throughput: 1326.23 | 2022-05-21 13:09:05.711 [rank:5] [train], epoch: 46/50, iter: 600/834, loss: 0.23630, top1: 0.78078, throughput: 1326.23 | 2022-05-21 13:09:05.712 [rank:6] [train], epoch: 46/50, iter: 600/834, loss: 0.23535, top1: 0.78250, throughput: 1326.23 | 2022-05-21 13:09:05.712 [rank:1] [train], epoch: 46/50, iter: 600/834, loss: 0.23714, top1: 0.77630, throughput: 1326.22 | 2022-05-21 13:09:05.712 [rank:4] [train], epoch: 46/50, iter: 600/834, loss: 0.23619, top1: 0.77771, throughput: 1326.24 | 2022-05-21 13:09:05.713 [rank:3] [train], epoch: 46/50, iter: 600/834, loss: 0.23659, top1: 0.77771, throughput: 1325.96 | 2022-05-21 13:09:05.714 [rank:0] [train], epoch: 46/50, iter: 600/834, loss: 0.23403, top1: 0.78172, throughput: 1325.94 | 2022-05-21 13:09:05.714 [rank:7] [train], epoch: 46/50, iter: 700/834, loss: 0.23629, top1: 0.77813, throughput: 1326.47 | 2022-05-21 13:09:20.185 [rank:3] [train], epoch: 46/50, iter: 700/834, loss: 0.23645, top1: 0.78005, throughput: 1326.72 | 2022-05-21 13:09:20.186 [rank:5] [train], epoch: 46/50, iter: 700/834, loss: 0.23578, top1: 0.78333, throughput: 1326.38 | 2022-05-21 13:09:20.187 [rank:0] [train], epoch: 46/50, iter: 700/834, loss: 0.23524, top1: 0.78099, throughput: 1326.78 | 2022-05-21 13:09:20.185 [rank:4] [train], epoch: 46/50, iter: 700/834, loss: 0.23620, top1: 0.78005, throughput: 1326.54 | 2022-05-21 13:09:20.187 [rank:6] [train], epoch: 46/50, iter: 700/834, loss: 0.23679, top1: 0.78083, throughput: 1326.37 | 2022-05-21 13:09:20.188 [rank:2] [train], epoch: 46/50, iter: 700/834, loss: 0.23688, top1: 0.77896, throughput: 1326.00 | 2022-05-21 13:09:20.191 [rank:1] [train], epoch: 46/50, iter: 700/834, loss: 0.23563, top1: 0.78130, throughput: 1326.04 | 2022-05-21 13:09:20.191 [rank:1] [train], epoch: 46/50, iter: 800/834, loss: 0.23609, top1: 0.77901, throughput: 1328.92 | 2022-05-21 13:09:34.639 [rank:4] [train], epoch: 46/50, iter: 800/834, loss: 0.23819, top1: 0.77568, throughput: 1328.46 | 2022-05-21 13:09:34.640 [rank:6] [train], epoch: 46/50, iter: 800/834, loss: 0.23593, top1: 0.77870, throughput: 1328.57 | 2022-05-21 13:09:34.640 [rank:7] [train], epoch: 46/50, iter: 800/834, loss: 0.23639, top1: 0.77927, throughput: 1328.37 | 2022-05-21 13:09:34.639 [rank:0] [train], epoch: 46/50, iter: 800/834, loss: 0.23787, top1: 0.77630, throughput: 1328.30 | 2022-05-21 13:09:34.640 [rank:5] [train], epoch: 46/50, iter: 800/834, loss: 0.23547, top1: 0.78078, throughput: 1328.41 | 2022-05-21 13:09:34.640 [rank:2] [train], epoch: 46/50, iter: 800/834, loss: 0.23599, top1: 0.78135, throughput: 1328.73 | 2022-05-21 13:09:34.641 [rank:3] [train], epoch: 46/50, iter: 800/834, loss: 0.23565, top1: 0.77870, throughput: 1328.18 | 2022-05-21 13:09:34.642 [rank:4] [train], epoch: 46/50, iter: 834/834, loss: 0.23706, top1: 0.78048, throughput: 1323.07 | 2022-05-21 13:09:39.573 [rank:3] [train], epoch: 46/50, iter: 834/834, loss: 0.23438, top1: 0.78110, throughput: 1323.47 | 2022-05-21 13:09:39.574 [rank:7] [train], epoch: 46/50, iter: 834/834, loss: 0.23710, top1: 0.77589, throughput: 1322.84 | 2022-05-21 13:09:39.574 [rank:5] [train], epoch: 46/50, iter: 834/834, loss: 0.23332, top1: 0.78462, throughput: 1323.15 | 2022-05-21 13:09:39.574 [rank:6] [train], epoch: 46/50, iter: 834/834, loss: 0.23802, top1: 0.77237, throughput: 1322.93 | 2022-05-21 13:09:39.574 [rank:1] [train], epoch: 46/50, iter: 834/834, loss: 0.23622, top1: 0.77834, throughput: 1322.16 | 2022-05-21 13:09:39.576 [rank:0] [train], epoch: 46/50, iter: 834/834, loss: 0.23486, top1: 0.78079, throughput: 1322.36 | 2022-05-21 13:09:39.576 [rank:2] [train], epoch: 46/50, iter: 834/834, loss: 0.23345, top1: 0.78462, throughput: 1322.70 | 2022-05-21 13:09:39.577 [rank:0] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.75984, throughput: 581.43 | 2022-05-21 13:09:50.326 [rank:7] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.76160, throughput: 575.72 | 2022-05-21 13:09:50.430 [rank:6] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.75840, throughput: 572.83 | 2022-05-21 13:09:50.485 [rank:2] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.75168, throughput: 569.96 | 2022-05-21 13:09:50.542 [rank:3] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.75520, throughput: 569.72 | 2022-05-21 13:09:50.544 [rank:5] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.75296, throughput: 567.53 | 2022-05-21 13:09:50.587 [rank:1] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.75696, throughput: 567.54 | 2022-05-21 13:09:50.589 [rank:4] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.75120, throughput: 563.04 | 2022-05-21 13:09:50.674 [rank:4] [train], epoch: 47/50, iter: 100/834, loss: 0.23452, top1: 0.78245, throughput: 1331.37 | 2022-05-21 13:10:05.095 [rank:5] [train], epoch: 47/50, iter: 100/834, loss: 0.23390, top1: 0.78411, throughput: 1323.30 | 2022-05-21 13:10:05.096 [rank:6] [train], epoch: 47/50, iter: 100/834, loss: 0.23305, top1: 0.78724, throughput: 1314.06 | 2022-05-21 13:10:05.096 [rank:0] [train], epoch: 47/50, iter: 100/834, loss: 0.23461, top1: 0.78078, throughput: 1299.78 | 2022-05-21 13:10:05.097 [rank:2] [train], epoch: 47/50, iter: 100/834, loss: 0.23478, top1: 0.78474, throughput: 1319.23 | 2022-05-21 13:10:05.096 [rank:7] [train], epoch: 47/50, iter: 100/834, loss: 0.23367, top1: 0.78359, throughput: 1309.01 | 2022-05-21 13:10:05.098 [rank:1] [train], epoch: 47/50, iter: 100/834, loss: 0.23416, top1: 0.78526, throughput: 1323.29 | 2022-05-21 13:10:05.098 [rank:3] [train], epoch: 47/50, iter: 100/834, loss: 0.23552, top1: 0.77833, throughput: 1319.23 | 2022-05-21 13:10:05.098 [rank:5] [train], epoch: 47/50, iter: 200/834, loss: 0.23527, top1: 0.77969, throughput: 1318.57 | 2022-05-21 13:10:19.657 [rank:7] [train], epoch: 47/50, iter: 200/834, loss: 0.23506, top1: 0.78380, throughput: 1318.65 | 2022-05-21 13:10:19.658 [rank:0] [train], epoch: 47/50, iter: 200/834, loss: 0.23319, top1: 0.79042, throughput: 1318.62 | 2022-05-21 13:10:19.658 [rank:6] [train], epoch: 47/50, iter: 200/834, loss: 0.23494, top1: 0.77839, throughput: 1318.46 | 2022-05-21 13:10:19.658 [rank:1] [train], epoch: 47/50, iter: 200/834, loss: 0.23410, top1: 0.78401, throughput: 1318.61 | 2022-05-21 13:10:19.659 [rank:3] [train], epoch: 47/50, iter: 200/834, loss: 0.23371, top1: 0.78688, throughput: 1318.60 | 2022-05-21 13:10:19.659 [rank:2] [train], epoch: 47/50, iter: 200/834, loss: 0.23443, top1: 0.78625, throughput: 1318.43 | 2022-05-21 13:10:19.659 [rank:4] [train], epoch: 47/50, iter: 200/834, loss: 0.23414, top1: 0.78042, throughput: 1318.21 | 2022-05-21 13:10:19.660 [rank:7] [train], epoch: 47/50, iter: 300/834, loss: 0.23533, top1: 0.78370, throughput: 1328.02 | 2022-05-21 13:10:34.115 [rank:1] [train], epoch: 47/50, iter: 300/834, loss: 0.23338, top1: 0.78625, throughput: 1328.14[rank:2] [train], epoch: 47/50, iter: 300/834, loss: 0.23435, top1: 0.78510, throughput: 1328.03 | 2022-05-21 13:10:34.116| 2022-05-21 13:10:34.115 [rank:4] [train], epoch: 47/50, iter: 300/834, loss: 0.23525, top1: 0.77979, throughput: 1328.31 | 2022-05-21 13:10:34.115 [rank:6] [train], epoch: 47/50, iter: 300/834, loss: 0.23472, top1: 0.78443, throughput: 1328.11 | 2022-05-21 13:10:34.115 [rank:0] [train], epoch: 47/50, iter: 300/834, loss: 0.23451, top1: 0.78219, throughput: 1327.97 | 2022-05-21 13:10:34.116 [rank:5] [train], epoch: 47/50, iter: 300/834, loss: 0.23408, top1: 0.78250, throughput: 1327.78 | 2022-05-21 13:10:34.117 [rank:3] [train], epoch: 47/50, iter: 300/834, loss: 0.23518, top1: 0.78297, throughput: 1327.98 | 2022-05-21 13:10:34.117 [rank:1] [train], epoch: 47/50, iter: 400/834, loss: 0.23430, top1: 0.78469, throughput: 1327.54 | 2022-05-21 13:10:48.578 [rank:3] [train], epoch: 47/50, iter: 400/834, loss: 0.23424, top1: 0.78286, throughput: 1327.48 | 2022-05-21 13:10:48.581 [rank:7] [train], epoch: 47/50, iter: 400/834, loss: 0.23343, top1: 0.78323, throughput: 1327.43 | 2022-05-21 13:10:48.580 [rank:0] [train], epoch: 47/50, iter: 400/834, loss: 0.23231, top1: 0.78391, throughput: 1327.51 | 2022-05-21 13:10:48.579 [rank:4] [train], epoch: 47/50, iter: 400/834, loss: 0.23388, top1: 0.78500, throughput: 1327.29 | 2022-05-21 13:10:48.580 [rank:6] [train], epoch: 47/50, iter: 400/834, loss: 0.23535, top1: 0.77906, throughput: 1327.31 | 2022-05-21 13:10:48.580 [rank:5] [train], epoch: 47/50, iter: 400/834, loss: 0.23411, top1: 0.78318, throughput: 1327.50 | 2022-05-21 13:10:48.581 [rank:2] [train], epoch: 47/50, iter: 400/834, loss: 0.23435, top1: 0.78359, throughput: 1327.36 | 2022-05-21 13:10:48.581 [rank:4] [train], epoch: 47/50, iter: 500/834, loss: 0.23174, top1: 0.78865, throughput: 1330.31 | 2022-05-21 13:11:03.013 [rank:1] [train], epoch: 47/50, iter: 500/834, loss: 0.23383, top1: 0.78604, throughput: 1330.05 | 2022-05-21 13:11:03.013 [rank:5] [train], epoch: 47/50, iter: 500/834, loss: 0.23303, top1: 0.78562, throughput: 1330.37 | 2022-05-21 13:11:03.013 [rank:0] [train], epoch: 47/50, iter: 500/834, loss: 0.23616, top1: 0.78214, throughput: 1330.16 | 2022-05-21 13:11:03.014 [rank:6] [train], epoch: 47/50, iter: 500/834, loss: 0.23554, top1: 0.77901, throughput: 1330.24 | 2022-05-21 13:11:03.014 [rank:3] [train], epoch: 47/50, iter: 500/834, loss: 0.23489, top1: 0.78182, throughput: 1330.30 | 2022-05-21 13:11:03.014 [rank:2] [train], epoch: 47/50, iter: 500/834, loss: 0.23127, top1: 0.78911, throughput: 1330.28 | 2022-05-21 13:11:03.014 [rank:7] [train], epoch: 47/50, iter: 500/834, loss: 0.23562, top1: 0.78172, throughput: 1329.96 | 2022-05-21 13:11:03.016 [rank:5] [train], epoch: 47/50, iter: 600/834, loss: 0.23480, top1: 0.78385, throughput: 1326.75 | 2022-05-21 13:11:17.484 [rank:7] [train], epoch: 47/50, iter: 600/834, loss: 0.23499, top1: 0.78203, throughput: 1326.97 | 2022-05-21 13:11:17.485 [rank:1] [train], epoch: 47/50, iter: 600/834, loss: 0.23417, top1: 0.78286, throughput: 1326.77 | 2022-05-21 13:11:17.485 [rank:0] [train], epoch: 47/50, iter: 600/834, loss: 0.23270, top1: 0.78781, throughput: 1326.77 | 2022-05-21 13:11:17.485 [rank:2] [train], epoch: 47/50, iter: 600/834, loss: 0.23457, top1: 0.78474, throughput: 1326.84 | 2022-05-21 13:11:17.485 [rank:3] [train], epoch: 47/50, iter: 600/834, loss: 0.23261, top1: 0.78578, throughput: 1326.71 | 2022-05-21 13:11:17.485 [rank:4] [train], epoch: 47/50, iter: 600/834, loss: 0.23355, top1: 0.78417, throughput: 1326.67 | 2022-05-21 13:11:17.485 [rank:6] [train], epoch: 47/50, iter: 600/834, loss: 0.23480, top1: 0.78177, throughput: 1326.76 | 2022-05-21 13:11:17.485 [rank:4] [train], epoch: 47/50, iter: 700/834, loss: 0.23380, top1: 0.78693, throughput: 1326.90 | 2022-05-21 13:11:31.955 [rank:5] [train], epoch: 47/50, iter: 700/834, loss: 0.23542, top1: 0.77781, throughput: 1326.84 | 2022-05-21 13:11:31.955 [rank:0] [train], epoch: 47/50, iter: 700/834, loss: 0.23300, top1: 0.78521, throughput: 1327.00 | 2022-05-21 13:11:31.954 [rank:6] [train], epoch: 47/50, iter: 700/834, loss: 0.23317, top1: 0.78698, throughput: 1326.89 | 2022-05-21 13:11:31.955 [rank:7] [train], epoch: 47/50, iter: 700/834, loss: 0.23371, top1: 0.78307, throughput: 1326.91 | 2022-05-21 13:11:31.955 [rank:1] [train], epoch: 47/50, iter: 700/834, loss: 0.23332, top1: 0.78130, throughput: 1326.86 | 2022-05-21 13:11:31.955 [rank:3] [train], epoch: 47/50, iter: 700/834, loss: 0.23416, top1: 0.78245, throughput: 1326.69[rank:2] [train], epoch: 47/50, iter: 700/834, loss: 0.23345, top1: 0.78734, throughput: 1326.65 | 2022-05-21 13:11:31.957| 2022-05-21 13:11:31.957 [rank:5] [train], epoch: 47/50, iter: 800/834, loss: 0.23622, top1: 0.77714, throughput: 1326.86 | 2022-05-21 13:11:46.425 [rank:0] [train], epoch: 47/50, iter: 800/834, loss: 0.23552, top1: 0.77974, throughput: 1326.78 | 2022-05-21 13:11:46.425 [rank:3] [train], epoch: 47/50, iter: 800/834, loss: 0.23374, top1: 0.78557, throughput: 1327.07 | 2022-05-21 13:11:46.425 [rank:6] [train], epoch: 47/50, iter: 800/834, loss: 0.23333, top1: 0.78354, throughput: 1326.77[rank:4] [train], epoch: 47/50, iter: 800/834, loss: 0.23377, top1: 0.78161, throughput: 1326.83 | 2022-05-21 13:11:46.427| 2022-05-21 13:11:46.426 [rank:2] [train], epoch: 47/50, iter: 800/834, loss: 0.23335, top1: 0.78568, throughput: 1326.92 | 2022-05-21 13:11:46.427 [rank:1] [train], epoch: 47/50, iter: 800/834, loss: 0.23447, top1: 0.78104, throughput: 1326.65 | 2022-05-21 13:11:46.427 [rank:7] [train], epoch: 47/50, iter: 800/834, loss: 0.23613, top1: 0.77901, throughput: 1326.54 | 2022-05-21 13:11:46.428 [rank:7] [train], epoch: 47/50, iter: 834/834, loss: 0.23432, top1: 0.78569, throughput: 1329.99 | 2022-05-21 13:11:51.337 [rank:2] [train], epoch: 47/50, iter: 834/834, loss: 0.23533, top1: 0.78753, throughput: 1329.61 | 2022-05-21 13:11:51.337 [rank:5] [train], epoch: 47/50, iter: 834/834, loss: 0.23346, top1: 0.78631, throughput: 1328.97 | 2022-05-21 13:11:51.337 [rank:4] [train], epoch: 47/50, iter: 834/834, loss: 0.23328, top1: 0.78401, throughput: 1329.04 | 2022-05-21 13:11:51.338 [rank:1] [train], epoch: 47/50, iter: 834/834, loss: 0.23533, top1: 0.77819, throughput: 1329.40 | 2022-05-21 13:11:51.338 [rank:6] [train], epoch: 47/50, iter: 834/834, loss: 0.23334, top1: 0.78033, throughput: 1328.98 | 2022-05-21 13:11:51.339 [rank:0] [train], epoch: 47/50, iter: 834/834, loss: 0.23396, top1: 0.78278, throughput: 1328.29 | 2022-05-21 13:11:51.339 [rank:3] [train], epoch: 47/50, iter: 834/834, loss: 0.23507, top1: 0.78140, throughput: 1328.49 | 2022-05-21 13:11:51.339 [rank:0] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.76080, throughput: 577.01 | 2022-05-21 13:12:02.171 [rank:7] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.76432, throughput: 576.37 | 2022-05-21 13:12:02.181 [rank:3] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.75616, throughput: 575.42 | 2022-05-21 13:12:02.201 [rank:2] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.75632, throughput: 574.33 | 2022-05-21 13:12:02.219 [rank:6] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.75744, throughput: 571.10 | 2022-05-21 13:12:02.282 [rank:5] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.75392, throughput: 569.47 | 2022-05-21 13:12:02.312 [rank:4] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.75328, throughput: 569.01 | 2022-05-21 13:12:02.322 [rank:1] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.76224, throughput: 557.90 | 2022-05-21 13:12:02.541 [rank:5] [train], epoch: 48/50, iter: 100/834, loss: 0.23367, top1: 0.78458, throughput: 1311.96 | 2022-05-21 13:12:16.947 [rank:3] [train], epoch: 48/50, iter: 100/834, loss: 0.23393, top1: 0.78672, throughput: 1302.05 | 2022-05-21 13:12:16.947 [rank:7] [train], epoch: 48/50, iter: 100/834, loss: 0.23196, top1: 0.78922, throughput: 1300.20 | 2022-05-21 13:12:16.948 [rank:1] [train], epoch: 48/50, iter: 100/834, loss: 0.23115, top1: 0.78896, throughput: 1332.70 | 2022-05-21 13:12:16.947 [rank:2] [train], epoch: 48/50, iter: 100/834, loss: 0.23378, top1: 0.78193, throughput: 1303.61 | 2022-05-21 13:12:16.947 [rank:0] [train], epoch: 48/50, iter: 100/834, loss: 0.23187, top1: 0.78708, throughput: 1299.34 | 2022-05-21 13:12:16.948 [rank:6] [train], epoch: 48/50, iter: 100/834, loss: 0.23360, top1: 0.78182, throughput: 1309.09 | 2022-05-21 13:12:16.949 [rank:4] [train], epoch: 48/50, iter: 100/834, loss: 0.23308, top1: 0.78734, throughput: 1312.39 | 2022-05-21 13:12:16.951 [rank:0] [train], epoch: 48/50, iter: 200/834, loss: 0.23158, top1: 0.78771, throughput: 1325.56 | 2022-05-21 13:12:31.432 [rank:1] [train], epoch: 48/50, iter: 200/834, loss: 0.23358, top1: 0.78151, throughput: 1325.51 | 2022-05-21 13:12:31.432 [rank:7] [train], epoch: 48/50, iter: 200/834, loss: 0.23325, top1: 0.78432, throughput: 1325.52 | 2022-05-21 13:12:31.432 [rank:5] [train], epoch: 48/50, iter: 200/834, loss: 0.23156, top1: 0.78870, throughput: 1325.39 | 2022-05-21 13:12:31.433 [rank:3] [train], epoch: 48/50, iter: 200/834, loss: 0.23451, top1: 0.78417, throughput: 1325.35 | 2022-05-21 13:12:31.434 [rank:2] [train], epoch: 48/50, iter: 200/834, loss: 0.23367, top1: 0.78516, throughput: 1325.39 | 2022-05-21 13:12:31.433 [rank:4] [train], epoch: 48/50, iter: 200/834, loss: 0.23465, top1: 0.78562, throughput: 1325.77 | 2022-05-21 13:12:31.434 [rank:6] [train], epoch: 48/50, iter: 200/834, loss: 0.23316, top1: 0.78490, throughput: 1325.51 | 2022-05-21 13:12:31.434 [rank:6] [train], epoch: 48/50, iter: 300/834, loss: 0.23270, top1: 0.78661, throughput: 1328.22 | 2022-05-21 13:12:45.890 [rank:2] [train], epoch: 48/50, iter: 300/834, loss: 0.23297, top1: 0.78661, throughput: 1328.09 | 2022-05-21 13:12:45.890 [rank:1] [train], epoch: 48/50, iter: 300/834, loss: 0.23202, top1: 0.79021, throughput: 1327.99 | 2022-05-21 13:12:45.890 [rank:4] [train], epoch: 48/50, iter: 300/834, loss: 0.23493, top1: 0.78448, throughput: 1327.99 | 2022-05-21 13:12:45.892 [rank:3] [train], epoch: 48/50, iter: 300/834, loss: 0.23416, top1: 0.78594, throughput: 1327.90 | 2022-05-21 13:12:45.893 [rank:5] [train], epoch: 48/50, iter: 300/834, loss: 0.23270, top1: 0.78708, throughput: 1327.95 | 2022-05-21 13:12:45.892 [rank:0] [train], epoch: 48/50, iter: 300/834, loss: 0.23040, top1: 0.79198, throughput: 1327.93 | 2022-05-21 13:12:45.891 [rank:7] [train], epoch: 48/50, iter: 300/834, loss: 0.23556, top1: 0.77953, throughput: 1327.88 | 2022-05-21 13:12:45.892 [rank:2] [train], epoch: 48/50, iter: 400/834, loss: 0.23365, top1: 0.78875, throughput: 1328.20 | 2022-05-21 13:13:00.346 [rank:0] [train], epoch: 48/50, iter: 400/834, loss: 0.23253, top1: 0.78865, throughput: 1328.22 | 2022-05-21 13:13:00.346 [rank:3] [train], epoch: 48/50, iter: 400/834, loss: 0.23149, top1: 0.78734, throughput: 1328.25 | 2022-05-21 13:13:00.348 [rank:5] [train], epoch: 48/50, iter: 400/834, loss: 0.23316, top1: 0.78406, throughput: 1328.25 | 2022-05-21 13:13:00.347 [rank:1] [train], epoch: 48/50, iter: 400/834, loss: 0.23194, top1: 0.78979, throughput: 1328.18 | 2022-05-21 13:13:00.346 [rank:6] [train], epoch: 48/50, iter: 400/834, loss: 0.23166, top1: 0.79224, throughput: 1327.84 | 2022-05-21 13:13:00.349 [rank:4] [train], epoch: 48/50, iter: 400/834, loss: 0.23170, top1: 0.79125, throughput: 1328.07 | 2022-05-21 13:13:00.349 [rank:7] [train], epoch: 48/50, iter: 400/834, loss: 0.23198, top1: 0.79047, throughput: 1328.06 | 2022-05-21 13:13:00.349 [rank:3] [train], epoch: 48/50, iter: 500/834, loss: 0.23371, top1: 0.78615, throughput: 1328.01 | 2022-05-21 13:13:14.805 [rank:6] [train], epoch: 48/50, iter: 500/834, loss: 0.23343, top1: 0.78766, throughput: 1328.17 | 2022-05-21 13:13:14.805 [rank:0] [train], epoch: 48/50, iter: 500/834, loss: 0.23450, top1: 0.78464, throughput: 1327.73 | 2022-05-21 13:13:14.807 [rank:1] [train], epoch: 48/50, iter: 500/834, loss: 0.23351, top1: 0.78583, throughput: 1327.74 | 2022-05-21 13:13:14.807 [rank:4] [train], epoch: 48/50, iter: 500/834, loss: 0.23124, top1: 0.78911, throughput: 1327.90 | 2022-05-21 13:13:14.808 [rank:5] [train], epoch: 48/50, iter: 500/834, loss: 0.23525, top1: 0.78349, throughput: 1327.49 | 2022-05-21 13:13:14.810 [rank:2] [train], epoch: 48/50, iter: 500/834, loss: 0.23363, top1: 0.78667, throughput: 1327.64 | 2022-05-21 13:13:14.808 [rank:7] [train], epoch: 48/50, iter: 500/834, loss: 0.23170, top1: 0.79167, throughput: 1327.70 | 2022-05-21 13:13:14.810 [rank:5] [train], epoch: 48/50, iter: 600/834, loss: 0.23431, top1: 0.78552, throughput: 1327.44 | 2022-05-21 13:13:29.274 [rank:1] [train], epoch: 48/50, iter: 600/834, loss: 0.23420, top1: 0.78557, throughput: 1327.18 | 2022-05-21 13:13:29.274 [rank:7] [train], epoch: 48/50, iter: 600/834, loss: 0.23280, top1: 0.78781, throughput: 1327.45 | 2022-05-21 13:13:29.274 [rank:4] [train], epoch: 48/50, iter: 600/834, loss: 0.23170, top1: 0.78771, throughput: 1327.05 | 2022-05-21 13:13:29.276 [rank:0] [train], epoch: 48/50, iter: 600/834, loss: 0.23487, top1: 0.78479, throughput: 1327.04 | 2022-05-21 13:13:29.276 [rank:6] [train], epoch: 48/50, iter: 600/834, loss: 0.23416, top1: 0.78344, throughput: 1326.86 | 2022-05-21 13:13:29.276 [rank:3] [train], epoch: 48/50, iter: 600/834, loss: 0.23362, top1: 0.78417, throughput: 1326.78 | 2022-05-21 13:13:29.277 [rank:2] [train], epoch: 48/50, iter: 600/834, loss: 0.23173, top1: 0.79052, throughput: 1327.13 | 2022-05-21 13:13:29.275 [rank:6] [train], epoch: 48/50, iter: 700/834, loss: 0.23064, top1: 0.79214, throughput: 1327.42 | 2022-05-21 13:13:43.740 [rank:0] [train], epoch: 48/50, iter: 700/834, loss: 0.23239, top1: 0.78901, throughput: 1327.41 | 2022-05-21 13:13:43.740 [rank:4] [train], epoch: 48/50, iter: 700/834, loss: 0.23159, top1: 0.78849, throughput: 1327.43 | 2022-05-21 13:13:43.740 [rank:5] [train], epoch: 48/50, iter: 700/834, loss: 0.23292, top1: 0.79010, throughput: 1327.25 | 2022-05-21 13:13:43.740 [rank:2] [train], epoch: 48/50, iter: 700/834, loss: 0.23337, top1: 0.78557, throughput: 1327.36 | 2022-05-21 13:13:43.740 [rank:7] [train], epoch: 48/50, iter: 700/834, loss: 0.23174, top1: 0.78776, throughput: 1327.11 | 2022-05-21 13:13:43.741 [rank:3] [train], epoch: 48/50, iter: 700/834, loss: 0.23439, top1: 0.78115, throughput: 1327.09 | 2022-05-21 13:13:43.744 [rank:1] [train], epoch: 48/50, iter: 700/834, loss: 0.23310, top1: 0.78651, throughput: 1326.77 | 2022-05-21 13:13:43.745 [rank:7] [train], epoch: 48/50, iter: 800/834, loss: 0.22942, top1: 0.79589, throughput: 1328.74 | 2022-05-21 13:13:58.191 [rank:4] [train], epoch: 48/50, iter: 800/834, loss: 0.23210, top1: 0.79281, throughput: 1328.57 | 2022-05-21 13:13:58.191 [rank:3] [train], epoch: 48/50, iter: 800/834, loss: 0.23190, top1: 0.79047, throughput: 1329.01 | 2022-05-21 13:13:58.191 [rank:5] [train], epoch: 48/50, iter: 800/834, loss: 0.23151, top1: 0.78859, throughput: 1328.58 | 2022-05-21 13:13:58.191 [rank:6] [train], epoch: 48/50, iter: 800/834, loss: 0.23281, top1: 0.79047, throughput: 1328.49 | 2022-05-21 13:13:58.192 [rank:0] [train], epoch: 48/50, iter: 800/834, loss: 0.23148, top1: 0.79453, throughput: 1328.47 | 2022-05-21 13:13:58.192 [rank:1] [train], epoch: 48/50, iter: 800/834, loss: 0.23349, top1: 0.78786, throughput: 1329.05 | 2022-05-21 13:13:58.191 [rank:2] [train], epoch: 48/50, iter: 800/834, loss: 0.23379, top1: 0.78583, throughput: 1328.53 | 2022-05-21 13:13:58.192 [rank:3] [train], epoch: 48/50, iter: 834/834, loss: 0.23753, top1: 0.77742, throughput: 1322.99 | 2022-05-21 13:14:03.125 [rank:6] [train], epoch: 48/50, iter: 834/834, loss: 0.22779, top1: 0.80193, throughput: 1323.31 | 2022-05-21 13:14:03.125 [rank:4] [train], epoch: 48/50, iter: 834/834, loss: 0.23584, top1: 0.78401, throughput: 1322.92 | 2022-05-21 13:14:03.126 [rank:7] [train], epoch: 48/50, iter: 834/834, loss: 0.23358, top1: 0.78554, throughput: 1322.81 | 2022-05-21 13:14:03.126 [rank:5] [train], epoch: 48/50, iter: 834/834, loss: 0.23271, top1: 0.78845, throughput: 1322.87 | 2022-05-21 13:14:03.126 [rank:0] [train], epoch: 48/50, iter: 834/834, loss: 0.22992, top1: 0.78860, throughput: 1322.96 | 2022-05-21 13:14:03.127 [rank:1] [train], epoch: 48/50, iter: 834/834, loss: 0.23046, top1: 0.78799, throughput: 1322.61 | 2022-05-21 13:14:03.127 [rank:2] [train], epoch: 48/50, iter: 834/834, loss: 0.23669, top1: 0.78294, throughput: 1322.56 | 2022-05-21 13:14:03.128 [rank:4] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.75520, throughput: 566.38 | 2022-05-21 13:14:14.161 [rank:7] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.76480, throughput: 565.70 | 2022-05-21 13:14:14.174 [rank:0] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.76240, throughput: 565.17 | 2022-05-21 13:14:14.185 [rank:2] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.75888, throughput: 563.66 | 2022-05-21 13:14:14.216 [rank:6] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.76448, throughput: 559.47 | 2022-05-21 13:14:14.297 [rank:3] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.75824, throughput: 558.95 | 2022-05-21 13:14:14.307 [rank:5] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.75360, throughput: 558.28 | 2022-05-21 13:14:14.321 [rank:1] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.76400, throughput: 549.95 | 2022-05-21 13:14:14.492 [rank:7] [train], epoch: 49/50, iter: 100/834, loss: 0.23157, top1: 0.78766, throughput: 1304.28 | 2022-05-21 13:14:28.895 [rank:1] [train], epoch: 49/50, iter: 100/834, loss: 0.23299, top1: 0.78484, throughput: 1333.06 | 2022-05-21 13:14:28.895 [rank:4] [train], epoch: 49/50, iter: 100/834, loss: 0.23186, top1: 0.78974, throughput: 1303.02 | 2022-05-21 13:14:28.896 [rank:6] [train], epoch: 49/50, iter: 100/834, loss: 0.23407, top1: 0.78432, throughput: 1315.03 | 2022-05-21 13:14:28.897 [rank:3] [train], epoch: 49/50, iter: 100/834, loss: 0.23238, top1: 0.78875, throughput: 1316.09 | 2022-05-21 13:14:28.896 [rank:0] [train], epoch: 49/50, iter: 100/834, loss: 0.23189, top1: 0.78802, throughput: 1305.14 | 2022-05-21 13:14:28.897 [rank:5] [train], epoch: 49/50, iter: 100/834, loss: 0.23269, top1: 0.78974, throughput: 1317.22 | 2022-05-21 13:14:28.897 [rank:2] [train], epoch: 49/50, iter: 100/834, loss: 0.23181, top1: 0.79141, throughput: 1307.77 | 2022-05-21 13:14:28.897 [rank:1] [train], epoch: 49/50, iter: 200/834, loss: 0.23302, top1: 0.78812, throughput: 1328.95 | 2022-05-21 13:14:43.342 [rank:3] [train], epoch: 49/50, iter: 200/834, loss: 0.23124, top1: 0.79125, throughput: 1329.02 | 2022-05-21 13:14:43.343 [rank:4] [train], epoch: 49/50, iter: 200/834, loss: 0.23063, top1: 0.79198, throughput: 1329.01 | 2022-05-21 13:14:43.343 [rank:5] [train], epoch: 49/50, iter: 200/834, loss: 0.23161, top1: 0.79047, throughput: 1329.12 | 2022-05-21 13:14:43.343 [rank:2] [train], epoch: 49/50, iter: 200/834, loss: 0.23273, top1: 0.78786, throughput: 1329.19 | 2022-05-21 13:14:43.342 [rank:6] [train], epoch: 49/50, iter: 200/834, loss: 0.23274, top1: 0.78557, throughput: 1329.08 | 2022-05-21 13:14:43.343 [rank:0] [train], epoch: 49/50, iter: 200/834, loss: 0.23253, top1: 0.78682, throughput: 1329.08 | 2022-05-21 13:14:43.343 [rank:7] [train], epoch: 49/50, iter: 200/834, loss: 0.23213, top1: 0.78995, throughput: 1328.76 | 2022-05-21 13:14:43.345 [rank:2] [train], epoch: 49/50, iter: 300/834, loss: 0.23249, top1: 0.78807, throughput: 1329.60 | 2022-05-21 13:14:57.783 [rank:3] [train], epoch: 49/50, iter: 300/834, loss: 0.23059, top1: 0.79229, throughput: 1329.59 | 2022-05-21 13:14:57.783 [rank:1] [train], epoch: 49/50, iter: 300/834, loss: 0.23453, top1: 0.78469, throughput: 1329.58 | 2022-05-21 13:14:57.783 [rank:7] [train], epoch: 49/50, iter: 300/834, loss: 0.23067, top1: 0.79141, throughput: 1329.76 | 2022-05-21 13:14:57.783 [rank:0] [train], epoch: 49/50, iter: 300/834, loss: 0.23290, top1: 0.78609, throughput: 1329.63 | 2022-05-21 13:14:57.783 [rank:4] [train], epoch: 49/50, iter: 300/834, loss: 0.23236, top1: 0.78734, throughput: 1329.56 | 2022-05-21 13:14:57.784 [rank:5] [train], epoch: 49/50, iter: 300/834, loss: 0.23219, top1: 0.78839, throughput: 1329.56 | 2022-05-21 13:14:57.784 [rank:6] [train], epoch: 49/50, iter: 300/834, loss: 0.23403, top1: 0.78203, throughput: 1329.42 | 2022-05-21 13:14:57.785 [rank:5] [train], epoch: 49/50, iter: 400/834, loss: 0.23212, top1: 0.79026, throughput: 1329.05 | 2022-05-21 13:15:12.230 [rank:7] [train], epoch: 49/50, iter: 400/834, loss: 0.23217, top1: 0.78411, throughput: 1328.91 | 2022-05-21 13:15:12.231 [rank:6] [train], epoch: 49/50, iter: 400/834, loss: 0.23191, top1: 0.78964, throughput: 1328.98 | 2022-05-21 13:15:12.233 [rank:4] [train], epoch: 49/50, iter: 400/834, loss: 0.23272, top1: 0.78604, throughput: 1328.86 | 2022-05-21 13:15:12.232 [rank:1] [train], epoch: 49/50, iter: 400/834, loss: 0.23303, top1: 0.78526, throughput: 1328.70 | 2022-05-21 13:15:12.233 [rank:3] [train], epoch: 49/50, iter: 400/834, loss: 0.23322, top1: 0.78448, throughput: 1328.73 | 2022-05-21 13:15:12.233 [rank:0] [train], epoch: 49/50, iter: 400/834, loss: 0.23104, top1: 0.79094, throughput: 1328.61 | 2022-05-21 13:15:12.234 [rank:2] [train], epoch: 49/50, iter: 400/834, loss: 0.23303, top1: 0.79036, throughput: 1328.69 | 2022-05-21 13:15:12.233 [rank:2] [train], epoch: 49/50, iter: 500/834, loss: 0.23382, top1: 0.78464, throughput: 1325.80 | 2022-05-21 13:15:26.715 [rank:3] [train], epoch: 49/50, iter: 500/834, loss: 0.23124, top1: 0.79016, throughput: 1325.96 | 2022-05-21 13:15:26.713 [rank:7] [train], epoch: 49/50, iter: 500/834, loss: 0.23435, top1: 0.78745, throughput: 1325.73 | 2022-05-21 13:15:26.714 [rank:6] [train], epoch: 49/50, iter: 500/834, loss: 0.23229, top1: 0.79083, throughput: 1325.85 | 2022-05-21 13:15:26.714 [rank:5] [train], epoch: 49/50, iter: 500/834, loss: 0.23511, top1: 0.78391, throughput: 1325.64 | 2022-05-21 13:15:26.714 [rank:1] [train], epoch: 49/50, iter: 500/834, loss: 0.23199, top1: 0.78703, throughput: 1325.89 | 2022-05-21 13:15:26.714 [rank:4] [train], epoch: 49/50, iter: 500/834, loss: 0.23453, top1: 0.78266, throughput: 1325.66 | 2022-05-21 13:15:26.716 [rank:0] [train], epoch: 49/50, iter: 500/834, loss: 0.23138, top1: 0.79583, throughput: 1325.80 | 2022-05-21 13:15:26.716 [rank:5] [train], epoch: 49/50, iter: 600/834, loss: 0.22985, top1: 0.79432, throughput: 1328.59 | 2022-05-21 13:15:41.165 [rank:0] [train], epoch: 49/50, iter: 600/834, loss: 0.23157, top1: 0.79036, throughput: 1328.70 | 2022-05-21 13:15:41.166 [rank:7] [train], epoch: 49/50, iter: 600/834, loss: 0.23196, top1: 0.78896, throughput: 1328.64 | 2022-05-21 13:15:41.165 [rank:3] [train], epoch: 49/50, iter: 600/834, loss: 0.23348, top1: 0.78693, throughput: 1328.45 | 2022-05-21 13:15:41.166 [rank:6] [train], epoch: 49/50, iter: 600/834, loss: 0.23159, top1: 0.78938, throughput: 1328.49 | 2022-05-21 13:15:41.167 [rank:1] [train], epoch: 49/50, iter: 600/834, loss: 0.23353, top1: 0.78672, throughput: 1328.38 | 2022-05-21 13:15:41.168 [rank:4] [train], epoch: 49/50, iter: 600/834, loss: 0.23168, top1: 0.78880, throughput: 1328.67 | 2022-05-21 13:15:41.166 [rank:2] [train], epoch: 49/50, iter: 600/834, loss: 0.23104, top1: 0.79109, throughput: 1328.46 | 2022-05-21 13:15:41.168 [rank:2] [train], epoch: 49/50, iter: 700/834, loss: 0.23214, top1: 0.78995, throughput: 1328.41 | 2022-05-21 13:15:55.621 [rank:6] [train], epoch: 49/50, iter: 700/834, loss: 0.23146, top1: 0.78984, throughput: 1328.21 | 2022-05-21 13:15:55.622 [rank:4] [train], epoch: 49/50, iter: 700/834, loss: 0.23320, top1: 0.78594, throughput: 1328.23 | 2022-05-21 13:15:55.621 [rank:0] [train], epoch: 49/50, iter: 700/834, loss: 0.23162, top1: 0.79141, throughput: 1328.25 | 2022-05-21 13:15:55.621 [rank:7] [train], epoch: 49/50, iter: 700/834, loss: 0.23437, top1: 0.78333, throughput: 1328.15 | 2022-05-21 13:15:55.621 [rank:1] [train], epoch: 49/50, iter: 700/834, loss: 0.23277, top1: 0.78760, throughput: 1328.30 | 2022-05-21 13:15:55.622 [rank:3] [train], epoch: 49/50, iter: 700/834, loss: 0.23244, top1: 0.78583, throughput: 1328.26 | 2022-05-21 13:15:55.621 [rank:5] [train], epoch: 49/50, iter: 700/834, loss: 0.23165, top1: 0.78849, throughput: 1328.07 | 2022-05-21 13:15:55.622 [rank:7] [train], epoch: 49/50, iter: 800/834, loss: 0.23348, top1: 0.78318, throughput: 1328.58 | 2022-05-21 13:16:10.072 [rank:4] [train], epoch: 49/50, iter: 800/834, loss: 0.23388, top1: 0.78406, throughput: 1328.57 | 2022-05-21 13:16:10.073 [rank:2] [train], epoch: 49/50, iter: 800/834, loss: 0.23270, top1: 0.78646, throughput: 1328.58 | 2022-05-21 13:16:10.073 [rank:0] [train], epoch: 49/50, iter: 800/834, loss: 0.23076, top1: 0.79458, throughput: 1328.36 | 2022-05-21 13:16:10.075 [rank:6] [train], epoch: 49/50, iter: 800/834, loss: 0.23249, top1: 0.78542, throughput: 1328.39 | 2022-05-21 13:16:10.076 [rank:1] [train], epoch: 49/50, iter: 800/834, loss: 0.23186, top1: 0.78891, throughput: 1328.45 | 2022-05-21 13:16:10.075 [rank:3] [train], epoch: 49/50, iter: 800/834, loss: 0.23261, top1: 0.78583, throughput: 1328.38 | 2022-05-21 13:16:10.075 [rank:5] [train], epoch: 49/50, iter: 800/834, loss: 0.23368, top1: 0.78568, throughput: 1328.42 | 2022-05-21 13:16:10.076 [rank:4] [train], epoch: 49/50, iter: 834/834, loss: 0.23346, top1: 0.78646, throughput: 1326.08 | 2022-05-21 13:16:14.996 [rank:7] [train], epoch: 49/50, iter: 834/834, loss: 0.22943, top1: 0.79274, throughput: 1325.78 | 2022-05-21 13:16:14.996 [rank:2] [train], epoch: 49/50, iter: 834/834, loss: 0.23068, top1: 0.79151, throughput: 1325.79 | 2022-05-21 13:16:14.996 [rank:6] [train], epoch: 49/50, iter: 834/834, loss: 0.23243, top1: 0.79243, throughput: 1326.46 | 2022-05-21 13:16:14.997 [rank:5] [train], epoch: 49/50, iter: 834/834, loss: 0.23327, top1: 0.78707, throughput: 1326.18 | 2022-05-21 13:16:14.998 [rank:0] [train], epoch: 49/50, iter: 834/834, loss: 0.23412, top1: 0.78217, throughput: 1325.75 | 2022-05-21 13:16:14.999 [rank:1] [train], epoch: 49/50, iter: 834/834, loss: 0.22795, top1: 0.79779, throughput: 1325.13 | 2022-05-21 13:16:15.001 [rank:3] [train], epoch: 49/50, iter: 834/834, loss: 0.23131, top1: 0.79228, throughput: 1325.00 | 2022-05-21 13:16:15.001 [rank:0] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.76064, throughput: 567.44 | 2022-05-21 13:16:26.013 [rank:7] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.76320, throughput: 567.11 | 2022-05-21 13:16:26.017 [rank:4] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.75664, throughput: 563.36 | 2022-05-21 13:16:26.090 [rank:2] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.75952, throughput: 561.27 | 2022-05-21 13:16:26.132 [rank:6] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.76448, throughput: 557.49 | 2022-05-21 13:16:26.208 [rank:3] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.76000, throughput: 557.49 | 2022-05-21 13:16:26.212 [rank:1] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.76304, throughput: 549.64 | 2022-05-21 13:16:26.372 [rank:5] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.75248, throughput: 549.10 | 2022-05-21 13:16:26.380