loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 ***************************************** Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed. ***************************************** loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 loaded library: /usr/lib/x86_64-linux-gnu/libibverbs.so.1 W20220410 22:59:43.587877 406 rpc_client.cpp:190] LoadServer 10.7.222.219 Failed at 0 times error_code 14 error_message failed to connect to all addresses W20220410 22:59:43.592027 405 rpc_client.cpp:190] LoadServer 10.7.222.219 Failed at 0 times error_code 14 error_message failed to connect to all addresses W20220410 22:59:43.594254 407 rpc_client.cpp:190] LoadServer 10.7.222.219 Failed at 0 times error_code 14 error_message failed to connect to all addresses W20220410 22:59:43.598204 404 rpc_client.cpp:190] LoadServer 10.7.222.219 Failed at 0 times error_code 14 error_message failed to connect to all addresses W20220410 22:59:43.598415 403 rpc_client.cpp:190] LoadServer 10.7.222.219 Failed at 0 times error_code 14 error_message failed to connect to all addresses ------------------------ arguments ------------------------ batches_per_epoch ............................... 834 channel_last .................................... True ddp ............................................. False exit_num ........................................ -1 fuse_bn_add_relu ................................ True fuse_bn_relu ....................................
True gpu_stat_file ................................... None grad_clipping ................................... 0.0 graph ........................................... True label_smoothing ................................. 0.1 learning_rate ................................... 1.536 legacy_init ..................................... False load_path ....................................... None lr_decay_type ................................... cosine metric_local .................................... True metric_train_acc ................................ True momentum ........................................ 0.875 nccl_fusion_max_ops ............................. 24 nccl_fusion_threshold_mb ........................ 16 num_classes ..................................... 1000 num_devices_per_node ............................ 8 num_epochs ...................................... 50 num_nodes ....................................... 1 ofrecord_part_num ............................... 256 ofrecord_path ................................... /dataset/79846248 print_interval .................................. 100 print_timestamp ................................. False samples_per_epoch ............................... 1281167 save_init ....................................... False save_path ....................................... None scale_grad ...................................... True skip_eval ....................................... False synthetic_data .................................. False total_batches ................................... -1 train_batch_size ................................ 192 train_global_batch_size ......................... 1536 use_fp16 ........................................ True use_gpu_decode .................................. True val_batch_size .................................. 50 val_batches_per_epoch ........................... 125 val_global_batch_size ........................... 400 val_samples_per_epoch ........................... 
50000 warmup_epochs ................................... 5 weight_decay .................................... 3.0517578125e-05 zero_init_residual .............................. True -------------------- end of arguments --------------------- ***** Model Init ***** ***** Model Init Finish, time escapled: 2.81515 s ***** [rank:5] [train], epoch: 0/50, iter: 100/834, loss: 0.86128, top1: 0.00318, throughput: 286.04 | 2022-04-10 23:01:06.049 [rank:4] [train], epoch: 0/50, iter: 100/834, loss: 0.86082, top1: 0.00286, throughput: 286.08 | 2022-04-10 23:01:06.049 [rank:6] [train], epoch: 0/50, iter: 100/834, loss: 0.86118, top1: 0.00307, throughput: 286.05 | 2022-04-10 23:01:06.049 [rank:1] [train], epoch: 0/50, iter: 100/834, loss: 0.86113, top1: 0.00328, throughput: 286.05 | 2022-04-10 23:01:06.050 [rank:7] [train], epoch: 0/50, iter: 100/834, loss: 0.86105, top1: 0.00312, throughput: 286.03 | 2022-04-10 23:01:06.050 [rank:0] [train], epoch: 0/50, iter: 100/834, loss: 0.86143, top1: 0.00323, throughput: 286.01 | 2022-04-10 23:01:06.056 [rank:3] [train], epoch: 0/50, iter: 100/834, loss: 0.86133, top1: 0.00292, throughput: 286.06 | 2022-04-10 23:01:06.051 [rank:2] [train], epoch: 0/50, iter: 100/834, loss: 0.86109, top1: 0.00354, throughput: 286.01 | 2022-04-10 23:01:06.055 timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] 2022/04/10 23:01:06.418, Tesla V100-SXM2-32GB, 470.57.02, 64 %, 40 %, 32510 MiB, 21163 MiB, 11347 MiB timestamp, name, driver_version, utilization.gpu [%], 
utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] 2022/04/10 23:01:06.419, Tesla V100-SXM2-32GB, 470.57.02, 64 %, 40 %, 32510 MiB, 21163 MiB, 11347 MiB timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] 2022/04/10 23:01:06.422, Tesla V100-SXM2-32GB, 470.57.02, 64 %, 40 %, 32510 MiB, 21163 MiB, 11347 MiB timestamp, name, driver_version, utilization.gpu [%], utilization.memory [%], memory.total [MiB], memory.free [MiB], memory.used [MiB] 2022/04/10 23:01:06.424, Tesla V100-SXM2-32GB, 470.57.02, 64 %, 40 %, 32510 MiB, 21163 MiB, 11347 MiB 2022/04/10 23:01:06.425, Tesla V100-SXM2-32GB, 470.57.02, 62 %, 40 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/04/10 23:01:06.426, Tesla V100-SXM2-32GB, 470.57.02, 64 %, 40 %, 32510 MiB, 21163 MiB, 11347 MiB 2022/04/10 23:01:06.427, Tesla V100-SXM2-32GB, 470.57.02, 62 %, 40 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/04/10 23:01:06.428, Tesla V100-SXM2-32GB, 470.57.02, 64 %, 40 %, 32510 MiB, 21163 MiB, 11347 MiB 2022/04/10 23:01:06.429, Tesla V100-SXM2-32GB, 470.57.02, 64 %, 40 %, 32510 MiB, 21163 MiB, 11347 MiB 2022/04/10 23:01:06.430, Tesla V100-SXM2-32GB, 470.57.02, 62 %, 40 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/04/10 23:01:06.430, Tesla V100-SXM2-32GB, 470.57.02, 64 %, 40 %, 32510 MiB, 21163 MiB, 11347 MiB 2022/04/10 23:01:06.434, Tesla V100-SXM2-32GB, 470.57.02, 62 %, 40 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/04/10 23:01:06.434, Tesla V100-SXM2-32GB, 470.57.02, 33 %, 28 %, 32510 MiB, 21332 MiB, 11178 MiB 2022/04/10 23:01:06.436, Tesla V100-SXM2-32GB, 470.57.02, 62 %, 40 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/04/10 23:01:06.437, Tesla V100-SXM2-32GB, 470.57.02, 33 %, 28 %, 32510 MiB, 21332 MiB, 11178 MiB 2022/04/10 23:01:06.439, Tesla V100-SXM2-32GB, 470.57.02, 62 %, 40 %, 32510 MiB, 
21172 MiB, 11338 MiB 2022/04/10 23:01:06.441, Tesla V100-SXM2-32GB, 470.57.02, 62 %, 40 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/04/10 23:01:06.441, Tesla V100-SXM2-32GB, 470.57.02, 33 %, 28 %, 32510 MiB, 21332 MiB, 11178 MiB 2022/04/10 23:01:06.442, Tesla V100-SXM2-32GB, 470.57.02, 62 %, 40 %, 32510 MiB, 21172 MiB, 11338 MiB 2022/04/10 23:01:06.445, Tesla V100-SXM2-32GB, 470.57.02, 33 %, 28 %, 32510 MiB, 21332 MiB, 11178 MiB 2022/04/10 23:01:06.445, Tesla V100-SXM2-32GB, 470.57.02, 16 %, 14 %, 32510 MiB, 21268 MiB, 11242 MiB 2022/04/10 23:01:06.447, Tesla V100-SXM2-32GB, 470.57.02, 33 %, 28 %, 32510 MiB, 21332 MiB, 11178 MiB 2022/04/10 23:01:06.447, Tesla V100-SXM2-32GB, 470.57.02, 16 %, 14 %, 32510 MiB, 21268 MiB, 11242 MiB 2022/04/10 23:01:06.449, Tesla V100-SXM2-32GB, 470.57.02, 33 %, 28 %, 32510 MiB, 21332 MiB, 11178 MiB 2022/04/10 23:01:06.451, Tesla V100-SXM2-32GB, 470.57.02, 33 %, 28 %, 32510 MiB, 21332 MiB, 11178 MiB 2022/04/10 23:01:06.452, Tesla V100-SXM2-32GB, 470.57.02, 16 %, 14 %, 32510 MiB, 21268 MiB, 11242 MiB 2022/04/10 23:01:06.453, Tesla V100-SXM2-32GB, 470.57.02, 33 %, 28 %, 32510 MiB, 21332 MiB, 11178 MiB 2022/04/10 23:01:06.456, Tesla V100-SXM2-32GB, 470.57.02, 16 %, 14 %, 32510 MiB, 21268 MiB, 11242 MiB 2022/04/10 23:01:06.456, Tesla V100-SXM2-32GB, 470.57.02, 28 %, 24 %, 32510 MiB, 21282 MiB, 11228 MiB 2022/04/10 23:01:06.458, Tesla V100-SXM2-32GB, 470.57.02, 16 %, 14 %, 32510 MiB, 21268 MiB, 11242 MiB 2022/04/10 23:01:06.458, Tesla V100-SXM2-32GB, 470.57.02, 28 %, 24 %, 32510 MiB, 21282 MiB, 11228 MiB 2022/04/10 23:01:06.460, Tesla V100-SXM2-32GB, 470.57.02, 16 %, 14 %, 32510 MiB, 21268 MiB, 11242 MiB 2022/04/10 23:01:06.462, Tesla V100-SXM2-32GB, 470.57.02, 16 %, 14 %, 32510 MiB, 21268 MiB, 11242 MiB 2022/04/10 23:01:06.462, Tesla V100-SXM2-32GB, 470.57.02, 28 %, 24 %, 32510 MiB, 21282 MiB, 11228 MiB 2022/04/10 23:01:06.464, Tesla V100-SXM2-32GB, 470.57.02, 16 %, 14 %, 32510 MiB, 21268 MiB, 11242 MiB 2022/04/10 23:01:06.467, Tesla 
V100-SXM2-32GB, 470.57.02, 28 %, 24 %, 32510 MiB, 21282 MiB, 11228 MiB 2022/04/10 23:01:06.467, Tesla V100-SXM2-32GB, 470.57.02, 89 %, 58 %, 32510 MiB, 21188 MiB, 11322 MiB 2022/04/10 23:01:06.469, Tesla V100-SXM2-32GB, 470.57.02, 28 %, 24 %, 32510 MiB, 21282 MiB, 11228 MiB 2022/04/10 23:01:06.469, Tesla V100-SXM2-32GB, 470.57.02, 89 %, 58 %, 32510 MiB, 21188 MiB, 11322 MiB 2022/04/10 23:01:06.471, Tesla V100-SXM2-32GB, 470.57.02, 28 %, 24 %, 32510 MiB, 21282 MiB, 11228 MiB 2022/04/10 23:01:06.473, Tesla V100-SXM2-32GB, 470.57.02, 28 %, 24 %, 32510 MiB, 21282 MiB, 11228 MiB 2022/04/10 23:01:06.474, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21188 MiB, 11322 MiB 2022/04/10 23:01:06.475, Tesla V100-SXM2-32GB, 470.57.02, 28 %, 24 %, 32510 MiB, 21282 MiB, 11228 MiB 2022/04/10 23:01:06.478, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21188 MiB, 11322 MiB 2022/04/10 23:01:06.478, Tesla V100-SXM2-32GB, 470.57.02, 83 %, 58 %, 32510 MiB, 21042 MiB, 11468 MiB 2022/04/10 23:01:06.480, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21188 MiB, 11322 MiB 2022/04/10 23:01:06.480, Tesla V100-SXM2-32GB, 470.57.02, 83 %, 58 %, 32510 MiB, 21042 MiB, 11468 MiB 2022/04/10 23:01:06.482, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21188 MiB, 11322 MiB 2022/04/10 23:01:06.485, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21188 MiB, 11322 MiB 2022/04/10 23:01:06.485, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21042 MiB, 11468 MiB 2022/04/10 23:01:06.486, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21188 MiB, 11322 MiB 2022/04/10 23:01:06.489, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21042 MiB, 11468 MiB 2022/04/10 23:01:06.490, Tesla V100-SXM2-32GB, 470.57.02, 8 %, 6 %, 32510 MiB, 21230 MiB, 11280 MiB 2022/04/10 23:01:06.491, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21042 MiB, 11468 MiB 2022/04/10 23:01:06.492, Tesla V100-SXM2-32GB, 470.57.02, 8 %, 6 %, 32510 MiB, 21230 MiB, 11280 MiB 2022/04/10 
23:01:06.494, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21042 MiB, 11468 MiB 2022/04/10 23:01:06.496, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21042 MiB, 11468 MiB 2022/04/10 23:01:06.496, Tesla V100-SXM2-32GB, 470.57.02, 8 %, 6 %, 32510 MiB, 21230 MiB, 11280 MiB 2022/04/10 23:01:06.498, Tesla V100-SXM2-32GB, 470.57.02, 0 %, 0 %, 32510 MiB, 21042 MiB, 11468 MiB 2022/04/10 23:01:06.502, Tesla V100-SXM2-32GB, 470.57.02, 8 %, 6 %, 32510 MiB, 21230 MiB, 11280 MiB 2022/04/10 23:01:06.504, Tesla V100-SXM2-32GB, 470.57.02, 8 %, 6 %, 32510 MiB, 21230 MiB, 11280 MiB 2022/04/10 23:01:06.507, Tesla V100-SXM2-32GB, 470.57.02, 8 %, 6 %, 32510 MiB, 21230 MiB, 11280 MiB 2022/04/10 23:01:06.511, Tesla V100-SXM2-32GB, 470.57.02, 8 %, 6 %, 32510 MiB, 21230 MiB, 11280 MiB 2022/04/10 23:01:06.514, Tesla V100-SXM2-32GB, 470.57.02, 8 %, 6 %, 32510 MiB, 21230 MiB, 11280 MiB [rank:5] [train], epoch: 0/50, iter: 200/834, loss: 0.82846, top1: 0.01323, throughput: 1207.00 | 2022-04-10 23:01:21.956 [rank:6] [train], epoch: 0/50, iter: 200/834, loss: 0.82873, top1: 0.01229, throughput: 1206.98 | 2022-04-10 23:01:21.957 [rank:3] [train], epoch: 0/50, iter: 200/834, loss: 0.82923, top1: 0.01198, throughput: 1206.99 | 2022-04-10 23:01:21.959 [rank:0] [train], epoch: 0/50, iter: 200/834, loss: 0.82930, top1: 0.01203, throughput: 1207.22 | 2022-04-10 23:01:21.960 [rank:1] [train], epoch: 0/50, iter: 200/834, loss: 0.82906, top1: 0.01208, throughput: 1206.95 | 2022-04-10 23:01:21.958 [rank:2] [train], epoch: 0/50, iter: 200/834, loss: 0.82901, top1: 0.01219, throughput: 1207.29 | 2022-04-10 23:01:21.959 [rank:7] [train], epoch: 0/50, iter: 200/834, loss: 0.83003, top1: 0.01214, throughput: 1206.87 | 2022-04-10 23:01:21.959 [rank:4] [train], epoch: 0/50, iter: 200/834, loss: 0.82920, top1: 0.01417, throughput: 1206.88 | 2022-04-10 23:01:21.958 [rank:5] [train], epoch: 0/50, iter: 300/834, loss: 0.80315, top1: 0.01937, throughput: 1308.96 | 2022-04-10 23:01:36.624 [rank:4] 
[train], epoch: 0/50, iter: 300/834, loss: 0.80390, top1: 0.01729, throughput: 1309.08 | 2022-04-10 23:01:36.625 [rank:0] [train], epoch: 0/50, iter: 300/834, loss: 0.80380, top1: 0.01990, throughput: 1309.16 | 2022-04-10 23:01:36.626 [rank:6] [train], epoch: 0/50, iter: 300/834, loss: 0.80425, top1: 0.01979, throughput: 1309.04 | 2022-04-10 23:01:36.624 [rank:7] [train], epoch: 0/50, iter: 300/834, loss: 0.80353, top1: 0.01953, throughput: 1309.10 | 2022-04-10 23:01:36.626 [rank:2] [train], epoch: 0/50, iter: 300/834, loss: 0.80467, top1: 0.01896, throughput: 1308.93 | 2022-04-10 23:01:36.627 [rank:1] [train], epoch: 0/50, iter: 300/834, loss: 0.80262, top1: 0.01984, throughput: 1308.70 | 2022-04-10 23:01:36.629 [rank:3] [train], epoch: 0/50, iter: 300/834, loss: 0.80326, top1: 0.01891, throughput: 1308.62 | 2022-04-10 23:01:36.630 [rank:5] [train], epoch: 0/50, iter: 400/834, loss: 0.78536, top1: 0.02688, throughput: 1300.13 | 2022-04-10 23:01:51.392 [rank:0] [train], epoch: 0/50, iter: 400/834, loss: 0.78507, top1: 0.02740, throughput: 1300.25 | 2022-04-10 23:01:51.392 [rank:2] [train], epoch: 0/50, iter: 400/834, loss: 0.78451, top1: 0.02578, throughput: 1300.40 | 2022-04-10 23:01:51.392 [rank:3] [train], epoch: 0/50, iter: 400/834, loss: 0.78563, top1: 0.02557, throughput: 1300.57 | 2022-04-10 23:01:51.393 [rank:7] [train], epoch: 0/50, iter: 400/834, loss: 0.78786, top1: 0.02427, throughput: 1300.14 | 2022-04-10 23:01:51.394 [rank:4] [train], epoch: 0/50, iter: 400/834, loss: 0.78638, top1: 0.02469, throughput: 1300.00 | 2022-04-10 23:01:51.394 [rank:1] [train], epoch: 0/50, iter: 400/834, loss: 0.78550, top1: 0.02594, throughput: 1300.47 | 2022-04-10 23:01:51.393 [rank:6] [train], epoch: 0/50, iter: 400/834, loss: 0.78579, top1: 0.02469, throughput: 1300.00 | 2022-04-10 23:01:51.393 [rank:6] [train], epoch: 0/50, iter: 500/834, loss: 0.76953, top1: 0.03375, throughput: 1268.20 | 2022-04-10 23:02:06.533 [rank:5] [train], epoch: 0/50, iter: 500/834, loss: 
0.77195, top1: 0.03339, throughput: 1268.05 | 2022-04-10 23:02:06.533 [rank:7] [train], epoch: 0/50, iter: 500/834, loss: 0.77231, top1: 0.03354, throughput: 1268.09 | 2022-04-10 23:02:06.534 [rank:0] [train], epoch: 0/50, iter: 500/834, loss: 0.77133, top1: 0.03375, throughput: 1267.81 | 2022-04-10 23:02:06.536 [rank:4] [train], epoch: 0/50, iter: 500/834, loss: 0.77181, top1: 0.03302, throughput: 1268.20 | 2022-04-10 23:02:06.533 [rank:2] [train], epoch: 0/50, iter: 500/834, loss: 0.77224, top1: 0.03307, throughput: 1267.78 | 2022-04-10 23:02:06.536 [rank:3] [train], epoch: 0/50, iter: 500/834, loss: 0.76884, top1: 0.03448, throughput: 1267.52 | 2022-04-10 23:02:06.541 [rank:1] [train], epoch: 0/50, iter: 500/834, loss: 0.77202, top1: 0.03411, throughput: 1267.56 | 2022-04-10 23:02:06.541 [rank:5] [train], epoch: 0/50, iter: 600/834, loss: 0.75557, top1: 0.03995, throughput: 1288.77 | 2022-04-10 23:02:21.431 [rank:6] [train], epoch: 0/50, iter: 600/834, loss: 0.75658, top1: 0.04109, throughput: 1288.69 | 2022-04-10 23:02:21.432 [rank:4] [train], epoch: 0/50, iter: 600/834, loss: 0.75676, top1: 0.04021, throughput: 1288.82 | 2022-04-10 23:02:21.431 [rank:0] [train], epoch: 0/50, iter: 600/834, loss: 0.75568, top1: 0.04203, throughput: 1288.89 | 2022-04-10 23:02:21.433 [rank:2] [train], epoch: 0/50, iter: 600/834, loss: 0.75715, top1: 0.03984, throughput: 1288.92 | 2022-04-10 23:02:21.432 [rank:1] [train], epoch: 0/50, iter: 600/834, loss: 0.75605, top1: 0.04026, throughput: 1289.10 | 2022-04-10 23:02:21.435 [rank:3] [train], epoch: 0/50, iter: 600/834, loss: 0.75653, top1: 0.04120, throughput: 1289.12 | 2022-04-10 23:02:21.435 [rank:7] [train], epoch: 0/50, iter: 600/834, loss: 0.75836, top1: 0.04052, throughput: 1288.67 | 2022-04-10 23:02:21.433 [rank:0] [train], epoch: 0/50, iter: 700/834, loss: 0.74230, top1: 0.05130, throughput: 1308.38 | 2022-04-10 23:02:36.108 [rank:5] [train], epoch: 0/50, iter: 700/834, loss: 0.74138, top1: 0.05026, throughput: 1308.16 | 
2022-04-10 23:02:36.108 [rank:7] [train], epoch: 0/50, iter: 700/834, loss: 0.74120, top1: 0.04932, throughput: 1308.48 | 2022-04-10 23:02:36.107 [rank:6] [train], epoch: 0/50, iter: 700/834, loss: 0.74190, top1: 0.05224, throughput: 1308.23 | 2022-04-10 23:02:36.108 [rank:2] [train], epoch: 0/50, iter: 700/834, loss: 0.74113, top1: 0.05109, throughput: 1308.35 | 2022-04-10 23:02:36.107 [rank:4] [train], epoch: 0/50, iter: 700/834, loss: 0.74057, top1: 0.05068, throughput: 1308.14 | 2022-04-10 23:02:36.108 [rank:1] [train], epoch: 0/50, iter: 700/834, loss: 0.74123, top1: 0.05109, throughput: 1308.38 | 2022-04-10 23:02:36.109 [rank:3] [train], epoch: 0/50, iter: 700/834, loss: 0.74157, top1: 0.05109, throughput: 1308.45 | 2022-04-10 23:02:36.109 [rank:6] [train], epoch: 0/50, iter: 800/834, loss: 0.72399, top1: 0.05984, throughput: 1309.70 | 2022-04-10 23:02:50.768 [rank:5] [train], epoch: 0/50, iter: 800/834, loss: 0.72903, top1: 0.05661, throughput: 1309.66 | 2022-04-10 23:02:50.769 [rank:4] [train], epoch: 0/50, iter: 800/834, loss: 0.72674, top1: 0.06021, throughput: 1309.65 | 2022-04-10 23:02:50.768 [rank:2] [train], epoch: 0/50, iter: 800/834, loss: 0.72835, top1: 0.06146, throughput: 1309.48 | 2022-04-10 23:02:50.770 [rank:3] [train], epoch: 0/50, iter: 800/834, loss: 0.72676, top1: 0.06021, throughput: 1309.47 | 2022-04-10 23:02:50.771 [rank:7] [train], epoch: 0/50, iter: 800/834, loss: 0.72633, top1: 0.05875, throughput: 1309.43 | 2022-04-10 23:02:50.770 [rank:0] [train], epoch: 0/50, iter: 800/834, loss: 0.72725, top1: 0.05714, throughput: 1309.29 | 2022-04-10 23:02:50.772 [rank:1] [train], epoch: 0/50, iter: 800/834, loss: 0.72612, top1: 0.05630, throughput: 1309.53 | 2022-04-10 23:02:50.771 [rank:5] [train], epoch: 0/50, iter: 834/834, loss: 0.71695, top1: 0.06679, throughput: 1264.90 | 2022-04-10 23:02:55.929 [rank:0] [train], epoch: 0/50, iter: 834/834, loss: 0.71611, top1: 0.06679, throughput: 1265.34 | 2022-04-10 23:02:55.931 [rank:2] [train], epoch:
0/50, iter: 834/834, loss: 0.71668, top1: 0.06939, throughput: 1264.82 | 2022-04-10 23:02:55.931 [rank:4] [train], epoch: 0/50, iter: 834/834, loss: 0.71681, top1: 0.06664, throughput: 1264.59 | 2022-04-10 23:02:55.931 [rank:3] [train], epoch: 0/50, iter: 834/834, loss: 0.71716, top1: 0.06771, throughput: 1264.80 | 2022-04-10 23:02:55.932 [rank:6] [train], epoch: 0/50, iter: 834/834, loss: 0.71559, top1: 0.06878, throughput: 1263.82 | 2022-04-10 23:02:55.933 [rank:7] [train], epoch: 0/50, iter: 834/834, loss: 0.71895, top1: 0.06403, throughput: 1264.53 | 2022-04-10 23:02:55.932 [rank:1] [train], epoch: 0/50, iter: 834/834, loss: 0.71667, top1: 0.06464, throughput: 1264.52 | 2022-04-10 23:02:55.933 [rank:0] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.06240, throughput: 254.41 | 2022-04-10 23:03:20.498 [rank:7] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.05504, throughput: 253.99 | 2022-04-10 23:03:20.540 [rank:4] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.06096, throughput: 253.32 | 2022-04-10 23:03:20.603 [rank:6] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.05984, throughput: 252.92 | 2022-04-10 23:03:20.645 [rank:3] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.06304, throughput: 252.77 | 2022-04-10 23:03:20.658 [rank:2] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.06208, throughput: 252.16 | 2022-04-10 23:03:20.717 [rank:1] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.06576, throughput: 251.59 | 2022-04-10 23:03:20.775 [rank:5] [eval], epoch: 0/50, iter: 125/125, loss: 0.00000, top1: 0.06192, throughput: 250.47 | 2022-04-10 23:03:20.882 [rank:4] [train], epoch: 1/50, iter: 100/834, loss: 0.70669, top1: 0.07396, throughput: 1297.00 | 2022-04-10 23:03:35.407 [rank:2] [train], epoch: 1/50, iter: 100/834, loss: 0.70398, top1: 0.07703, throughput: 1306.98 | 2022-04-10 23:03:35.407 [rank:5] [train], epoch: 1/50, iter: 100/834, loss: 0.70652, top1: 0.07333, throughput: 
1321.86 | 2022-04-10 23:03:35.407 [rank:0] [train], epoch: 1/50, iter: 100/834, loss: 0.70432, top1: 0.07573, throughput: 1287.74 | 2022-04-10 23:03:35.408 [rank:1] [train], epoch: 1/50, iter: 100/834, loss: 0.70617, top1: 0.07380, throughput: 1312.13 | 2022-04-10 23:03:35.408 [rank:6] [train], epoch: 1/50, iter: 100/834, loss: 0.70813, top1: 0.07297, throughput: 1300.57 | 2022-04-10 23:03:35.408 [rank:7] [train], epoch: 1/50, iter: 100/834, loss: 0.70612, top1: 0.07312, throughput: 1291.27 | 2022-04-10 23:03:35.409 [rank:3] [train], epoch: 1/50, iter: 100/834, loss: 0.70642, top1: 0.07354, throughput: 1301.43 | 2022-04-10 23:03:35.411 [rank:5] [train], epoch: 1/50, iter: 200/834, loss: 0.68940, top1: 0.08948, throughput: 1301.84 | 2022-04-10 23:03:50.156 [rank:1] [train], epoch: 1/50, iter: 200/834, loss: 0.69028, top1: 0.08719, throughput: 1301.90 | 2022-04-10 23:03:50.156 [rank:3] [train], epoch: 1/50, iter: 200/834, loss: 0.68904, top1: 0.09000, throughput: 1302.17 | 2022-04-10 23:03:50.156 [rank:7] [train], epoch: 1/50, iter: 200/834, loss: 0.69002, top1: 0.08953, throughput: 1302.07 | 2022-04-10 23:03:50.155 [rank:0] [train], epoch: 1/50, iter: 200/834, loss: 0.68869, top1: 0.08698, throughput: 1301.89 | 2022-04-10 23:03:50.156 [rank:4] [train], epoch: 1/50, iter: 200/834, loss: 0.68735, top1: 0.09146, throughput: 1301.81 | 2022-04-10 23:03:50.155 [rank:2] [train], epoch: 1/50, iter: 200/834, loss: 0.68788, top1: 0.08635, throughput: 1301.79 | 2022-04-10 23:03:50.156 [rank:6] [train], epoch: 1/50, iter: 200/834, loss: 0.69041, top1: 0.08708, throughput: 1301.95 | 2022-04-10 23:03:50.155 [rank:6] [train], epoch: 1/50, iter: 300/834, loss: 0.67721, top1: 0.10089, throughput: 1308.28 | 2022-04-10 23:04:04.831 [rank:4] [train], epoch: 1/50, iter: 300/834, loss: 0.67523, top1: 0.10385, throughput: 1308.24 | 2022-04-10 23:04:04.831 [rank:2] [train], epoch: 1/50, iter: 300/834, loss: 0.67485, top1: 0.09766, throughput: 1308.21 | 2022-04-10 23:04:04.833 [rank:7] 
[train], epoch: 1/50, iter: 300/834, loss: 0.67495, top1: 0.10042, throughput: 1308.05 | 2022-04-10 23:04:04.833 [rank:0] [train], epoch: 1/50, iter: 300/834, loss: 0.67690, top1: 0.09844, throughput: 1307.76 | 2022-04-10 23:04:04.837 [rank:5] [train], epoch: 1/50, iter: 300/834, loss: 0.67510, top1: 0.10276, throughput: 1308.10 | 2022-04-10 23:04:04.833 [rank:1] [train], epoch: 1/50, iter: 300/834, loss: 0.67534, top1: 0.09766, throughput: 1308.02 | 2022-04-10 23:04:04.834 [rank:3] [train], epoch: 1/50, iter: 300/834, loss: 0.67485, top1: 0.09781, throughput: 1308.10 | 2022-04-10 23:04:04.833 [rank:6] [train], epoch: 1/50, iter: 400/834, loss: 0.65844, top1: 0.11396, throughput: 1277.23 | 2022-04-10 23:04:19.863 [rank:1] [train], epoch: 1/50, iter: 400/834, loss: 0.65556, top1: 0.11516, throughput: 1277.41 | 2022-04-10 23:04:19.865 [rank:5] [train], epoch: 1/50, iter: 400/834, loss: 0.65679, top1: 0.11443, throughput: 1277.19 | 2022-04-10 23:04:19.866 [rank:3] [train], epoch: 1/50, iter: 400/834, loss: 0.65921, top1: 0.11443, throughput: 1277.13 | 2022-04-10 23:04:19.867 [rank:7] [train], epoch: 1/50, iter: 400/834, loss: 0.65684, top1: 0.11516, throughput: 1277.14 | 2022-04-10 23:04:19.867 [rank:4] [train], epoch: 1/50, iter: 400/834, loss: 0.65770, top1: 0.11344, throughput: 1276.83 | 2022-04-10 23:04:19.869 [rank:2] [train], epoch: 1/50, iter: 400/834, loss: 0.65571, top1: 0.11526, throughput: 1276.94 | 2022-04-10 23:04:19.869 [rank:0] [train], epoch: 1/50, iter: 400/834, loss: 0.65600, top1: 0.11870, throughput: 1277.55 | 2022-04-10 23:04:19.866 [rank:5] [train], epoch: 1/50, iter: 500/834, loss: 0.63845, top1: 0.13161, throughput: 1290.31 | 2022-04-10 23:04:34.747 [rank:6] [train], epoch: 1/50, iter: 500/834, loss: 0.64240, top1: 0.13125, throughput: 1290.05 | 2022-04-10 23:04:34.747 [rank:4] [train], epoch: 1/50, iter: 500/834, loss: 0.63792, top1: 0.14193, throughput: 1290.44 | 2022-04-10 23:04:34.747 [rank:7] [train], epoch: 1/50, iter: 500/834, loss: 
0.64142, top1: 0.13328, throughput: 1290.24 | 2022-04-10 23:04:34.748 [rank:0] [train], epoch: 1/50, iter: 500/834, loss: 0.64239, top1: 0.12953, throughput: 1289.65 | 2022-04-10 23:04:34.754 [rank:2] [train], epoch: 1/50, iter: 500/834, loss: 0.64284, top1: 0.13271, throughput: 1290.31 | 2022-04-10 23:04:34.749 [rank:1] [train], epoch: 1/50, iter: 500/834, loss: 0.64044, top1: 0.13198, throughput: 1289.84 | 2022-04-10 23:04:34.750 [rank:3] [train], epoch: 1/50, iter: 500/834, loss: 0.64002, top1: 0.13573, throughput: 1290.07 | 2022-04-10 23:04:34.750 [rank:6] [train], epoch: 1/50, iter: 600/834, loss: 0.62477, top1: 0.14698, throughput: 1315.17 | 2022-04-10 23:04:49.345 [rank:1] [train], epoch: 1/50, iter: 600/834, loss: 0.62694, top1: 0.14750, throughput: 1315.38 | 2022-04-10 23:04:49.347 [rank:5] [train], epoch: 1/50, iter: 600/834, loss: 0.62398, top1: 0.14927, throughput: 1314.94 | 2022-04-10 23:04:49.348 [rank:7] [train], epoch: 1/50, iter: 600/834, loss: 0.62363, top1: 0.14833, throughput: 1314.76 | 2022-04-10 23:04:49.351 [rank:2] [train], epoch: 1/50, iter: 600/834, loss: 0.62746, top1: 0.14958, throughput: 1314.89 | 2022-04-10 23:04:49.351 [rank:4] [train], epoch: 1/50, iter: 600/834, loss: 0.62627, top1: 0.14432, throughput: 1314.72 | 2022-04-10 23:04:49.351 [rank:3] [train], epoch: 1/50, iter: 600/834, loss: 0.62566, top1: 0.14609, throughput: 1314.91 | 2022-04-10 23:04:49.352 [rank:0] [train], epoch: 1/50, iter: 600/834, loss: 0.62524, top1: 0.15010, throughput: 1315.64 | 2022-04-10 23:04:49.347 [rank:6] [train], epoch: 1/50, iter: 700/834, loss: 0.61584, top1: 0.16198, throughput: 1285.37 | 2022-04-10 23:05:04.283 [rank:4] [train], epoch: 1/50, iter: 700/834, loss: 0.61247, top1: 0.16458, throughput: 1285.92 | 2022-04-10 23:05:04.282 [rank:2] [train], epoch: 1/50, iter: 700/834, loss: 0.61202, top1: 0.16448, throughput: 1285.82 | 2022-04-10 23:05:04.283 [rank:0] [train], epoch: 1/50, iter: 700/834, loss: 0.61138, top1: 0.16760, throughput: 1285.26 |
2022-04-10 23:05:04.286 [rank:7] [train], epoch: 1/50, iter: 700/834, loss: 0.61434, top1: 0.15990, throughput: 1285.87 | 2022-04-10 23:05:04.282 [rank:5] [train], epoch: 1/50, iter: 700/834, loss: 0.61269, top1: 0.16234, throughput: 1285.54 | 2022-04-10 23:05:04.283 [rank:1] [train], epoch: 1/50, iter: 700/834, loss: 0.61359, top1: 0.16266, throughput: 1285.37 | 2022-04-10 23:05:04.284 [rank:3] [train], epoch: 1/50, iter: 700/834, loss: 0.61180, top1: 0.16203, throughput: 1285.77 | 2022-04-10 23:05:04.285 [rank:6] [train], epoch: 1/50, iter: 800/834, loss: 0.59683, top1: 0.18120, throughput: 1304.96 | 2022-04-10 23:05:18.996 [rank:5] [train], epoch: 1/50, iter: 800/834, loss: 0.59968, top1: 0.17599, throughput: 1304.89 | 2022-04-10 23:05:18.997 [rank:7] [train], epoch: 1/50, iter: 800/834, loss: 0.59798, top1: 0.18370, throughput: 1304.76 | 2022-04-10 23:05:18.998 [rank:4] [train], epoch: 1/50, iter: 800/834, loss: 0.60109, top1: 0.17948, throughput: 1304.73 | 2022-04-10 23:05:18.998 [rank:2] [train], epoch: 1/50, iter: 800/834, loss: 0.59649, top1: 0.18203, throughput: 1304.84 | 2022-04-10 23:05:18.997 [rank:1] [train], epoch: 1/50, iter: 800/834, loss: 0.59420, top1: 0.18995, throughput: 1304.66 | 2022-04-10 23:05:19.001 [rank:0] [train], epoch: 1/50, iter: 800/834, loss: 0.59682, top1: 0.18328, throughput: 1304.97 | 2022-04-10 23:05:18.999 [rank:3] [train], epoch: 1/50, iter: 800/834, loss: 0.59523, top1: 0.18109, throughput: 1304.28 | 2022-04-10 23:05:19.005 [rank:4] [train], epoch: 1/50, iter: 834/834, loss: 0.58995, top1: 0.19792, throughput: 1309.70 | 2022-04-10 23:05:23.982 [rank:6] [train], epoch: 1/50, iter: 834/834, loss: 0.58722, top1: 0.18781, throughput: 1309.17 | 2022-04-10 23:05:23.982 [rank:5] [train], epoch: 1/50, iter: 834/834, loss: 0.59275, top1: 0.18536, throughput: 1309.46 | 2022-04-10 23:05:23.982 [rank:2] [train], epoch: 1/50, iter: 834/834, loss: 0.58749, top1: 0.18658, throughput: 1309.41 | 2022-04-10 23:05:23.983 [rank:0] [train], epoch: 1/50, iter: 834/834, loss:
0.58043, top1: 0.19225, throughput: 1309.67 | 2022-04-10 23:05:23.983 [rank:7] [train], epoch: 1/50, iter: 834/834, loss: 0.58649, top1: 0.19409, throughput: 1308.99 | 2022-04-10 23:05:23.985 [rank:1] [train], epoch: 1/50, iter: 834/834, loss: 0.58996, top1: 0.19332, throughput: 1309.53 | 2022-04-10 23:05:23.986 [rank:3] [train], epoch: 1/50, iter: 834/834, loss: 0.58766, top1: 0.19317, throughput: 1310.46 | 2022-04-10 23:05:23.987 [rank:7] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.15872, throughput: 572.73 | 2022-04-10 23:05:34.898 [rank:0] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.16720, throughput: 572.45 | 2022-04-10 23:05:34.901 [rank:6] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.16304, throughput: 567.19 | 2022-04-10 23:05:35.001 [rank:2] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.16656, throughput: 567.03 | 2022-04-10 23:05:35.005 [rank:3] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.15744, throughput: 566.97 | 2022-04-10 23:05:35.010 [rank:4] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.15440, throughput: 565.59 | 2022-04-10 23:05:35.033 [rank:1] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.16496, throughput: 561.05 | 2022-04-10 23:05:35.126 [rank:5] [eval], epoch: 1/50, iter: 125/125, loss: 0.00000, top1: 0.14480, throughput: 557.80 | 2022-04-10 23:05:35.187 [rank:4] [train], epoch: 2/50, iter: 100/834, loss: 0.57881, top1: 0.20339, throughput: 1304.50 | 2022-04-10 23:05:49.751 [rank:6] [train], epoch: 2/50, iter: 100/834, loss: 0.57652, top1: 0.20432, throughput: 1301.65 | 2022-04-10 23:05:49.752 [rank:1] [train], epoch: 2/50, iter: 100/834, loss: 0.57972, top1: 0.20224, throughput: 1312.58 | 2022-04-10 23:05:49.753 [rank:3] [train], epoch: 2/50, iter: 100/834, loss: 0.58074, top1: 0.20469, throughput: 1302.34 | 2022-04-10 23:05:49.753 [rank:5] [train], epoch: 2/50, iter: 100/834, loss: 0.57851, top1: 0.20359, throughput:
1318.06 | 2022-04-10 23:05:49.754 [rank:2] [train], epoch: 2/50, iter: 100/834, loss: 0.58134, top1: 0.20057, throughput: 1301.87 | 2022-04-10 23:05:49.753 [rank:0] [train], epoch: 2/50, iter: 100/834, loss: 0.57725, top1: 0.20906, throughput: 1292.77 | 2022-04-10 23:05:49.753 [rank:7] [train], epoch: 2/50, iter: 100/834, loss: 0.57867, top1: 0.20318, throughput: 1292.28 | 2022-04-10 23:05:49.755 [rank:4] [train], epoch: 2/50, iter: 200/834, loss: 0.56994, top1: 0.21557, throughput: 1314.24 | 2022-04-10 23:06:04.360 [rank:0] [train], epoch: 2/50, iter: 200/834, loss: 0.56981, top1: 0.21495, throughput: 1314.46 | 2022-04-10 23:06:04.360 [rank:1] [train], epoch: 2/50, iter: 200/834, loss: 0.56620, top1: 0.21370, throughput: 1314.38 | 2022-04-10 23:06:04.361 [rank:3] [train], epoch: 2/50, iter: 200/834, loss: 0.57035, top1: 0.21245, throughput: 1314.46 | 2022-04-10 23:06:04.360 [rank:7] [train], epoch: 2/50, iter: 200/834, loss: 0.57100, top1: 0.21594, throughput: 1314.68 | 2022-04-10 23:06:04.359 [rank:5] [train], epoch: 2/50, iter: 200/834, loss: 0.56744, top1: 0.21995, throughput: 1314.11 | 2022-04-10 23:06:04.365 [rank:6] [train], epoch: 2/50, iter: 200/834, loss: 0.56705, top1: 0.21797, throughput: 1314.29 | 2022-04-10 23:06:04.361 [rank:2] [train], epoch: 2/50, iter: 200/834, loss: 0.56953, top1: 0.21594, throughput: 1314.32 | 2022-04-10 23:06:04.362 [rank:6] [train], epoch: 2/50, iter: 300/834, loss: 0.55688, top1: 0.22714, throughput: 1297.62 | 2022-04-10 23:06:19.157 [rank:2] [train], epoch: 2/50, iter: 300/834, loss: 0.55856, top1: 0.23193, throughput: 1297.54 | 2022-04-10 23:06:19.159 [rank:4] [train], epoch: 2/50, iter: 300/834, loss: 0.55801, top1: 0.23078, throughput: 1297.50 | 2022-04-10 23:06:19.158 [rank:3] [train], epoch: 2/50, iter: 300/834, loss: 0.55924, top1: 0.22740, throughput: 1297.11 | 2022-04-10 23:06:19.162 [rank:1] [train], epoch: 2/50, iter: 300/834, loss: 0.55738, top1: 0.22964, throughput: 1297.48 | 2022-04-10 23:06:19.159 [rank:5] 
[train], epoch: 2/50, iter: 300/834, loss: 0.55683, top1: 0.23198, throughput: 1297.78 | 2022-04-10 23:06:19.159 [rank:0] [train], epoch: 2/50, iter: 300/834, loss: 0.55850, top1: 0.22677, throughput: 1297.34 | 2022-04-10 23:06:19.159 [rank:7] [train], epoch: 2/50, iter: 300/834, loss: 0.55928, top1: 0.23161, throughput: 1297.13 | 2022-04-10 23:06:19.161 [rank:6] [train], epoch: 2/50, iter: 400/834, loss: 0.54712, top1: 0.24885, throughput: 1303.42 | 2022-04-10 23:06:33.887 [rank:5] [train], epoch: 2/50, iter: 400/834, loss: 0.54889, top1: 0.24385, throughput: 1303.53 | 2022-04-10 23:06:33.888 [rank:4] [train], epoch: 2/50, iter: 400/834, loss: 0.54724, top1: 0.24172, throughput: 1303.32 | 2022-04-10 23:06:33.889 [rank:1] [train], epoch: 2/50, iter: 400/834, loss: 0.54856, top1: 0.24578, throughput: 1303.32 | 2022-04-10 23:06:33.890 [rank:2] [train], epoch: 2/50, iter: 400/834, loss: 0.54703, top1: 0.24125, throughput: 1303.57 | 2022-04-10 23:06:33.888 [rank:7] [train], epoch: 2/50, iter: 400/834, loss: 0.54938, top1: 0.23385, throughput: 1303.36 | 2022-04-10 23:06:33.892 [rank:3] [train], epoch: 2/50, iter: 400/834, loss: 0.54685, top1: 0.24615, throughput: 1303.67 | 2022-04-10 23:06:33.890 [rank:0] [train], epoch: 2/50, iter: 400/834, loss: 0.54775, top1: 0.25021, throughput: 1302.62 | 2022-04-10 23:06:33.899 [rank:2] [train], epoch: 2/50, iter: 500/834, loss: 0.54153, top1: 0.25109, throughput: 1303.80 | 2022-04-10 23:06:48.614 [rank:3] [train], epoch: 2/50, iter: 500/834, loss: 0.54031, top1: 0.25854, throughput: 1303.87 | 2022-04-10 23:06:48.615 [rank:4] [train], epoch: 2/50, iter: 500/834, loss: 0.53643, top1: 0.25776, throughput: 1304.00 | 2022-04-10 23:06:48.613 [rank:6] [train], epoch: 2/50, iter: 500/834, loss: 0.53705, top1: 0.25724, throughput: 1303.70 | 2022-04-10 23:06:48.615 [rank:0] [train], epoch: 2/50, iter: 500/834, loss: 0.53819, top1: 0.25672, throughput: 1304.80 | 2022-04-10 23:06:48.614 [rank:7] [train], epoch: 2/50, iter: 500/834, loss: 
0.53843, top1: 0.25740, throughput: 1304.17 | 2022-04-10 23:06:48.614 [rank:5] [train], epoch: 2/50, iter: 500/834, loss: 0.53712, top1: 0.25807, throughput: 1303.83 | 2022-04-10 23:06:48.614 [rank:1] [train], epoch: 2/50, iter: 500/834, loss: 0.53968, top1: 0.25594, throughput: 1303.70 | 2022-04-10 23:06:48.618 [rank:5] [train], epoch: 2/50, iter: 600/834, loss: 0.52835, top1: 0.27286, throughput: 1298.04 | 2022-04-10 23:07:03.406 [rank:7] [train], epoch: 2/50, iter: 600/834, loss: 0.53111, top1: 0.26698, throughput: 1298.07 | 2022-04-10 23:07:03.406 [rank:4] [train], epoch: 2/50, iter: 600/834, loss: 0.53087, top1: 0.26531, throughput: 1297.97 | 2022-04-10 23:07:03.406 [rank:6] [train], epoch: 2/50, iter: 600/834, loss: 0.53413, top1: 0.26391, throughput: 1297.99 | 2022-04-10 23:07:03.407 [rank:0] [train], epoch: 2/50, iter: 600/834, loss: 0.52956, top1: 0.27151, throughput: 1297.81 | 2022-04-10 23:07:03.408 [rank:1] [train], epoch: 2/50, iter: 600/834, loss: 0.53280, top1: 0.26542, throughput: 1298.17 | 2022-04-10 23:07:03.408 [rank:2] [train], epoch: 2/50, iter: 600/834, loss: 0.52931, top1: 0.26688, throughput: 1298.01 | 2022-04-10 23:07:03.406 [rank:3] [train], epoch: 2/50, iter: 600/834, loss: 0.52879, top1: 0.27073, throughput: 1297.99 | 2022-04-10 23:07:03.407 [rank:2] [train], epoch: 2/50, iter: 700/834, loss: 0.51985, top1: 0.28234, throughput: 1311.86 | 2022-04-10 23:07:18.041 [rank:4] [train], epoch: 2/50, iter: 700/834, loss: 0.52187, top1: 0.28302, throughput: 1311.90 | 2022-04-10 23:07:18.041 [rank:5] [train], epoch: 2/50, iter: 700/834, loss: 0.52335, top1: 0.27943, throughput: 1311.90 | 2022-04-10 23:07:18.041 [rank:3] [train], epoch: 2/50, iter: 700/834, loss: 0.52169, top1: 0.27906, throughput: 1311.70 | 2022-04-10 23:07:18.045 [rank:7] [train], epoch: 2/50, iter: 700/834, loss: 0.52352, top1: 0.27531, throughput: 1311.78 | 2022-04-10 23:07:18.042 [rank:6] [train], epoch: 2/50, iter: 700/834, loss: 0.52328, top1: 0.27938, throughput: 1311.86 | 
2022-04-10 23:07:18.043 [rank:0] [train], epoch: 2/50, iter: 700/834, loss: 0.52295, top1: 0.27677, throughput: 1311.79 | 2022-04-10 23:07:18.045 [rank:1] [train], epoch: 2/50, iter: 700/834, loss: 0.52012, top1: 0.28771, throughput: 1311.73 | 2022-04-10 23:07:18.045 [rank:5] [train], epoch: 2/50, iter: 800/834, loss: 0.51662, top1: 0.28911, throughput: 1310.04 | 2022-04-10 23:07:32.697 [rank:4] [train], epoch: 2/50, iter: 800/834, loss: 0.51164, top1: 0.28911, throughput: 1310.00 | 2022-04-10 23:07:32.697 [rank:6] [train], epoch: 2/50, iter: 800/834, loss: 0.51352, top1: 0.29094, throughput: 1310.05 | 2022-04-10 23:07:32.698 [rank:1] [train], epoch: 2/50, iter: 800/834, loss: 0.51433, top1: 0.29135, throughput: 1310.22 | 2022-04-10 23:07:32.699 [rank:0] [train], epoch: 2/50, iter: 800/834, loss: 0.51648, top1: 0.29234, throughput: 1310.09 | 2022-04-10 23:07:32.700 [rank:7] [train], epoch: 2/50, iter: 800/834, loss: 0.51351, top1: 0.29557, throughput: 1309.86 | 2022-04-10 23:07:32.700 [rank:3] [train], epoch: 2/50, iter: 800/834, loss: 0.51585, top1: 0.28708, throughput: 1310.16 | 2022-04-10 23:07:32.699 [rank:2] [train], epoch: 2/50, iter: 800/834, loss: 0.51746, top1: 0.28786, throughput: 1309.84 | 2022-04-10 23:07:32.700 [rank:4] [train], epoch: 2/50, iter: 834/834, loss: 0.50230, top1: 0.31587, throughput: 1313.47 | 2022-04-10 23:07:37.667 [rank:5] [train], epoch: 2/50, iter: 834/834, loss: 0.51012, top1: 0.30515, throughput: 1313.23 | 2022-04-10 23:07:37.668 [rank:2] [train], epoch: 2/50, iter: 834/834, loss: 0.50995, top1: 0.29473, throughput: 1313.69 | 2022-04-10 23:07:37.669 [rank:0] [train], epoch: 2/50, iter: 834/834, loss: 0.50198, top1: 0.31403, throughput: 1313.51 | 2022-04-10 23:07:37.670 [rank:6] [train], epoch: 2/50, iter: 834/834, loss: 0.51023, top1: 0.30025, throughput: 1313.01 | 2022-04-10 23:07:37.670 [rank:7] [train], epoch: 2/50, iter: 834/834, loss: 0.51037, top1: 0.30025, throughput: 1313.27 | 2022-04-10 23:07:37.671 [rank:1] [train], 
epoch: 2/50, iter: 834/834, loss: 0.50947, top1: 0.29764, throughput: 1312.70 | 2022-04-10 23:07:37.672 [rank:3] [train], epoch: 2/50, iter: 834/834, loss: 0.51010, top1: 0.29427, throughput: 1312.35 | 2022-04-10 23:07:37.674 [rank:0] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.27280, throughput: 579.40 | 2022-04-10 23:07:48.457 [rank:7] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.27840, throughput: 578.96 | 2022-04-10 23:07:48.466 [rank:6] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.27296, throughput: 572.27 | 2022-04-10 23:07:48.592 [rank:2] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.28016, throughput: 571.62 | 2022-04-10 23:07:48.603 [rank:1] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.27680, throughput: 571.41 | 2022-04-10 23:07:48.610 [rank:3] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.26864, throughput: 568.39 | 2022-04-10 23:07:48.669 [rank:4] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.26560, throughput: 563.82 | 2022-04-10 23:07:48.752 [rank:5] [eval], epoch: 2/50, iter: 125/125, loss: 0.00000, top1: 0.26672, throughput: 563.69 | 2022-04-10 23:07:48.756 [rank:4] [train], epoch: 3/50, iter: 100/834, loss: 0.50158, top1: 0.31297, throughput: 1315.63 | 2022-04-10 23:08:03.346 [rank:6] [train], epoch: 3/50, iter: 100/834, loss: 0.50138, top1: 0.30818, throughput: 1301.25 | 2022-04-10 23:08:03.347 [rank:5] [train], epoch: 3/50, iter: 100/834, loss: 0.50063, top1: 0.31266, throughput: 1315.83 | 2022-04-10 23:08:03.347 [rank:0] [train], epoch: 3/50, iter: 100/834, loss: 0.50072, top1: 0.30854, throughput: 1289.44 | 2022-04-10 23:08:03.347 [rank:7] [train], epoch: 3/50, iter: 100/834, loss: 0.49971, top1: 0.31609, throughput: 1290.20 | 2022-04-10 23:08:03.348 [rank:1] [train], epoch: 3/50, iter: 100/834, loss: 0.49855, top1: 0.31234, throughput: 1302.72 | 2022-04-10 23:08:03.348 [rank:3] [train], epoch: 3/50, iter: 100/834, loss: 0.49924, top1: 0.31776, 
throughput: 1307.79 | 2022-04-10 23:08:03.351 [rank:2] [train], epoch: 3/50, iter: 100/834, loss: 0.50108, top1: 0.31266, throughput: 1302.04 | 2022-04-10 23:08:03.349 [rank:5] [train], epoch: 3/50, iter: 200/834, loss: 0.49510, top1: 0.32094, throughput: 1311.49 | 2022-04-10 23:08:17.987 [rank:4] [train], epoch: 3/50, iter: 200/834, loss: 0.49770, top1: 0.31391, throughput: 1311.24 | 2022-04-10 23:08:17.989 [rank:3] [train], epoch: 3/50, iter: 200/834, loss: 0.49645, top1: 0.31958, throughput: 1311.68 | 2022-04-10 23:08:17.989 [rank:6] [train], epoch: 3/50, iter: 200/834, loss: 0.49411, top1: 0.32682, throughput: 1311.27 | 2022-04-10 23:08:17.989 [rank:2] [train], epoch: 3/50, iter: 200/834, loss: 0.49742, top1: 0.32052, throughput: 1311.36 | 2022-04-10 23:08:17.990 [rank:0] [train], epoch: 3/50, iter: 200/834, loss: 0.49648, top1: 0.31734, throughput: 1311.32 | 2022-04-10 23:08:17.989 [rank:7] [train], epoch: 3/50, iter: 200/834, loss: 0.49721, top1: 0.31547, throughput: 1311.38 | 2022-04-10 23:08:17.989 [rank:1] [train], epoch: 3/50, iter: 200/834, loss: 0.49444, top1: 0.32125, throughput: 1311.22 | 2022-04-10 23:08:17.991 [rank:7] [train], epoch: 3/50, iter: 300/834, loss: 0.48871, top1: 0.32693, throughput: 1311.92 | 2022-04-10 23:08:32.624 [rank:5] [train], epoch: 3/50, iter: 300/834, loss: 0.49032, top1: 0.32516, throughput: 1311.96 | 2022-04-10 23:08:32.622 [rank:0] [train], epoch: 3/50, iter: 300/834, loss: 0.49164, top1: 0.32635, throughput: 1311.79 | 2022-04-10 23:08:32.625 [rank:2] [train], epoch: 3/50, iter: 300/834, loss: 0.48467, top1: 0.33917, throughput: 1312.13 | 2022-04-10 23:08:32.623 [rank:4] [train], epoch: 3/50, iter: 300/834, loss: 0.48691, top1: 0.33187, throughput: 1312.06 | 2022-04-10 23:08:32.622 [rank:1] [train], epoch: 3/50, iter: 300/834, loss: 0.48877, top1: 0.33005, throughput: 1311.95 | 2022-04-10 23:08:32.626 [rank:6] [train], epoch: 3/50, iter: 300/834, loss: 0.49080, top1: 0.32385, throughput: 1311.97 | 2022-04-10 23:08:32.623 
[rank:3] [train], epoch: 3/50, iter: 300/834, loss: 0.48894, top1: 0.33042, throughput: 1311.76 | 2022-04-10 23:08:32.625 [rank:2] [train], epoch: 3/50, iter: 400/834, loss: 0.48547, top1: 0.33422, throughput: 1299.80 | 2022-04-10 23:08:47.394 [rank:6] [train], epoch: 3/50, iter: 400/834, loss: 0.48608, top1: 0.33620, throughput: 1299.89 | 2022-04-10 23:08:47.394 [rank:4] [train], epoch: 3/50, iter: 400/834, loss: 0.48621, top1: 0.33276, throughput: 1299.79 | 2022-04-10 23:08:47.394 [rank:5] [train], epoch: 3/50, iter: 400/834, loss: 0.48125, top1: 0.33807, throughput: 1299.58 | 2022-04-10 23:08:47.396 [rank:3] [train], epoch: 3/50, iter: 400/834, loss: 0.48435, top1: 0.33479, throughput: 1299.88 | 2022-04-10 23:08:47.396 [rank:7] [train], epoch: 3/50, iter: 400/834, loss: 0.48060, top1: 0.34734, throughput: 1299.83 | 2022-04-10 23:08:47.395 [rank:1] [train], epoch: 3/50, iter: 400/834, loss: 0.48205, top1: 0.33995, throughput: 1299.66 | 2022-04-10 23:08:47.399 [rank:0] [train], epoch: 3/50, iter: 400/834, loss: 0.48180, top1: 0.34302, throughput: 1299.60 | 2022-04-10 23:08:47.399 [rank:1] [train], epoch: 3/50, iter: 500/834, loss: 0.47882, top1: 0.34609, throughput: 1295.49 | 2022-04-10 23:09:02.220 [rank:2] [train], epoch: 3/50, iter: 500/834, loss: 0.47724, top1: 0.34948, throughput: 1295.35 | 2022-04-10 23:09:02.216 [rank:6] [train], epoch: 3/50, iter: 500/834, loss: 0.48050, top1: 0.34578, throughput: 1295.34 | 2022-04-10 23:09:02.216 [rank:5] [train], epoch: 3/50, iter: 500/834, loss: 0.47689, top1: 0.34953, throughput: 1295.39 | 2022-04-10 23:09:02.217 [rank:0] [train], epoch: 3/50, iter: 500/834, loss: 0.48033, top1: 0.34255, throughput: 1295.45 | 2022-04-10 23:09:02.220 [rank:4] [train], epoch: 3/50, iter: 500/834, loss: 0.47877, top1: 0.34510, throughput: 1295.19[rank:7] [train], epoch: 3/50, iter: 500/834, loss: 0.47699, top1: 0.34760, throughput: 1295.03 | 2022-04-10 23:09:02.221 | 2022-04-10 23:09:02.218 [rank:3] [train], epoch: 3/50, iter: 500/834, 
loss: 0.47891, top1: 0.34531, throughput: 1295.15 | 2022-04-10 23:09:02.220 [rank:4] [train], epoch: 3/50, iter: 600/834, loss: 0.47718, top1: 0.34745, throughput: 1300.70 | 2022-04-10 23:09:16.979 [rank:6] [train], epoch: 3/50, iter: 600/834, loss: 0.47359, top1: 0.35589, throughput: 1300.54 | 2022-04-10 23:09:16.979 [rank:5] [train], epoch: 3/50, iter: 600/834, loss: 0.47678, top1: 0.35062, throughput: 1300.61 | 2022-04-10 23:09:16.980 [rank:2] [train], epoch: 3/50, iter: 600/834, loss: 0.47340, top1: 0.35266, throughput: 1300.47 | 2022-04-10 23:09:16.980 [rank:3] [train], epoch: 3/50, iter: 600/834, loss: 0.47748, top1: 0.34401, throughput: 1300.69 | 2022-04-10 23:09:16.982 [rank:1] [train], epoch: 3/50, iter: 600/834, loss: 0.47378, top1: 0.35187, throughput: 1300.65 | 2022-04-10 23:09:16.982 [rank:7] [train], epoch: 3/50, iter: 600/834, loss: 0.47277, top1: 0.35458, throughput: 1300.78 | 2022-04-10 23:09:16.981 [rank:0] [train], epoch: 3/50, iter: 600/834, loss: 0.47400, top1: 0.35583, throughput: 1300.53 | 2022-04-10 23:09:16.983 [rank:1] [train], epoch: 3/50, iter: 700/834, loss: 0.46793, top1: 0.36203, throughput: 1313.38 | 2022-04-10 23:09:31.600 [rank:6] [train], epoch: 3/50, iter: 700/834, loss: 0.47194, top1: 0.35484, throughput: 1313.24 | 2022-04-10 23:09:31.600 [rank:5] [train], epoch: 3/50, iter: 700/834, loss: 0.47076, top1: 0.35995, throughput: 1313.27 | 2022-04-10 23:09:31.600 [rank:2] [train], epoch: 3/50, iter: 700/834, loss: 0.47081, top1: 0.35724, throughput: 1313.33 | 2022-04-10 23:09:31.600 [rank:7] [train], epoch: 3/50, iter: 700/834, loss: 0.46902, top1: 0.36276, throughput: 1313.21 | 2022-04-10 23:09:31.602 [rank:3] [train], epoch: 3/50, iter: 700/834, loss: 0.46809, top1: 0.35984, throughput: 1313.38 | 2022-04-10 23:09:31.601 [rank:0] [train], epoch: 3/50, iter: 700/834, loss: 0.47287, top1: 0.35526, throughput: 1313.33 | 2022-04-10 23:09:31.603 [rank:4] [train], epoch: 3/50, iter: 700/834, loss: 0.47329, top1: 0.35766, throughput: 
1313.16 | 2022-04-10 23:09:31.601 [rank:4] [train], epoch: 3/50, iter: 800/834, loss: 0.46568, top1: 0.37047, throughput: 1304.62 | 2022-04-10 23:09:46.318 [rank:3] [train], epoch: 3/50, iter: 800/834, loss: 0.46349, top1: 0.37089, throughput: 1304.42 | 2022-04-10 23:09:46.320 [rank:2] [train], epoch: 3/50, iter: 800/834, loss: 0.46412, top1: 0.37109, throughput: 1304.35 | 2022-04-10 23:09:46.320 [rank:5] [train], epoch: 3/50, iter: 800/834, loss: 0.46418, top1: 0.36672, throughput: 1304.32 | 2022-04-10 23:09:46.320 [rank:6] [train], epoch: 3/50, iter: 800/834, loss: 0.46475, top1: 0.36359, throughput: 1304.48 | 2022-04-10 23:09:46.318 [rank:1] [train], epoch: 3/50, iter: 800/834, loss: 0.46398, top1: 0.36958, throughput: 1304.20 | 2022-04-10 23:09:46.322 [rank:0] [train], epoch: 3/50, iter: 800/834, loss: 0.46287, top1: 0.37313, throughput: 1304.53 | 2022-04-10 23:09:46.321 [rank:7] [train], epoch: 3/50, iter: 800/834, loss: 0.46748, top1: 0.36339, throughput: 1304.41 | 2022-04-10 23:09:46.321 [rank:4] [train], epoch: 3/50, iter: 834/834, loss: 0.46419, top1: 0.37255, throughput: 1293.51 | 2022-04-10 23:09:51.364 [rank:6] [train], epoch: 3/50, iter: 834/834, loss: 0.46474, top1: 0.36581, throughput: 1293.70 | 2022-04-10 23:09:51.364 [rank:7] [train], epoch: 3/50, iter: 834/834, loss: 0.46214, top1: 0.37515, throughput: 1294.32 | 2022-04-10 23:09:51.364 [rank:2] [train], epoch: 3/50, iter: 834/834, loss: 0.45937, top1: 0.38097, throughput: 1293.88 | 2022-04-10 23:09:51.365 [rank:5] [train], epoch: 3/50, iter: 834/834, loss: 0.45948, top1: 0.37990, throughput: 1294.00 | 2022-04-10 23:09:51.365 [rank:3] [train], epoch: 3/50, iter: 834/834, loss: 0.46371, top1: 0.37362, throughput: 1293.72 | 2022-04-10 23:09:51.366 [rank:1] [train], epoch: 3/50, iter: 834/834, loss: 0.46130, top1: 0.37362, throughput: 1294.35 | 2022-04-10 23:09:51.365 [rank:0] [train], epoch: 3/50, iter: 834/834, loss: 0.46126, top1: 0.37669, throughput: 1294.00 | 2022-04-10 23:09:51.366 [rank:7] 
[eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.34128, throughput: 583.78 | 2022-04-10 23:10:02.071 [rank:0] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.35344, throughput: 579.89 | 2022-04-10 23:10:02.143 [rank:2] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.35056, throughput: 576.67 | 2022-04-10 23:10:02.203 [rank:6] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.34384, throughput: 575.17 | 2022-04-10 23:10:02.231 [rank:5] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.32560, throughput: 575.09 | 2022-04-10 23:10:02.233 [rank:3] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.33392, throughput: 574.36 | 2022-04-10 23:10:02.247 [rank:4] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.34176, throughput: 571.63 | 2022-04-10 23:10:02.298 [rank:1] [eval], epoch: 3/50, iter: 125/125, loss: 0.00000, top1: 0.34320, throughput: 568.38 | 2022-04-10 23:10:02.362 [rank:6] [train], epoch: 4/50, iter: 100/834, loss: 0.46000, top1: 0.37599, throughput: 1305.90 | 2022-04-10 23:10:16.933 [rank:5] [train], epoch: 4/50, iter: 100/834, loss: 0.45324, top1: 0.38464, throughput: 1305.95 | 2022-04-10 23:10:16.935 [rank:2] [train], epoch: 4/50, iter: 100/834, loss: 0.45244, top1: 0.38745, throughput: 1303.28 | 2022-04-10 23:10:16.935 [rank:7] [train], epoch: 4/50, iter: 100/834, loss: 0.45776, top1: 0.37526, throughput: 1291.69 | 2022-04-10 23:10:16.935 [rank:3] [train], epoch: 4/50, iter: 100/834, loss: 0.45370, top1: 0.38526, throughput: 1307.01 | 2022-04-10 23:10:16.937 [rank:1] [train], epoch: 4/50, iter: 100/834, loss: 0.45632, top1: 0.37880, throughput: 1317.19 | 2022-04-10 23:10:16.938 [rank:0] [train], epoch: 4/50, iter: 100/834, loss: 0.45686, top1: 0.38312, throughput: 1297.81 | 2022-04-10 23:10:16.938 [rank:4] [train], epoch: 4/50, iter: 100/834, loss: 0.45571, top1: 0.37682, throughput: 1311.58 | 2022-04-10 23:10:16.937 [rank:5] [train], epoch: 4/50, iter: 200/834, loss: 0.45255, top1: 
0.38932, throughput: 1306.28 | 2022-04-10 23:10:31.633 [rank:1] [train], epoch: 4/50, iter: 200/834, loss: 0.45323, top1: 0.38599, throughput: 1306.48 | 2022-04-10 23:10:31.634 [rank:4] [train], epoch: 4/50, iter: 200/834, loss: 0.45440, top1: 0.38005, throughput: 1306.23 | 2022-04-10 23:10:31.636 [rank:2] [train], epoch: 4/50, iter: 200/834, loss: 0.45269, top1: 0.38401, throughput: 1306.31 | 2022-04-10 23:10:31.633 [rank:7] [train], epoch: 4/50, iter: 200/834, loss: 0.45349, top1: 0.38839, throughput: 1306.08 | 2022-04-10 23:10:31.635 [rank:6] [train], epoch: 4/50, iter: 200/834, loss: 0.45444, top1: 0.38724, throughput: 1306.03[rank:0] [train], epoch: 4/50, iter: 200/834, loss: 0.45006, top1: 0.38828, throughput: 1306.24 | 2022-04-10 23:10:31.636 | 2022-04-10 23:10:31.634 [rank:3] [train], epoch: 4/50, iter: 200/834, loss: 0.45207, top1: 0.38698, throughput: 1306.10 | 2022-04-10 23:10:31.638 [rank:2] [train], epoch: 4/50, iter: 300/834, loss: 0.44978, top1: 0.38943, throughput: 1313.17 | 2022-04-10 23:10:46.254 [rank:5] [train], epoch: 4/50, iter: 300/834, loss: 0.44923, top1: 0.39328, throughput: 1313.23 | 2022-04-10 23:10:46.253 [rank:0] [train], epoch: 4/50, iter: 300/834, loss: 0.44866, top1: 0.39443, throughput: 1313.24 | 2022-04-10 23:10:46.257 [rank:7] [train], epoch: 4/50, iter: 300/834, loss: 0.45072, top1: 0.39172, throughput: 1313.25 | 2022-04-10 23:10:46.256 [rank:3] [train], epoch: 4/50, iter: 300/834, loss: 0.45080, top1: 0.39245, throughput: 1313.38 | 2022-04-10 23:10:46.256 [rank:6] [train], epoch: 4/50, iter: 300/834, loss: 0.44805, top1: 0.39391, throughput: 1313.14 | 2022-04-10 23:10:46.256 [rank:4] [train], epoch: 4/50, iter: 300/834, loss: 0.44857, top1: 0.39026, throughput: 1313.17 | 2022-04-10 23:10:46.257 [rank:1] [train], epoch: 4/50, iter: 300/834, loss: 0.44764, top1: 0.39365, throughput: 1312.91 | 2022-04-10 23:10:46.258 [rank:4] [train], epoch: 4/50, iter: 400/834, loss: 0.44599, top1: 0.39562, throughput: 1306.90 | 2022-04-10 
23:11:00.948 [rank:5] [train], epoch: 4/50, iter: 400/834, loss: 0.44603, top1: 0.39833, throughput: 1306.60 | 2022-04-10 23:11:00.948 [rank:6] [train], epoch: 4/50, iter: 400/834, loss: 0.44838, top1: 0.39151, throughput: 1306.62[rank:1] [train], epoch: 4/50, iter: 400/834, loss: 0.44884, top1: 0.39682, throughput: 1306.89 | 2022-04-10 23:11:00.950 | 2022-04-10 23:11:00.949 [rank:3] [train], epoch: 4/50, iter: 400/834, loss: 0.44672, top1: 0.39849, throughput: 1306.47 | 2022-04-10 23:11:00.952 [rank:0] [train], epoch: 4/50, iter: 400/834, loss: 0.44733, top1: 0.39115, throughput: 1306.68[rank:7] [train], epoch: 4/50, iter: 400/834, loss: 0.44534, top1: 0.39599, throughput: 1306.64 | 2022-04-10 23:11:00.950 | 2022-04-10 23:11:00.950 [rank:2] [train], epoch: 4/50, iter: 400/834, loss: 0.44852, top1: 0.39552, throughput: 1306.33 | 2022-04-10 23:11:00.952 [rank:2] [train], epoch: 4/50, iter: 500/834, loss: 0.44515, top1: 0.39536, throughput: 1308.27 | 2022-04-10 23:11:15.628 [rank:4] [train], epoch: 4/50, iter: 500/834, loss: 0.44729, top1: 0.39422, throughput: 1307.97 | 2022-04-10 23:11:15.627 [rank:7] [train], epoch: 4/50, iter: 500/834, loss: 0.44759, top1: 0.39500, throughput: 1307.87 | 2022-04-10 23:11:15.630 [rank:6] [train], epoch: 4/50, iter: 500/834, loss: 0.44434, top1: 0.39589, throughput: 1308.15 | 2022-04-10 23:11:15.627 [rank:5] [train], epoch: 4/50, iter: 500/834, loss: 0.44539, top1: 0.39552, throughput: 1307.82 | 2022-04-10 23:11:15.629 [rank:0] [train], epoch: 4/50, iter: 500/834, loss: 0.44559, top1: 0.39203, throughput: 1307.88 | 2022-04-10 23:11:15.631 [rank:3] [train], epoch: 4/50, iter: 500/834, loss: 0.44319, top1: 0.40365, throughput: 1308.22 | 2022-04-10 23:11:15.629 [rank:1] [train], epoch: 4/50, iter: 500/834, loss: 0.44486, top1: 0.39703, throughput: 1307.84 | 2022-04-10 23:11:15.630 [rank:4] [train], epoch: 4/50, iter: 600/834, loss: 0.44204, top1: 0.40125, throughput: 1304.48 | 2022-04-10 23:11:30.346 [rank:6] [train], epoch: 4/50, iter: 
600/834, loss: 0.44230, top1: 0.40021, throughput: 1304.49 | 2022-04-10 23:11:30.346 [rank:1] [train], epoch: 4/50, iter: 600/834, loss: 0.44097, top1: 0.40307, throughput: 1304.54 | 2022-04-10 23:11:30.348 [rank:5] [train], epoch: 4/50, iter: 600/834, loss: 0.44140, top1: 0.40115, throughput: 1304.40 | 2022-04-10 23:11:30.348 [rank:3] [train], epoch: 4/50, iter: 600/834, loss: 0.44244, top1: 0.40167, throughput: 1304.36 | 2022-04-10 23:11:30.349 [rank:2] [train], epoch: 4/50, iter: 600/834, loss: 0.44272, top1: 0.40068, throughput: 1304.44 | 2022-04-10 23:11:30.347 [rank:0] [train], epoch: 4/50, iter: 600/834, loss: 0.44361, top1: 0.40026, throughput: 1304.43 | 2022-04-10 23:11:30.350 [rank:7] [train], epoch: 4/50, iter: 600/834, loss: 0.44177, top1: 0.39974, throughput: 1304.39 | 2022-04-10 23:11:30.350 [rank:2] [train], epoch: 4/50, iter: 700/834, loss: 0.43769, top1: 0.40917, throughput: 1313.28 | 2022-04-10 23:11:44.967 [rank:7] [train], epoch: 4/50, iter: 700/834, loss: 0.44044, top1: 0.40693, throughput: 1313.40 | 2022-04-10 23:11:44.968 [rank:4] [train], epoch: 4/50, iter: 700/834, loss: 0.43757, top1: 0.40776, throughput: 1313.19 | 2022-04-10 23:11:44.967 [rank:5] [train], epoch: 4/50, iter: 700/834, loss: 0.43846, top1: 0.40161, throughput: 1313.41 | 2022-04-10 23:11:44.967 [rank:1] [train], epoch: 4/50, iter: 700/834, loss: 0.43987, top1: 0.40646, throughput: 1313.28 | 2022-04-10 23:11:44.968 [rank:0] [train], epoch: 4/50, iter: 700/834, loss: 0.43666, top1: 0.41219, throughput: 1313.28 | 2022-04-10 23:11:44.970 [rank:6] [train], epoch: 4/50, iter: 700/834, loss: 0.43900, top1: 0.40583, throughput: 1313.07 | 2022-04-10 23:11:44.968 [rank:3] [train], epoch: 4/50, iter: 700/834, loss: 0.43812, top1: 0.41177, throughput: 1313.09 | 2022-04-10 23:11:44.971 [rank:4] [train], epoch: 4/50, iter: 800/834, loss: 0.43635, top1: 0.41172, throughput: 1308.79 | 2022-04-10 23:11:59.637 [rank:5] [train], epoch: 4/50, iter: 800/834, loss: 0.43303, top1: 0.41536, 
throughput: 1308.77 | 2022-04-10 23:11:59.637 [rank:6] [train], epoch: 4/50, iter: 800/834, loss: 0.43620, top1: 0.41198, throughput: 1308.75 | 2022-04-10 23:11:59.638 [rank:1] [train], epoch: 4/50, iter: 800/834, loss: 0.43732, top1: 0.40521, throughput: 1308.52 | 2022-04-10 23:11:59.641 [rank:3] [train], epoch: 4/50, iter: 800/834, loss: 0.43602, top1: 0.40969, throughput: 1308.66 | 2022-04-10 23:11:59.642 [rank:7] [train], epoch: 4/50, iter: 800/834, loss: 0.43846, top1: 0.40776, throughput: 1308.75 | 2022-04-10 23:11:59.639 [rank:2] [train], epoch: 4/50, iter: 800/834, loss: 0.43584, top1: 0.40922, throughput: 1308.32 | 2022-04-10 23:11:59.642 [rank:0] [train], epoch: 4/50, iter: 800/834, loss: 0.43440, top1: 0.41193, throughput: 1308.71 | 2022-04-10 23:11:59.640 [rank:6] [train], epoch: 4/50, iter: 834/834, loss: 0.43492, top1: 0.41008, throughput: 1308.02 | 2022-04-10 23:12:04.629 [rank:2] [train], epoch: 4/50, iter: 834/834, loss: 0.43103, top1: 0.41161, throughput: 1308.63 | 2022-04-10 23:12:04.630 [rank:1] [train], epoch: 4/50, iter: 834/834, loss: 0.43424, top1: 0.41559, throughput: 1308.03 | 2022-04-10 23:12:04.632 [rank:5] [train], epoch: 4/50, iter: 834/834, loss: 0.43503, top1: 0.40763, throughput: 1306.99 | 2022-04-10 23:12:04.632 [rank:0] [train], epoch: 4/50, iter: 834/834, loss: 0.43190, top1: 0.42187, throughput: 1307.91 | 2022-04-10 23:12:04.632 [rank:3] [train], epoch: 4/50, iter: 834/834, loss: 0.43724, top1: 0.40916, throughput: 1308.31 | 2022-04-10 23:12:04.632 [rank:4] [train], epoch: 4/50, iter: 834/834, loss: 0.43188, top1: 0.42142, throughput: 1306.79 | 2022-04-10 23:12:04.632 [rank:7] [train], epoch: 4/50, iter: 834/834, loss: 0.43687, top1: 0.40962, throughput: 1306.94 | 2022-04-10 23:12:04.634 [rank:0] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.39776, throughput: 567.67 | 2022-04-10 23:12:15.642 [rank:7] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.38896, throughput: 564.30 | 2022-04-10 23:12:15.709 
[rank:5] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.39024, throughput: 560.61 | 2022-04-10 23:12:15.780 [rank:2] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.40064, throughput: 559.47 | 2022-04-10 23:12:15.802 [rank:6] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.39696, throughput: 559.35 | 2022-04-10 23:12:15.803 [rank:3] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.38880, throughput: 557.74 | 2022-04-10 23:12:15.838 [rank:1] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.40416, throughput: 555.86 | 2022-04-10 23:12:15.875 [rank:4] [eval], epoch: 4/50, iter: 125/125, loss: 0.00000, top1: 0.38352, throughput: 553.62 | 2022-04-10 23:12:15.921 [rank:5] [train], epoch: 5/50, iter: 100/834, loss: 0.42996, top1: 0.41687, throughput: 1305.41 | 2022-04-10 23:12:30.488 [rank:6] [train], epoch: 5/50, iter: 100/834, loss: 0.42688, top1: 0.42729, throughput: 1307.40 | 2022-04-10 23:12:30.489 [rank:0] [train], epoch: 5/50, iter: 100/834, loss: 0.42724, top1: 0.42266, throughput: 1293.16 | 2022-04-10 23:12:30.489 [rank:7] [train], epoch: 5/50, iter: 100/834, loss: 0.42668, top1: 0.42786, throughput: 1299.02 | 2022-04-10 23:12:30.490 [rank:3] [train], epoch: 5/50, iter: 100/834, loss: 0.43095, top1: 0.42000, throughput: 1310.27 | 2022-04-10 23:12:30.491 [rank:4] [train], epoch: 5/50, iter: 100/834, loss: 0.43154, top1: 0.41995, throughput: 1317.70 | 2022-04-10 23:12:30.492 [rank:1] [train], epoch: 5/50, iter: 100/834, loss: 0.42923, top1: 0.42297, throughput: 1313.43 | 2022-04-10 23:12:30.494 [rank:2] [train], epoch: 5/50, iter: 100/834, loss: 0.43014, top1: 0.42375, throughput: 1306.80 | 2022-04-10 23:12:30.494 [rank:5] [train], epoch: 5/50, iter: 200/834, loss: 0.43082, top1: 0.41792, throughput: 1304.35 | 2022-04-10 23:12:45.208 [rank:4] [train], epoch: 5/50, iter: 200/834, loss: 0.42488, top1: 0.42568, throughput: 1304.70 | 2022-04-10 23:12:45.208 [rank:6] [train], epoch: 5/50, iter: 200/834, loss: 0.42656, 
top1: 0.42589, throughput: 1304.37 | 2022-04-10 23:12:45.208 [rank:1] [train], epoch: 5/50, iter: 200/834, loss: 0.42864, top1: 0.42385, throughput: 1304.59 | 2022-04-10 23:12:45.211 [rank:2] [train], epoch: 5/50, iter: 200/834, loss: 0.43010, top1: 0.42099, throughput: 1304.77 | 2022-04-10 23:12:45.209 [rank:7] [train], epoch: 5/50, iter: 200/834, loss: 0.42989, top1: 0.42318, throughput: 1304.11 | 2022-04-10 23:12:45.212 [rank:3] [train], epoch: 5/50, iter: 200/834, loss: 0.42438, top1: 0.43010, throughput: 1304.34 | 2022-04-10 23:12:45.211 [rank:0] [train], epoch: 5/50, iter: 200/834, loss: 0.42668, top1: 0.42687, throughput: 1304.24 | 2022-04-10 23:12:45.210 [rank:7] [train], epoch: 5/50, iter: 300/834, loss: 0.42417, top1: 0.42974, throughput: 1303.86 | 2022-04-10 23:12:59.938 [rank:0] [train], epoch: 5/50, iter: 300/834, loss: 0.42614, top1: 0.42479, throughput: 1303.65 | 2022-04-10 23:12:59.938 [rank:5] [train], epoch: 5/50, iter: 300/834, loss: 0.42444, top1: 0.42776, throughput: 1303.55 | 2022-04-10 23:12:59.937 [rank:1] [train], epoch: 5/50, iter: 300/834, loss: 0.42487, top1: 0.42953, throughput: 1303.76 | 2022-04-10 23:12:59.938 [rank:6] [train], epoch: 5/50, iter: 300/834, loss: 0.42418, top1: 0.42750, throughput: 1303.46 | 2022-04-10 23:12:59.938 [rank:4] [train], epoch: 5/50, iter: 300/834, loss: 0.42410, top1: 0.42521, throughput: 1303.46 | 2022-04-10 23:12:59.938 [rank:2] [train], epoch: 5/50, iter: 300/834, loss: 0.42611, top1: 0.42615, throughput: 1303.54 | 2022-04-10 23:12:59.938 [rank:3] [train], epoch: 5/50, iter: 300/834, loss: 0.42260, top1: 0.42792, throughput: 1303.68 | 2022-04-10 23:12:59.939 [rank:5] [train], epoch: 5/50, iter: 400/834, loss: 0.42008, top1: 0.43229, throughput: 1313.02 | 2022-04-10 23:13:14.560 [rank:1] [train], epoch: 5/50, iter: 400/834, loss: 0.41996, top1: 0.43724, throughput: 1312.90 | 2022-04-10 23:13:14.562 [rank:6] [train], epoch: 5/50, iter: 400/834, loss: 0.42283, top1: 0.42875, throughput: 1313.02 | 2022-04-10 
23:13:14.561 [rank:2] [train], epoch: 5/50, iter: 400/834, loss: 0.42027, top1: 0.43344, throughput: 1312.88 | 2022-04-10 23:13:14.563 [rank:3] [train], epoch: 5/50, iter: 400/834, loss: 0.41897, top1: 0.43750, throughput: 1312.77 | 2022-04-10 23:13:14.565 [rank:4] [train], epoch: 5/50, iter: 400/834, loss: 0.42362, top1: 0.43474, throughput: 1312.67 | 2022-04-10 23:13:14.565 [rank:0] [train], epoch: 5/50, iter: 400/834, loss: 0.42078, top1: 0.43333, throughput: 1312.69 | 2022-04-10 23:13:14.564 [rank:7] [train], epoch: 5/50, iter: 400/834, loss: 0.42246, top1: 0.43385, throughput: 1312.52 | 2022-04-10 23:13:14.566 [rank:7] [train], epoch: 5/50, iter: 500/834, loss: 0.41755, top1: 0.44078, throughput: 1304.71 | 2022-04-10 23:13:29.282 [rank:5] [train], epoch: 5/50, iter: 500/834, loss: 0.42024, top1: 0.44021, throughput: 1304.27 | 2022-04-10 23:13:29.281 [rank:2] [train], epoch: 5/50, iter: 500/834, loss: 0.41784, top1: 0.43708, throughput: 1304.47 | 2022-04-10 23:13:29.281 [rank:0] [train], epoch: 5/50, iter: 500/834, loss: 0.41791, top1: 0.43708, throughput: 1304.42 | 2022-04-10 23:13:29.284 [rank:6] [train], epoch: 5/50, iter: 500/834, loss: 0.41912, top1: 0.43797, throughput: 1304.31 | 2022-04-10 23:13:29.281 [rank:4] [train], epoch: 5/50, iter: 500/834, loss: 0.41984, top1: 0.43802, throughput: 1304.67 | 2022-04-10 23:13:29.281 [rank:1] [train], epoch: 5/50, iter: 500/834, loss: 0.41946, top1: 0.43958, throughput: 1304.28 | 2022-04-10 23:13:29.282 [rank:3] [train], epoch: 5/50, iter: 500/834, loss: 0.42263, top1: 0.43208, throughput: 1304.39 | 2022-04-10 23:13:29.284 [rank:4] [train], epoch: 5/50, iter: 600/834, loss: 0.41753, top1: 0.43958, throughput: 1313.28 | 2022-04-10 23:13:43.901 [rank:5] [train], epoch: 5/50, iter: 600/834, loss: 0.41463, top1: 0.44531, throughput: 1313.25 | 2022-04-10 23:13:43.901 [rank:1] [train], epoch: 5/50, iter: 600/834, loss: 0.41419, top1: 0.45109, throughput: 1313.21 | 2022-04-10 23:13:43.903 [rank:3] [train], epoch: 5/50, 
iter: 600/834, loss: 0.41587, top1: 0.44786, throughput: 1313.34 | 2022-04-10 23:13:43.903 [rank:6] [train], epoch: 5/50, iter: 600/834, loss: 0.41287, top1: 0.44745, throughput: 1313.36 | 2022-04-10 23:13:43.900 [rank:2] [train], epoch: 5/50, iter: 600/834, loss: 0.41946, top1: 0.43870, throughput: 1313.12 | 2022-04-10 23:13:43.903 [rank:0] [train], epoch: 5/50, iter: 600/834, loss: 0.41602, top1: 0.44375, throughput: 1313.20 | 2022-04-10 23:13:43.904 [rank:7] [train], epoch: 5/50, iter: 600/834, loss: 0.41669, top1: 0.44260, throughput: 1313.24 | 2022-04-10 23:13:43.902 [rank:7] [train], epoch: 5/50, iter: 700/834, loss: 0.41480, top1: 0.44656, throughput: 1316.11 | 2022-04-10 23:13:58.491 [rank:4] [train], epoch: 5/50, iter: 700/834, loss: 0.41381, top1: 0.45073, throughput: 1316.15 | 2022-04-10 23:13:58.489 [rank:0] [train], epoch: 5/50, iter: 700/834, loss: 0.41629, top1: 0.43969, throughput: 1316.12 | 2022-04-10 23:13:58.493 [rank:2] [train], epoch: 5/50, iter: 700/834, loss: 0.41350, top1: 0.44917, throughput: 1316.20 | 2022-04-10 23:13:58.490 [rank:1] [train], epoch: 5/50, iter: 700/834, loss: 0.41241, top1: 0.44932, throughput: 1316.07 | 2022-04-10 23:13:58.492 [rank:5] [train], epoch: 5/50, iter: 700/834, loss: 0.41474, top1: 0.44776, throughput: 1316.02 | 2022-04-10 23:13:58.491 [rank:6] [train], epoch: 5/50, iter: 700/834, loss: 0.41599, top1: 0.44818, throughput: 1315.83 | 2022-04-10 23:13:58.492 [rank:3] [train], epoch: 5/50, iter: 700/834, loss: 0.41267, top1: 0.44891, throughput: 1316.16 | 2022-04-10 23:13:58.491 [rank:6] [train], epoch: 5/50, iter: 800/834, loss: 0.41168, top1: 0.45104, throughput: 1303.67 | 2022-04-10 23:14:13.219 [rank:5] [train], epoch: 5/50, iter: 800/834, loss: 0.41286, top1: 0.45240, throughput: 1303.56 | 2022-04-10 23:14:13.220 [rank:1] [train], epoch: 5/50, iter: 800/834, loss: 0.41393, top1: 0.44563, throughput: 1303.51 | 2022-04-10 23:14:13.221 [rank:4] [train], epoch: 5/50, iter: 800/834, loss: 0.41190, top1: 0.45391, 
throughput: 1303.31 | 2022-04-10 23:14:13.221 [rank:3] [train], epoch: 5/50, iter: 800/834, loss: 0.41426, top1: 0.44620, throughput: 1303.38 | 2022-04-10 23:14:13.222 [rank:2] [train], epoch: 5/50, iter: 800/834, loss: 0.41030, top1: 0.44745, throughput: 1303.29 | 2022-04-10 23:14:13.222 [rank:0] [train], epoch: 5/50, iter: 800/834, loss: 0.41310, top1: 0.45146, throughput: 1303.54 | 2022-04-10 23:14:13.222 [rank:7] [train], epoch: 5/50, iter: 800/834, loss: 0.40998, top1: 0.45667, throughput: 1303.46 | 2022-04-10 23:14:13.221 [rank:5] [train], epoch: 5/50, iter: 834/834, loss: 0.41301, top1: 0.45496, throughput: 1289.42 | 2022-04-10 23:14:18.282 [rank:4] [train], epoch: 5/50, iter: 834/834, loss: 0.40924, top1: 0.46078, throughput: 1289.55 | 2022-04-10 23:14:18.283 [rank:6] [train], epoch: 5/50, iter: 834/834, loss: 0.41058, top1: 0.45542, throughput: 1289.20 | 2022-04-10 23:14:18.283 [rank:2] [train], epoch: 5/50, iter: 834/834, loss: 0.40804, top1: 0.45650, throughput: 1289.54 | 2022-04-10 23:14:18.285 [rank:0] [train], epoch: 5/50, iter: 834/834, loss: 0.40216, top1: 0.46952, throughput: 1289.47 | 2022-04-10 23:14:18.284 [rank:1] [train], epoch: 5/50, iter: 834/834, loss: 0.41047, top1: 0.45236, throughput: 1289.11 | 2022-04-10 23:14:18.285 [rank:3] [train], epoch: 5/50, iter: 834/834, loss: 0.40709, top1: 0.45588, throughput: 1289.21 | 2022-04-10 23:14:18.286 [rank:7] [train], epoch: 5/50, iter: 834/834, loss: 0.40912, top1: 0.46109, throughput: 1289.02 | 2022-04-10 23:14:18.285 [rank:7] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.45536, throughput: 570.76 | 2022-04-10 23:14:29.235 [rank:0] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.45520, throughput: 570.13 | 2022-04-10 23:14:29.247 [rank:4] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.43440, throughput: 567.87 | 2022-04-10 23:14:29.289 [rank:6] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.44352, throughput: 565.01 | 2022-04-10 23:14:29.345 [rank:2] 
[eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.44736, throughput: 563.00 | 2022-04-10 23:14:29.386 [rank:3] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.43232, throughput: 561.78 | 2022-04-10 23:14:29.411 [rank:1] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.44848, throughput: 558.45 | 2022-04-10 23:14:29.477 [rank:5] [eval], epoch: 5/50, iter: 125/125, loss: 0.00000, top1: 0.43568, throughput: 555.48 | 2022-04-10 23:14:29.534 [rank:6] [train], epoch: 6/50, iter: 100/834, loss: 0.40640, top1: 0.46302, throughput: 1302.77 | 2022-04-10 23:14:44.083 [rank:4] [train], epoch: 6/50, iter: 100/834, loss: 0.40577, top1: 0.46292, throughput: 1297.92 | 2022-04-10 23:14:44.082 [rank:7] [train], epoch: 6/50, iter: 100/834, loss: 0.40433, top1: 0.46823, throughput: 1293.11 | 2022-04-10 23:14:44.083 [rank:2] [train], epoch: 6/50, iter: 100/834, loss: 0.40552, top1: 0.45870, throughput: 1306.36 | 2022-04-10 23:14:44.083 [rank:3] [train], epoch: 6/50, iter: 100/834, loss: 0.40283, top1: 0.46635, throughput: 1308.44 | 2022-04-10 23:14:44.085 [rank:5] [train], epoch: 6/50, iter: 100/834, loss: 0.40493, top1: 0.46229, throughput: 1319.52 | 2022-04-10 23:14:44.084 [rank:0] [train], epoch: 6/50, iter: 100/834, loss: 0.40679, top1: 0.46219, throughput: 1293.90 | 2022-04-10 23:14:44.086 [rank:1] [train], epoch: 6/50, iter: 100/834, loss: 0.40622, top1: 0.46016, throughput: 1314.23 | 2022-04-10 23:14:44.086 [rank:4] [train], epoch: 6/50, iter: 200/834, loss: 0.40649, top1: 0.46339, throughput: 1311.28 | 2022-04-10 23:14:58.724 [rank:5] [train], epoch: 6/50, iter: 200/834, loss: 0.40442, top1: 0.46656, throughput: 1311.51 | 2022-04-10 23:14:58.724 [rank:6] [train], epoch: 6/50, iter: 200/834, loss: 0.40462, top1: 0.46620, throughput: 1311.40 | 2022-04-10 23:14:58.723 [rank:1] [train], epoch: 6/50, iter: 200/834, loss: 0.40533, top1: 0.46057, throughput: 1311.47 | 2022-04-10 23:14:58.726 [rank:3] [train], epoch: 6/50, iter: 200/834, loss: 0.40402, 
top1: 0.46182, throughput: 1311.34 | 2022-04-10 23:14:58.727 [rank:7] [train], epoch: 6/50, iter: 200/834, loss: 0.40664, top1: 0.46036, throughput: 1311.38 | 2022-04-10 23:14:58.724 [rank:2] [train], epoch: 6/50, iter: 200/834, loss: 0.40455, top1: 0.46083, throughput: 1311.28 | 2022-04-10 23:14:58.726 [rank:0] [train], epoch: 6/50, iter: 200/834, loss: 0.40365, top1: 0.47010, throughput: 1311.41 | 2022-04-10 23:14:58.727 [rank:4] [train], epoch: 6/50, iter: 300/834, loss: 0.40129, top1: 0.46901, throughput: 1311.57 | 2022-04-10 23:15:13.363 [rank:0] [train], epoch: 6/50, iter: 300/834, loss: 0.40228, top1: 0.46693, throughput: 1311.42 | 2022-04-10 23:15:13.367 [rank:2] [train], epoch: 6/50, iter: 300/834, loss: 0.40351, top1: 0.46474, throughput: 1311.50 | 2022-04-10 23:15:13.365 [rank:5] [train], epoch: 6/50, iter: 300/834, loss: 0.40304, top1: 0.46370, throughput: 1311.23 | 2022-04-10 23:15:13.367 [rank:6] [train], epoch: 6/50, iter: 300/834, loss: 0.40377, top1: 0.46292, throughput: 1311.16 | 2022-04-10 23:15:13.367 [rank:7] [train], epoch: 6/50, iter: 300/834, loss: 0.40148, top1: 0.47073, throughput: 1310.94 | 2022-04-10 23:15:13.370 [rank:3] [train], epoch: 6/50, iter: 300/834, loss: 0.40447, top1: 0.46333, throughput: 1311.34 | 2022-04-10 23:15:13.368 [rank:1] [train], epoch: 6/50, iter: 300/834, loss: 0.40474, top1: 0.46203, throughput: 1311.32 | 2022-04-10 23:15:13.368 [rank:4] [train], epoch: 6/50, iter: 400/834, loss: 0.40311, top1: 0.46599, throughput: 1297.57 | 2022-04-10 23:15:28.160 [rank:5] [train], epoch: 6/50, iter: 400/834, loss: 0.40252, top1: 0.47068, throughput: 1297.78 | 2022-04-10 23:15:28.161 [rank:6] [train], epoch: 6/50, iter: 400/834, loss: 0.40252, top1: 0.46333, throughput: 1297.74 | 2022-04-10 23:15:28.162 [rank:2] [train], epoch: 6/50, iter: 400/834, loss: 0.40237, top1: 0.46760, throughput: 1297.61 | 2022-04-10 23:15:28.162 [rank:1] [train], epoch: 6/50, iter: 400/834, loss: 0.40400, top1: 0.45687, throughput: 1297.82 | 2022-04-10 
23:15:28.162 [rank:3] [train], epoch: 6/50, iter: 400/834, loss: 0.40177, top1: 0.46823, throughput: 1297.70 | 2022-04-10 23:15:28.163 [rank:7] [train], epoch: 6/50, iter: 400/834, loss: 0.40002, top1: 0.46859, throughput: 1298.13 | 2022-04-10 23:15:28.161 [rank:0] [train], epoch: 6/50, iter: 400/834, loss: 0.40228, top1: 0.46656, throughput: 1297.56 | 2022-04-10 23:15:28.164 [rank:5] [train], epoch: 6/50, iter: 500/834, loss: 0.39762, top1: 0.47177, throughput: 1313.44 | 2022-04-10 23:15:42.779 [rank:2] [train], epoch: 6/50, iter: 500/834, loss: 0.40135, top1: 0.47266, throughput: 1313.36 | 2022-04-10 23:15:42.781 [rank:4] [train], epoch: 6/50, iter: 500/834, loss: 0.40315, top1: 0.46453, throughput: 1313.17 | 2022-04-10 23:15:42.781 [rank:6] [train], epoch: 6/50, iter: 500/834, loss: 0.40177, top1: 0.46896, throughput: 1313.30 | 2022-04-10 23:15:42.782 [rank:1] [train], epoch: 6/50, iter: 500/834, loss: 0.40370, top1: 0.46568, throughput: 1313.21 | 2022-04-10 23:15:42.783 [rank:7] [train], epoch: 6/50, iter: 500/834, loss: 0.40437, top1: 0.46594, throughput: 1313.24 | 2022-04-10 23:15:42.781 [rank:3] [train], epoch: 6/50, iter: 500/834, loss: 0.40099, top1: 0.47182, throughput: 1313.28 | 2022-04-10 23:15:42.783 [rank:0] [train], epoch: 6/50, iter: 500/834, loss: 0.40060, top1: 0.47229, throughput: 1313.46 | 2022-04-10 23:15:42.782 [rank:4] [train], epoch: 6/50, iter: 600/834, loss: 0.39887, top1: 0.47214, throughput: 1304.56 | 2022-04-10 23:15:57.499 [rank:1] [train], epoch: 6/50, iter: 600/834, loss: 0.40223, top1: 0.47292, throughput: 1304.61 | 2022-04-10 23:15:57.500 [rank:3] [train], epoch: 6/50, iter: 600/834, loss: 0.39912, top1: 0.47349, throughput: 1304.56 | 2022-04-10 23:15:57.501 [rank:0] [train], epoch: 6/50, iter: 600/834, loss: 0.40082, top1: 0.46953, throughput: 1304.56 | 2022-04-10 23:15:57.500 [rank:5] [train], epoch: 6/50, iter: 600/834, loss: 0.39925, top1: 0.47104, throughput: 1304.21 | 2022-04-10 23:15:57.501 [rank:2] [train], epoch: 6/50, 
iter: 600/834, loss: 0.39852, top1: 0.46984, throughput: 1304.16 | 2022-04-10 23:15:57.503 [rank:6] [train], epoch: 6/50, iter: 600/834, loss: 0.40167, top1: 0.46901, throughput: 1304.17 | 2022-04-10 23:15:57.504 [rank:7] [train], epoch: 6/50, iter: 600/834, loss: 0.40170, top1: 0.46984, throughput: 1303.97 | 2022-04-10 23:15:57.505 [rank:6] [train], epoch: 6/50, iter: 700/834, loss: 0.39859, top1: 0.47380, throughput: 1311.80 | 2022-04-10 23:16:12.140 [rank:4] [train], epoch: 6/50, iter: 700/834, loss: 0.39663, top1: 0.47115, throughput: 1311.36 | 2022-04-10 23:16:12.140 [rank:5] [train], epoch: 6/50, iter: 700/834, loss: 0.40087, top1: 0.46703, throughput: 1311.60 | 2022-04-10 23:16:12.140 [rank:7] [train], epoch: 6/50, iter: 700/834, loss: 0.40004, top1: 0.46922, throughput: 1311.85 | 2022-04-10 23:16:12.141 [rank:3] [train], epoch: 6/50, iter: 700/834, loss: 0.39921, top1: 0.47141, throughput: 1311.36 | 2022-04-10 23:16:12.142 [rank:0] [train], epoch: 6/50, iter: 700/834, loss: 0.39730, top1: 0.47307, throughput: 1311.21 | 2022-04-10 23:16:12.143 [rank:2] [train], epoch: 6/50, iter: 700/834, loss: 0.39764, top1: 0.47797, throughput: 1311.63 | 2022-04-10 23:16:12.141 [rank:1] [train], epoch: 6/50, iter: 700/834, loss: 0.39798, top1: 0.47094, throughput: 1311.16 | 2022-04-10 23:16:12.143 [rank:6] [train], epoch: 6/50, iter: 800/834, loss: 0.39494, top1: 0.48245, throughput: 1313.22 | 2022-04-10 23:16:26.761 [rank:4] [train], epoch: 6/50, iter: 800/834, loss: 0.39677, top1: 0.47510, throughput: 1313.24 | 2022-04-10 23:16:26.761 [rank:7] [train], epoch: 6/50, iter: 800/834, loss: 0.40076, top1: 0.47120, throughput: 1313.13 | 2022-04-10 23:16:26.763 [rank:1] [train], epoch: 6/50, iter: 800/834, loss: 0.39476, top1: 0.48391, throughput: 1313.10 | 2022-04-10 23:16:26.765 [rank:5] [train], epoch: 6/50, iter: 800/834, loss: 0.39833, top1: 0.47411, throughput: 1312.98 | 2022-04-10 23:16:26.763 [rank:3] [train], epoch: 6/50, iter: 800/834, loss: 0.39706, top1: 0.48052, 
throughput: 1313.19 | 2022-04-10 23:16:26.763 [rank:0] [train], epoch: 6/50, iter: 800/834, loss: 0.39660, top1: 0.47344, throughput: 1313.26 | 2022-04-10 23:16:26.763 [rank:2] [train], epoch: 6/50, iter: 800/834, loss: 0.39438, top1: 0.48099, throughput: 1313.08 | 2022-04-10 23:16:26.763 [rank:6] [train], epoch: 6/50, iter: 834/834, loss: 0.39698, top1: 0.47457, throughput: 1310.67 | 2022-04-10 23:16:31.741 [rank:2] [train], epoch: 6/50, iter: 834/834, loss: 0.40038, top1: 0.46630, throughput: 1311.35 | 2022-04-10 23:16:31.741 [rank:7] [train], epoch: 6/50, iter: 834/834, loss: 0.39771, top1: 0.47105, throughput: 1311.12 | 2022-04-10 23:16:31.742 [rank:4] [train], epoch: 6/50, iter: 834/834, loss: 0.39833, top1: 0.47672, throughput: 1310.55 | 2022-04-10 23:16:31.742 [rank:0] [train], epoch: 6/50, iter: 834/834, loss: 0.39194, top1: 0.49173, throughput: 1310.62 | 2022-04-10 23:16:31.744 [rank:1] [train], epoch: 6/50, iter: 834/834, loss: 0.39680, top1: 0.47947, throughput: 1311.50 | 2022-04-10 23:16:31.743 [rank:5] [train], epoch: 6/50, iter: 834/834, loss: 0.40012, top1: 0.46860, throughput: 1310.76 | 2022-04-10 23:16:31.743 [rank:3] [train], epoch: 6/50, iter: 834/834, loss: 0.39629, top1: 0.47917, throughput: 1310.41 | 2022-04-10 23:16:31.745 [rank:0] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.44400, throughput: 591.74 | 2022-04-10 23:16:42.306 [rank:2] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.43376, throughput: 586.07 | 2022-04-10 23:16:42.406 [rank:7] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.44272, throughput: 585.51 | 2022-04-10 23:16:42.416 [rank:6] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.44448, throughput: 583.87 | 2022-04-10 23:16:42.446 [rank:1] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.44784, throughput: 581.72 | 2022-04-10 23:16:42.487 [rank:3] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.43216, throughput: 580.77 | 2022-04-10 23:16:42.506 [rank:5] 
[eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.43248, throughput: 578.44 | 2022-04-10 23:16:42.548 [rank:4] [eval], epoch: 6/50, iter: 125/125, loss: 0.00000, top1: 0.43344, throughput: 577.58 | 2022-04-10 23:16:42.563 [rank:6] [train], epoch: 7/50, iter: 100/834, loss: 0.38967, top1: 0.48964, throughput: 1299.96 | 2022-04-10 23:16:57.215 [rank:2] [train], epoch: 7/50, iter: 100/834, loss: 0.39214, top1: 0.48833, throughput: 1296.39 | 2022-04-10 23:16:57.216 [rank:4] [train], epoch: 7/50, iter: 100/834, loss: 0.39162, top1: 0.48484, throughput: 1310.22[rank:5] [train], epoch: 7/50, iter: 100/834, loss: 0.38800, top1: 0.49125, throughput: 1309.02 | 2022-04-10 23:16:57.216 | 2022-04-10 23:16:57.217 [rank:3] [train], epoch: 7/50, iter: 100/834, loss: 0.39283, top1: 0.49182, throughput: 1305.19 | 2022-04-10 23:16:57.217 [rank:1] [train], epoch: 7/50, iter: 100/834, loss: 0.39130, top1: 0.48505, throughput: 1303.31 | 2022-04-10 23:16:57.218 [rank:0] [train], epoch: 7/50, iter: 100/834, loss: 0.38554, top1: 0.49698, throughput: 1287.62 | 2022-04-10 23:16:57.217 [rank:7] [train], epoch: 7/50, iter: 100/834, loss: 0.38957, top1: 0.48885, throughput: 1297.05 | 2022-04-10 23:16:57.219 [rank:6] [train], epoch: 7/50, iter: 200/834, loss: 0.39154, top1: 0.48401, throughput: 1314.98 | 2022-04-10 23:17:11.816 [rank:4] [train], epoch: 7/50, iter: 200/834, loss: 0.39181, top1: 0.48193, throughput: 1315.27 | 2022-04-10 23:17:11.815 [rank:2] [train], epoch: 7/50, iter: 200/834, loss: 0.39083, top1: 0.48781, throughput: 1315.11 | 2022-04-10 23:17:11.815 [rank:1] [train], epoch: 7/50, iter: 200/834, loss: 0.38848, top1: 0.49214, throughput: 1314.95 | 2022-04-10 23:17:11.820 [rank:7] [train], epoch: 7/50, iter: 200/834, loss: 0.39220, top1: 0.48365, throughput: 1315.19 | 2022-04-10 23:17:11.818 [rank:5] [train], epoch: 7/50, iter: 200/834, loss: 0.39106, top1: 0.48583, throughput: 1314.83 | 2022-04-10 23:17:11.818 [rank:3] [train], epoch: 7/50, iter: 200/834, loss: 0.39010, 
top1: 0.48828, throughput: 1314.65 | 2022-04-10 23:17:11.822 [rank:0] [train], epoch: 7/50, iter: 200/834, loss: 0.38872, top1: 0.49062, throughput: 1314.65 | 2022-04-10 23:17:11.821 [rank:4] [train], epoch: 7/50, iter: 300/834, loss: 0.38903, top1: 0.48750, throughput: 1305.04 | 2022-04-10 23:17:26.527 [rank:5] [train], epoch: 7/50, iter: 300/834, loss: 0.39022, top1: 0.48656, throughput: 1305.29 | 2022-04-10 23:17:26.528 [rank:6] [train], epoch: 7/50, iter: 300/834, loss: 0.38706, top1: 0.49083, throughput: 1305.11[rank:2] [train], epoch: 7/50, iter: 300/834, loss: 0.39262, top1: 0.49026, throughput: 1305.07 | 2022-04-10 23:17:26.527 | 2022-04-10 23:17:26.528 [rank:0] [train], epoch: 7/50, iter: 300/834, loss: 0.38965, top1: 0.48943, throughput: 1305.44 | 2022-04-10 23:17:26.529 [rank:3] [train], epoch: 7/50, iter: 300/834, loss: 0.38868, top1: 0.49083, throughput: 1305.43 | 2022-04-10 23:17:26.529 [rank:1] [train], epoch: 7/50, iter: 300/834, loss: 0.39430, top1: 0.48146, throughput: 1305.29 | 2022-04-10 23:17:26.529 [rank:7] [train], epoch: 7/50, iter: 300/834, loss: 0.38985, top1: 0.48927, throughput: 1305.11 | 2022-04-10 23:17:26.529 [rank:6] [train], epoch: 7/50, iter: 400/834, loss: 0.38469, top1: 0.49833, throughput: 1313.82 | 2022-04-10 23:17:41.141 [rank:4] [train], epoch: 7/50, iter: 400/834, loss: 0.38656, top1: 0.49260, throughput: 1313.78 | 2022-04-10 23:17:41.141[rank:1] [train], epoch: 7/50, iter: 400/834, loss: 0.38938, top1: 0.48922, throughput: 1313.70 | 2022-04-10 23:17:41.144 [rank:2] [train], epoch: 7/50, iter: 400/834, loss: 0.38717, top1: 0.49448, throughput: 1313.81 | 2022-04-10 23:17:41.141 [rank:5] [train], epoch: 7/50, iter: 400/834, loss: 0.38926, top1: 0.49125, throughput: 1313.75 | 2022-04-10 23:17:41.142 [rank:7] [train], epoch: 7/50, iter: 400/834, loss: 0.39164, top1: 0.48615, throughput: 1313.87 | 2022-04-10 23:17:41.142 [rank:0] [train], epoch: 7/50, iter: 400/834, loss: 0.38827, top1: 0.49396, throughput: 1313.83 | 2022-04-10 
23:17:41.143 [rank:3] [train], epoch: 7/50, iter: 400/834, loss: 0.38700, top1: 0.49708, throughput: 1313.61 | 2022-04-10 23:17:41.146 [rank:5] [train], epoch: 7/50, iter: 500/834, loss: 0.38747, top1: 0.49125, throughput: 1308.18 | 2022-04-10 23:17:55.819 [rank:6] [train], epoch: 7/50, iter: 500/834, loss: 0.39019, top1: 0.48625, throughput: 1308.23 | 2022-04-10 23:17:55.818 [rank:4] [train], epoch: 7/50, iter: 500/834, loss: 0.39062, top1: 0.48844, throughput: 1308.11 | 2022-04-10 23:17:55.819 [rank:3] [train], epoch: 7/50, iter: 500/834, loss: 0.38825, top1: 0.49010, throughput: 1308.09 [rank:2] [train], epoch: 7/50, iter: 500/834, loss: 0.38434, top1: 0.49750, throughput: 1308.00| 2022-04-10 23:17:55.823 | 2022-04-10 23:17:55.820 [rank:7] [train], epoch: 7/50, iter: 500/834, loss: 0.38871, top1: 0.49281, throughput: 1307.93 | 2022-04-10 23:17:55.822 [rank:0] [train], epoch: 7/50, iter: 500/834, loss: 0.38683, top1: 0.49547, throughput: 1307.94 | 2022-04-10 23:17:55.822 [rank:1] [train], epoch: 7/50, iter: 500/834, loss: 0.38714, top1: 0.49661, throughput: 1308.24 | 2022-04-10 23:17:55.821 [rank:4] [train], epoch: 7/50, iter: 600/834, loss: 0.38961, top1: 0.48536, throughput: 1315.01 | 2022-04-10 23:18:10.419 [rank:6] [train], epoch: 7/50, iter: 600/834, loss: 0.38664, top1: 0.49172, throughput: 1314.94 | 2022-04-10 23:18:10.419 [rank:5] [train], epoch: 7/50, iter: 600/834, loss: 0.38824, top1: 0.48953, throughput: 1315.08 | 2022-04-10 23:18:10.419 [rank:1] [train], epoch: 7/50, iter: 600/834, loss: 0.38957, top1: 0.48911, throughput: 1314.92 | 2022-04-10 23:18:10.422 [rank:3] [train], epoch: 7/50, iter: 600/834, loss: 0.39117, top1: 0.48526, throughput: 1315.23 | 2022-04-10 23:18:10.422 [rank:7] [train], epoch: 7/50, iter: 600/834, loss: 0.38410, top1: 0.49573, throughput: 1315.25 | 2022-04-10 23:18:10.420 [rank:2] [train], epoch: 7/50, iter: 600/834, loss: 0.38837, top1: 0.49047, throughput: 1314.61 | 2022-04-10 23:18:10.425 [rank:0] [train], epoch: 7/50, 
iter: 600/834, loss: 0.38898, top1: 0.49156, throughput: 1314.78 | 2022-04-10 23:18:10.426 [rank:6] [train], epoch: 7/50, iter: 700/834, loss: 0.38817, top1: 0.49214, throughput: 1311.62 | 2022-04-10 23:18:25.058 [rank:7] [train], epoch: 7/50, iter: 700/834, loss: 0.38686, top1: 0.49141, throughput: 1311.65 | 2022-04-10 23:18:25.058 [rank:5] [train], epoch: 7/50, iter: 700/834, loss: 0.39076, top1: 0.48510, throughput: 1311.55 | 2022-04-10 23:18:25.058 [rank:4] [train], epoch: 7/50, iter: 700/834, loss: 0.38570, top1: 0.49724, throughput: 1311.61 | 2022-04-10 23:18:25.058 [rank:1] [train], epoch: 7/50, iter: 700/834, loss: 0.38626, top1: 0.49630, throughput: 1311.74 | 2022-04-10 23:18:25.059 [rank:2] [train], epoch: 7/50, iter: 700/834, loss: 0.38763, top1: 0.49224, throughput: 1311.94 | 2022-04-10 23:18:25.060 [rank:0] [train], epoch: 7/50, iter: 700/834, loss: 0.38739, top1: 0.48995, throughput: 1311.89 | 2022-04-10 23:18:25.061 [rank:3] [train], epoch: 7/50, iter: 700/834, loss: 0.38518, top1: 0.49328, throughput: 1311.57 | 2022-04-10 23:18:25.061 [rank:4] [train], epoch: 7/50, iter: 800/834, loss: 0.39005, top1: 0.48687, throughput: 1313.42 | 2022-04-10 23:18:39.676 [rank:2] [train], epoch: 7/50, iter: 800/834, loss: 0.38436, top1: 0.49802, throughput: 1313.60[rank:6] [train], epoch: 7/50, iter: 800/834, loss: 0.38791, top1: 0.48979, throughput: 1313.34 | 2022-04-10 23:18:39.677| 2022-04-10 23:18:39.676 [rank:5] [train], epoch: 7/50, iter: 800/834, loss: 0.38836, top1: 0.49380, throughput: 1313.31 | 2022-04-10 23:18:39.678 [rank:1] [train], epoch: 7/50, iter: 800/834, loss: 0.38722, top1: 0.49271, throughput: 1313.20 | 2022-04-10 23:18:39.680 [rank:3] [train], epoch: 7/50, iter: 800/834, loss: 0.38820, top1: 0.49021, throughput: 1313.46 | 2022-04-10 23:18:39.678 [rank:7] [train], epoch: 7/50, iter: 800/834, loss: 0.38684, top1: 0.49375, throughput: 1313.17 | 2022-04-10 23:18:39.679 [rank:0] [train], epoch: 7/50, iter: 800/834, loss: 0.38638, top1: 0.49328, 
throughput: 1313.40 | 2022-04-10 23:18:39.680 [rank:5] [train], epoch: 7/50, iter: 834/834, loss: 0.38306, top1: 0.49954, throughput: 1315.29 | 2022-04-10 23:18:44.641 [rank:4] [train], epoch: 7/50, iter: 834/834, loss: 0.38334, top1: 0.49724, throughput: 1314.92 | 2022-04-10 23:18:44.641 [rank:7] [train], epoch: 7/50, iter: 834/834, loss: 0.38376, top1: 0.50000, throughput: 1315.57 | 2022-04-10 23:18:44.641 [rank:0] [train], epoch: 7/50, iter: 834/834, loss: 0.38988, top1: 0.48591, throughput: 1315.34 | 2022-04-10 23:18:44.643 [rank:6] [train], epoch: 7/50, iter: 834/834, loss: 0.38744, top1: 0.49219, throughput: 1314.65 | 2022-04-10 23:18:44.642 [rank:3] [train], epoch: 7/50, iter: 834/834, loss: 0.38567, top1: 0.49387, throughput: 1315.03 | 2022-04-10 23:18:44.643 [rank:1] [train], epoch: 7/50, iter: 834/834, loss: 0.38552, top1: 0.49877, throughput: 1315.52 | 2022-04-10 23:18:44.642 [rank:2] [train], epoch: 7/50, iter: 834/834, loss: 0.38502, top1: 0.49004, throughput: 1314.40 | 2022-04-10 23:18:44.643 [rank:0] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.50576, throughput: 577.25 | 2022-04-10 23:18:55.470 [rank:7] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.50112, throughput: 577.16 | 2022-04-10 23:18:55.470 [rank:5] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.49264, throughput: 571.52 | 2022-04-10 23:18:55.577 [rank:2] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.50208, throughput: 570.68 | 2022-04-10 23:18:55.595 [rank:1] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.51456, throughput: 568.85 | 2022-04-10 23:18:55.629 [rank:6] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.50592, throughput: 568.59 | 2022-04-10 23:18:55.634 [rank:3] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.49280, throughput: 567.25 | 2022-04-10 23:18:55.661 [rank:4] [eval], epoch: 7/50, iter: 125/125, loss: 0.00000, top1: 0.50352, throughput: 563.94 | 2022-04-10 23:18:55.724 [rank:6] [train], 
epoch: 8/50, iter: 100/834, loss: 0.38021, top1: 0.50260, throughput: 1312.49 | 2022-04-10 23:19:10.263 [rank:1] [train], epoch: 8/50, iter: 100/834, loss: 0.37811, top1: 0.50964, throughput: 1311.84[rank:5] [train], epoch: 8/50, iter: 100/834, loss: 0.38027, top1: 0.50307, throughput: 1307.22 | 2022-04-10 23:19:10.265| 2022-04-10 23:19:10.264 [rank:7] [train], epoch: 8/50, iter: 100/834, loss: 0.37930, top1: 0.50484, throughput: 1297.80 | 2022-04-10 23:19:10.264 [rank:4] [train], epoch: 8/50, iter: 100/834, loss: 0.37675, top1: 0.51089, throughput: 1320.48 | 2022-04-10 23:19:10.264 [rank:0] [train], epoch: 8/50, iter: 100/834, loss: 0.37758, top1: 0.50995, throughput: 1297.77 | 2022-04-10 23:19:10.264 [rank:3] [train], epoch: 8/50, iter: 100/834, loss: 0.38198, top1: 0.50229, throughput: 1314.63 [rank:2] [train], epoch: 8/50, iter: 100/834, loss: 0.37778, top1: 0.50677, throughput: 1308.71 | 2022-04-10 23:19:10.266 | 2022-04-10 23:19:10.265 [rank:6] [train], epoch: 8/50, iter: 200/834, loss: 0.37784, top1: 0.51000, throughput: 1303.61 | 2022-04-10 23:19:24.991 [rank:5] [train], epoch: 8/50, iter: 200/834, loss: 0.38099, top1: 0.50349, throughput: 1303.54 | 2022-04-10 23:19:24.993 [rank:1] [train], epoch: 8/50, iter: 200/834, loss: 0.38000, top1: 0.50297, throughput: 1303.62 | 2022-04-10 23:19:24.994 [rank:3] [train], epoch: 8/50, iter: 200/834, loss: 0.38117, top1: 0.50115, throughput: 1303.53 | 2022-04-10 23:19:24.995 [rank:2] [train], epoch: 8/50, iter: 200/834, loss: 0.38102, top1: 0.50438, throughput: 1303.61 | 2022-04-10 23:19:24.994 [rank:0] [train], epoch: 8/50, iter: 200/834, loss: 0.38359, top1: 0.50177, throughput: 1303.43 | 2022-04-10 23:19:24.995 [rank:4] [train], epoch: 8/50, iter: 200/834, loss: 0.37915, top1: 0.50781, throughput: 1303.31 | 2022-04-10 23:19:24.996 [rank:7] [train], epoch: 8/50, iter: 200/834, loss: 0.37877, top1: 0.50521, throughput: 1303.34 | 2022-04-10 23:19:24.996 [rank:4] [train], epoch: 8/50, iter: 300/834, loss: 0.37800, top1: 
0.50734, throughput: 1315.82 | 2022-04-10 23:19:39.587 [rank:5] [train], epoch: 8/50, iter: 300/834, loss: 0.38284, top1: 0.49917, throughput: 1315.49 | 2022-04-10 23:19:39.589 [rank:2] [train], epoch: 8/50, iter: 300/834, loss: 0.37920, top1: 0.50698, throughput: 1315.54 | 2022-04-10 23:19:39.589 [rank:0] [train], epoch: 8/50, iter: 300/834, loss: 0.38179, top1: 0.50068, throughput: 1315.63 | 2022-04-10 23:19:39.589 [rank:7] [train], epoch: 8/50, iter: 300/834, loss: 0.37748, top1: 0.51187, throughput: 1315.81 | 2022-04-10 23:19:39.588 [rank:1] [train], epoch: 8/50, iter: 300/834, loss: 0.38173, top1: 0.49859, throughput: 1315.50 | 2022-04-10 23:19:39.589 [rank:3] [train], epoch: 8/50, iter: 300/834, loss: 0.38014, top1: 0.50578, throughput: 1315.53 [rank:6] [train], epoch: 8/50, iter: 300/834, loss: 0.37875, top1: 0.50562, throughput: 1315.11 | 2022-04-10 23:19:39.589| 2022-04-10 23:19:39.591 [rank:4] [train], epoch: 8/50, iter: 400/834, loss: 0.37718, top1: 0.50958, throughput: 1315.00 | 2022-04-10 23:19:54.188 [rank:2] [train], epoch: 8/50, iter: 400/834, loss: 0.37810, top1: 0.50984, throughput: 1315.13 | 2022-04-10 23:19:54.188 [rank:6] [train], epoch: 8/50, iter: 400/834, loss: 0.38009, top1: 0.50755, throughput: 1315.17 | 2022-04-10 23:19:54.190 [rank:5] [train], epoch: 8/50, iter: 400/834, loss: 0.38022, top1: 0.50578, throughput: 1314.99 | 2022-04-10 23:19:54.190 [rank:1] [train], epoch: 8/50, iter: 400/834, loss: 0.37913, top1: 0.50958, throughput: 1314.98 | 2022-04-10 23:19:54.190 [rank:3] [train], epoch: 8/50, iter: 400/834, loss: 0.38097, top1: 0.50490, throughput: 1314.98 | 2022-04-10 23:19:54.190 [rank:0] [train], epoch: 8/50, iter: 400/834, loss: 0.38174, top1: 0.50328, throughput: 1314.88 | 2022-04-10 23:19:54.191 [rank:7] [train], epoch: 8/50, iter: 400/834, loss: 0.37820, top1: 0.50901, throughput: 1314.75 | 2022-04-10 23:19:54.191 [rank:4] [train], epoch: 8/50, iter: 500/834, loss: 0.37934, top1: 0.50833, throughput: 1315.04 | 2022-04-10 
23:20:08.788 [rank:6] [train], epoch: 8/50, iter: 500/834, loss: 0.38139, top1: 0.50271, throughput: 1315.19 | 2022-04-10 23:20:08.789 [rank:1] [train], epoch: 8/50, iter: 500/834, loss: 0.37981, top1: 0.50292, throughput: 1315.20 | 2022-04-10 23:20:08.788 [rank:3] [train], epoch: 8/50, iter: 500/834, loss: 0.37944, top1: 0.50641, throughput: 1315.06 | 2022-04-10 23:20:08.791 [rank:5] [train], epoch: 8/50, iter: 500/834, loss: 0.37963, top1: 0.50844, throughput: 1315.16 | 2022-04-10 23:20:08.789 [rank:2] [train], epoch: 8/50, iter: 500/834, loss: 0.37715, top1: 0.51568, throughput: 1315.06 | 2022-04-10 23:20:08.788 [rank:0] [train], epoch: 8/50, iter: 500/834, loss: 0.38046, top1: 0.50370, throughput: 1315.18 | 2022-04-10 23:20:08.789 [rank:7] [train], epoch: 8/50, iter: 500/834, loss: 0.38162, top1: 0.50292, throughput: 1315.31 | 2022-04-10 23:20:08.788 [rank:4] [train], epoch: 8/50, iter: 600/834, loss: 0.37988, top1: 0.50292, throughput: 1315.07 | 2022-04-10 23:20:23.388 [rank:5] [train], epoch: 8/50, iter: 600/834, loss: 0.38181, top1: 0.50172, throughput: 1315.08 | 2022-04-10 23:20:23.388 [rank:2] [train], epoch: 8/50, iter: 600/834, loss: 0.37745, top1: 0.50974, throughput: 1314.78 | 2022-04-10 23:20:23.391 [rank:6] [train], epoch: 8/50, iter: 600/834, loss: 0.37630, top1: 0.50703, throughput: 1315.06 | 2022-04-10 23:20:23.389 [rank:0] [train], epoch: 8/50, iter: 600/834, loss: 0.37784, top1: 0.50979, throughput: 1315.02 | 2022-04-10 23:20:23.390 [rank:1] [train], epoch: 8/50, iter: 600/834, loss: 0.38028, top1: 0.50208, throughput: 1314.88 | 2022-04-10 23:20:23.391 [rank:7] [train], epoch: 8/50, iter: 600/834, loss: 0.37789, top1: 0.51328, throughput: 1314.81 | 2022-04-10 23:20:23.391 [rank:3] [train], epoch: 8/50, iter: 600/834, loss: 0.37836, top1: 0.50349, throughput: 1314.86 | 2022-04-10 23:20:23.393 [rank:4] [train], epoch: 8/50, iter: 700/834, loss: 0.37796, top1: 0.50932, throughput: 1314.30 | 2022-04-10 23:20:37.997 [rank:2] [train], epoch: 8/50, 
iter: 700/834, loss: 0.38105, top1: 0.50703, throughput: 1314.48 | 2022-04-10 23:20:37.998 [rank:6] [train], epoch: 8/50, iter: 700/834, loss: 0.37981, top1: 0.50125, throughput: 1314.28 | 2022-04-10 23:20:37.997 [rank:5] [train], epoch: 8/50, iter: 700/834, loss: 0.37977, top1: 0.50448, throughput: 1314.14 | 2022-04-10 23:20:37.999 [rank:7] [train], epoch: 8/50, iter: 700/834, loss: 0.37982, top1: 0.50370, throughput: 1314.38 | 2022-04-10 23:20:37.999 [rank:0] [train], epoch: 8/50, iter: 700/834, loss: 0.37870, top1: 0.50661, throughput: 1314.27 | 2022-04-10 23:20:37.999 [rank:1] [train], epoch: 8/50, iter: 700/834, loss: 0.37938, top1: 0.50797, throughput: 1313.88 | 2022-04-10 23:20:38.004 [rank:3] [train], epoch: 8/50, iter: 700/834, loss: 0.37622, top1: 0.51380, throughput: 1314.03 | 2022-04-10 23:20:38.004 [rank:2] [train], epoch: 8/50, iter: 800/834, loss: 0.37786, top1: 0.51125, throughput: 1315.63 | 2022-04-10 23:20:52.592 [rank:5] [train], epoch: 8/50, iter: 800/834, loss: 0.37866, top1: 0.50510, throughput: 1315.83 | 2022-04-10 23:20:52.590 [rank:6] [train], epoch: 8/50, iter: 800/834, loss: 0.37953, top1: 0.50682, throughput: 1315.65 | 2022-04-10 23:20:52.591 [rank:4] [train], epoch: 8/50, iter: 800/834, loss: 0.37666, top1: 0.51464, throughput: 1315.54 | 2022-04-10 23:20:52.592 [rank:7] [train], epoch: 8/50, iter: 800/834, loss: 0.37968, top1: 0.50698, throughput: 1315.71 | 2022-04-10 23:20:52.592 [rank:1] [train], epoch: 8/50, iter: 800/834, loss: 0.37898, top1: 0.50729, throughput: 1315.99 | 2022-04-10 23:20:52.594 [rank:3] [train], epoch: 8/50, iter: 800/834, loss: 0.37814, top1: 0.51036, throughput: 1316.05 | 2022-04-10 23:20:52.594 [rank:0] [train], epoch: 8/50, iter: 800/834, loss: 0.37696, top1: 0.50922, throughput: 1315.52 | 2022-04-10 23:20:52.594 [rank:5] [train], epoch: 8/50, iter: 834/834, loss: 0.37801, top1: 0.51271, throughput: 1311.99 | 2022-04-10 23:20:57.566 [rank:2] [train], epoch: 8/50, iter: 834/834, loss: 0.37607, top1: 0.51072, 
throughput: 1311.82 | 2022-04-10 23:20:57.568 [rank:6] [train], epoch: 8/50, iter: 834/834, loss: 0.37723, top1: 0.50551, throughput: 1311.56 | 2022-04-10 23:20:57.568 [rank:1] [train], epoch: 8/50, iter: 834/834, loss: 0.37661, top1: 0.50383, throughput: 1312.24 | 2022-04-10 23:20:57.568 [rank:7] [train], epoch: 8/50, iter: 834/834, loss: 0.37437, top1: 0.51854, throughput: 1311.76 | 2022-04-10 23:20:57.568 [rank:0] [train], epoch: 8/50, iter: 834/834, loss: 0.38208, top1: 0.49985, throughput: 1312.20[rank:4] [train], epoch: 8/50, iter: 834/834, loss: 0.38056, top1: 0.50138, throughput: 1311.63 | 2022-04-10 23:20:57.569 | 2022-04-10 23:20:57.569 [rank:3] [train], epoch: 8/50, iter: 834/834, loss: 0.37938, top1: 0.50674, throughput: 1311.37 | 2022-04-10 23:20:57.572 [rank:7] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.47712, throughput: 579.90 | 2022-04-10 23:21:08.346 [rank:0] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.47664, throughput: 578.23 | 2022-04-10 23:21:08.378 [rank:5] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.46496, throughput: 576.46 | 2022-04-10 23:21:08.408 [rank:2] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.46416, throughput: 573.10 | 2022-04-10 23:21:08.474 [rank:6] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.48048, throughput: 570.93 | 2022-04-10 23:21:08.515 [rank:3] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.47552, throughput: 570.86 | 2022-04-10 23:21:08.520 [rank:4] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.47216, throughput: 570.69 | 2022-04-10 23:21:08.520 [rank:1] [eval], epoch: 8/50, iter: 125/125, loss: 0.00000, top1: 0.48128, throughput: 562.18 | 2022-04-10 23:21:08.686 [rank:6] [train], epoch: 9/50, iter: 100/834, loss: 0.37470, top1: 0.51266, throughput: 1304.29 | 2022-04-10 23:21:23.236 [rank:4] [train], epoch: 9/50, iter: 100/834, loss: 0.36972, top1: 0.52344, throughput: 1304.71 | 2022-04-10 23:21:23.236 [rank:1] [train], 
epoch: 9/50, iter: 100/834, loss: 0.37220, top1: 0.51760, throughput: 1319.51 | 2022-04-10 23:21:23.237 [rank:5] [train], epoch: 9/50, iter: 100/834, loss: 0.37437, top1: 0.51599, throughput: 1294.84 | 2022-04-10 23:21:23.236 [rank:3] [train], epoch: 9/50, iter: 100/834, loss: 0.37168, top1: 0.52005, throughput: 1304.50 | 2022-04-10 23:21:23.238 [rank:7] [train], epoch: 9/50, iter: 100/834, loss: 0.37253, top1: 0.51714, throughput: 1289.24 | 2022-04-10 23:21:23.238 [rank:0] [train], epoch: 9/50, iter: 100/834, loss: 0.37203, top1: 0.51844, throughput: 1292.01 | 2022-04-10 23:21:23.238 [rank:2] [train], epoch: 9/50, iter: 100/834, loss: 0.37148, top1: 0.52240, throughput: 1300.37 | 2022-04-10 23:21:23.239 [rank:6] [train], epoch: 9/50, iter: 200/834, loss: 0.37494, top1: 0.51318, throughput: 1307.84 | 2022-04-10 23:21:37.917 [rank:4] [train], epoch: 9/50, iter: 200/834, loss: 0.37101, top1: 0.52073, throughput: 1307.86[rank:5] [train], epoch: 9/50, iter: 200/834, loss: 0.37134, top1: 0.52193, throughput: 1307.86 | 2022-04-10 23:21:37.917 | 2022-04-10 23:21:37.917 [rank:1] [train], epoch: 9/50, iter: 200/834, loss: 0.37192, top1: 0.51943, throughput: 1307.78 | 2022-04-10 23:21:37.918 [rank:2] [train], epoch: 9/50, iter: 200/834, loss: 0.37211, top1: 0.52318, throughput: 1307.98 | 2022-04-10 23:21:37.918 [rank:0] [train], epoch: 9/50, iter: 200/834, loss: 0.37058, top1: 0.52146, throughput: 1307.75 | 2022-04-10 23:21:37.920 [rank:3] [train], epoch: 9/50, iter: 200/834, loss: 0.37157, top1: 0.52068, throughput: 1307.75 | 2022-04-10 23:21:37.920 [rank:7] [train], epoch: 9/50, iter: 200/834, loss: 0.37126, top1: 0.52016, throughput: 1307.89 | 2022-04-10 23:21:37.918 [rank:6] [train], epoch: 9/50, iter: 300/834, loss: 0.37152, top1: 0.51781, throughput: 1317.51 | 2022-04-10 23:21:52.490 [rank:4] [train], epoch: 9/50, iter: 300/834, loss: 0.37145, top1: 0.51896, throughput: 1317.54 | 2022-04-10 23:21:52.489 [rank:5] [train], epoch: 9/50, iter: 300/834, loss: 0.37593, top1: 
0.51406, throughput: 1317.39 | 2022-04-10 23:21:52.491 [rank:1] [train], epoch: 9/50, iter: 300/834, loss: 0.37309, top1: 0.51938, throughput: 1317.45 | 2022-04-10 23:21:52.492 [rank:3] [train], epoch: 9/50, iter: 300/834, loss: 0.37234, top1: 0.51719, throughput: 1317.49 | 2022-04-10 23:21:52.493 [rank:0] [train], epoch: 9/50, iter: 300/834, loss: 0.37095, top1: 0.52245, throughput: 1317.58 | 2022-04-10 23:21:52.492 [rank:2] [train], epoch: 9/50, iter: 300/834, loss: 0.37485, top1: 0.51203, throughput: 1317.38 | 2022-04-10 23:21:52.492 [rank:7] [train], epoch: 9/50, iter: 300/834, loss: 0.37548, top1: 0.51000, throughput: 1317.46 | 2022-04-10 23:21:52.492 [rank:6] [train], epoch: 9/50, iter: 400/834, loss: 0.37376, top1: 0.51453, throughput: 1314.87 | 2022-04-10 23:22:07.092 [rank:1] [train], epoch: 9/50, iter: 400/834, loss: 0.37189, top1: 0.52130, throughput: 1314.97 | 2022-04-10 23:22:07.093 [rank:4] [train], epoch: 9/50, iter: 400/834, loss: 0.37206, top1: 0.51969, throughput: 1314.84 | 2022-04-10 23:22:07.092 [rank:2] [train], epoch: 9/50, iter: 400/834, loss: 0.37137, top1: 0.52016, throughput: 1315.01 | 2022-04-10 23:22:07.093 [rank:5] [train], epoch: 9/50, iter: 400/834, loss: 0.37511, top1: 0.51604, throughput: 1314.98 | 2022-04-10 23:22:07.092 [rank:3] [train], epoch: 9/50, iter: 400/834, loss: 0.37472, top1: 0.51422, throughput: 1314.97 | 2022-04-10 23:22:07.094 [rank:0] [train], epoch: 9/50, iter: 400/834, loss: 0.37204, top1: 0.51854, throughput: 1314.96 | 2022-04-10 23:22:07.093 [rank:7] [train], epoch: 9/50, iter: 400/834, loss: 0.37627, top1: 0.51370, throughput: 1314.97 | 2022-04-10 23:22:07.093 [rank:6] [train], epoch: 9/50, iter: 500/834, loss: 0.37133, top1: 0.52339, throughput: 1314.66 | 2022-04-10 23:22:21.696 [rank:5] [train], epoch: 9/50, iter: 500/834, loss: 0.37103, top1: 0.52188, throughput: 1314.67 | 2022-04-10 23:22:21.696 [rank:4] [train], epoch: 9/50, iter: 500/834, loss: 0.37119, top1: 0.52208, throughput: 1314.45[rank:0] [train], 
epoch: 9/50, iter: 500/834, loss: 0.36856, top1: 0.52797, throughput: 1314.75 | 2022-04-10 23:22:21.697 | 2022-04-10 23:22:21.699 [rank:2] [train], epoch: 9/50, iter: 500/834, loss: 0.37286, top1: 0.51729, throughput: 1314.67 | 2022-04-10 23:22:21.697 [rank:1] [train], epoch: 9/50, iter: 500/834, loss: 0.37227, top1: 0.51599, throughput: 1314.44 | 2022-04-10 23:22:21.700 [rank:3] [train], epoch: 9/50, iter: 500/834, loss: 0.37114, top1: 0.52130, throughput: 1314.51 | 2022-04-10 23:22:21.700 [rank:7] [train], epoch: 9/50, iter: 500/834, loss: 0.37201, top1: 0.51688, throughput: 1314.59 | 2022-04-10 23:22:21.698 [rank:2] [train], epoch: 9/50, iter: 600/834, loss: 0.37266, top1: 0.51849, throughput: 1314.43 | 2022-04-10 23:22:36.304 [rank:5] [train], epoch: 9/50, iter: 600/834, loss: 0.37406, top1: 0.51729, throughput: 1314.40 | 2022-04-10 23:22:36.304 [rank:1] [train], epoch: 9/50, iter: 600/834, loss: 0.37130, top1: 0.51651, throughput: 1314.56 | 2022-04-10 23:22:36.305 [rank:6] [train], epoch: 9/50, iter: 600/834, loss: 0.37250, top1: 0.51870, throughput: 1314.28 | 2022-04-10 23:22:36.305 [rank:4] [train], epoch: 9/50, iter: 600/834, loss: 0.37144, top1: 0.52089, throughput: 1314.42 | 2022-04-10 23:22:36.306 [rank:0] [train], epoch: 9/50, iter: 600/834, loss: 0.37029, top1: 0.52661, throughput: 1314.34 | 2022-04-10 23:22:36.305 [rank:3] [train], epoch: 9/50, iter: 600/834, loss: 0.36959, top1: 0.52432, throughput: 1314.32 | 2022-04-10 23:22:36.309 [rank:7] [train], epoch: 9/50, iter: 600/834, loss: 0.36915, top1: 0.52365, throughput: 1314.23 | 2022-04-10 23:22:36.308 [rank:5] [train], epoch: 9/50, iter: 700/834, loss: 0.37013, top1: 0.52260, throughput: 1311.49 | 2022-04-10 23:22:50.944 [rank:4] [train], epoch: 9/50, iter: 700/834, loss: 0.37414, top1: 0.51635, throughput: 1311.62 | 2022-04-10 23:22:50.944 [rank:0] [train], epoch: 9/50, iter: 700/834, loss: 0.37575, top1: 0.51115, throughput: 1311.41 | 2022-04-10 23:22:50.946 [rank:3] [train], epoch: 9/50, iter: 
700/834, loss: 0.37062, top1: 0.52146, throughput: 1311.61 | 2022-04-10 23:22:50.947 [rank:1] [train], epoch: 9/50, iter: 700/834, loss: 0.37089, top1: 0.52755, throughput: 1311.30 | 2022-04-10 23:22:50.947 [rank:6] [train], epoch: 9/50, iter: 700/834, loss: 0.37331, top1: 0.51437, throughput: 1311.37 | 2022-04-10 23:22:50.946 [rank:2] [train], epoch: 9/50, iter: 700/834, loss: 0.37094, top1: 0.52031, throughput: 1311.39 | 2022-04-10 23:22:50.945 [rank:7] [train], epoch: 9/50, iter: 700/834, loss: 0.37344, top1: 0.52036, throughput: 1311.51 | 2022-04-10 23:22:50.947 [rank:2] [train], epoch: 9/50, iter: 800/834, loss: 0.37205, top1: 0.51792, throughput: 1314.12 | 2022-04-10 23:23:05.556 [rank:3] [train], epoch: 9/50, iter: 800/834, loss: 0.37331, top1: 0.51776, throughput: 1314.09 | 2022-04-10 23:23:05.558 [rank:5] [train], epoch: 9/50, iter: 800/834, loss: 0.36897, top1: 0.52745, throughput: 1314.01 | 2022-04-10 23:23:05.555 [rank:4] [train], epoch: 9/50, iter: 800/834, loss: 0.37337, top1: 0.51776, throughput: 1314.02 | 2022-04-10 23:23:05.556 [rank:6] [train], epoch: 9/50, iter: 800/834, loss: 0.37191, top1: 0.51464, throughput: 1314.25 | 2022-04-10 23:23:05.555 [rank:0] [train], epoch: 9/50, iter: 800/834, loss: 0.37177, top1: 0.52047, throughput: 1314.03 | 2022-04-10 23:23:05.557 [rank:1] [train], epoch: 9/50, iter: 800/834, loss: 0.37232, top1: 0.51786, throughput: 1314.08 | 2022-04-10 23:23:05.558 [rank:7] [train], epoch: 9/50, iter: 800/834, loss: 0.36992, top1: 0.52234, throughput: 1314.09 | 2022-04-10 23:23:05.558 [rank:5] [train], epoch: 9/50, iter: 834/834, loss: 0.37597, top1: 0.51195, throughput: 1313.58 | 2022-04-10 23:23:10.525 [rank:4] [train], epoch: 9/50, iter: 834/834, loss: 0.37445, top1: 0.51195, throughput: 1313.30 | 2022-04-10 23:23:10.526 [rank:6] [train], epoch: 9/50, iter: 834/834, loss: 0.37485, top1: 0.51501, throughput: 1313.10 | 2022-04-10 23:23:10.527 [rank:0] [train], epoch: 9/50, iter: 834/834, loss: 0.37399, top1: 0.51945, 
throughput: 1313.57 | 2022-04-10 23:23:10.527 [rank:7] [train], epoch: 9/50, iter: 834/834, loss: 0.37093, top1: 0.52834, throughput: 1313.90 | 2022-04-10 23:23:10.527 [rank:1] [train], epoch: 9/50, iter: 834/834, loss: 0.36993, top1: 0.52466, throughput: 1313.50 | 2022-04-10 23:23:10.528 [rank:2] [train], epoch: 9/50, iter: 834/834, loss: 0.37066, top1: 0.51455, throughput: 1312.83 | 2022-04-10 23:23:10.528 [rank:3] [train], epoch: 9/50, iter: 834/834, loss: 0.37011, top1: 0.51501, throughput: 1313.36 | 2022-04-10 23:23:10.529 [rank:0] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.52032, throughput: 574.42 | 2022-04-10 23:23:21.407 [rank:7] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.50496, throughput: 570.57 | 2022-04-10 23:23:21.481 [rank:1] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.51456, throughput: 568.64 | 2022-04-10 23:23:21.519 [rank:5] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.49664, throughput: 567.34 | 2022-04-10 23:23:21.541 [rank:2] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.51264, throughput: 564.76 | 2022-04-10 23:23:21.595 [rank:6] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.50720, throughput: 564.37 | 2022-04-10 23:23:21.601 [rank:3] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.49760, throughput: 564.12 | 2022-04-10 23:23:21.608 [rank:4] [eval], epoch: 9/50, iter: 125/125, loss: 0.00000, top1: 0.50048, throughput: 559.62 | 2022-04-10 23:23:21.695 [rank:4] [train], epoch: 10/50, iter: 100/834, loss: 0.36162, top1: 0.54198, throughput: 1320.93 | 2022-04-10 23:23:36.230 [rank:2] [train], epoch: 10/50, iter: 100/834, loss: 0.35905, top1: 0.54531, throughput: 1311.90 | 2022-04-10 23:23:36.230 [rank:3] [train], epoch: 10/50, iter: 100/834, loss: 0.36270, top1: 0.53620, throughput: 1312.95 | 2022-04-10 23:23:36.231 [rank:5] [train], epoch: 10/50, iter: 100/834, loss: 0.36472, top1: 0.53682, throughput: 1307.02 | 2022-04-10 23:23:36.231 [rank:7] 
[train], epoch: 10/50, iter: 100/834, loss: 0.36857, top1: 0.52224, throughput: 1301.71 | 2022-04-10 23:23:36.231 [rank:0] [train], epoch: 10/50, iter: 100/834, loss: 0.36404, top1: 0.53802, throughput: 1295.19 | 2022-04-10 23:23:36.231 [rank:6] [train], epoch: 10/50, iter: 100/834, loss: 0.36419, top1: 0.53500, throughput: 1312.31 | 2022-04-10 23:23:36.232 [rank:1] [train], epoch: 10/50, iter: 100/834, loss: 0.36757, top1: 0.53214, throughput: 1304.95 | 2022-04-10 23:23:36.232 [rank:6] [train], epoch: 10/50, iter: 200/834, loss: 0.36492, top1: 0.53203, throughput: 1320.05 | 2022-04-10 23:23:50.777 [rank:5] [train], epoch: 10/50, iter: 200/834, loss: 0.36593, top1: 0.53078, throughput: 1319.97 | 2022-04-10 23:23:50.777 [rank:1] [train], epoch: 10/50, iter: 200/834, loss: 0.36556, top1: 0.52818, throughput: 1319.99 | 2022-04-10 23:23:50.778 [rank:3] [train], epoch: 10/50, iter: 200/834, loss: 0.36919, top1: 0.52375, throughput: 1319.73 | 2022-04-10 23:23:50.780 [rank:2] [train], epoch: 10/50, iter: 200/834, loss: 0.36534, top1: 0.53286, throughput: 1319.64 | 2022-04-10 23:23:50.780 [rank:7] [train], epoch: 10/50, iter: 200/834, loss: 0.36686, top1: 0.52562, throughput: 1319.77 | 2022-04-10 23:23:50.779 [rank:4] [train], epoch: 10/50, iter: 200/834, loss: 0.36691, top1: 0.53099, throughput: 1319.68 | 2022-04-10 23:23:50.779 [rank:0] [train], epoch: 10/50, iter: 200/834, loss: 0.36421, top1: 0.53318, throughput: 1319.65 | 2022-04-10 23:23:50.781 [rank:3] [train], epoch: 10/50, iter: 300/834, loss: 0.36290, top1: 0.53245, throughput: 1306.45 | 2022-04-10 23:24:05.476 [rank:2] [train], epoch: 10/50, iter: 300/834, loss: 0.36560, top1: 0.52969, throughput: 1306.49 | 2022-04-10 23:24:05.476 [rank:5] [train], epoch: 10/50, iter: 300/834, loss: 0.36507, top1: 0.53250, throughput: 1306.28 | 2022-04-10 23:24:05.475 [rank:1] [train], epoch: 10/50, iter: 300/834, loss: 0.36699, top1: 0.52661, throughput: 1306.33 | 2022-04-10 23:24:05.476 [rank:6] [train], epoch: 10/50, iter: 
300/834, loss: 0.36440, top1: 0.53365, throughput: 1306.17 | 2022-04-10 23:24:05.476 [rank:0] [train], epoch: 10/50, iter: 300/834, loss: 0.36704, top1: 0.52719, throughput: 1306.54 | 2022-04-10 23:24:05.476 [rank:4] [train], epoch: 10/50, iter: 300/834, loss: 0.36505, top1: 0.52964, throughput: 1306.41 | 2022-04-10 23:24:05.476 [rank:7] [train], epoch: 10/50, iter: 300/834, loss: 0.36564, top1: 0.53292, throughput: 1306.41 | 2022-04-10 23:24:05.475 [rank:5] [train], epoch: 10/50, iter: 400/834, loss: 0.36749, top1: 0.52755, throughput: 1315.67 | 2022-04-10 23:24:20.069 [rank:6] [train], epoch: 10/50, iter: 400/834, loss: 0.36547, top1: 0.53161, throughput: 1315.58 | 2022-04-10 23:24:20.070 [rank:1] [train], epoch: 10/50, iter: 400/834, loss: 0.36670, top1: 0.52745, throughput: 1315.60 | 2022-04-10 23:24:20.070 [rank:2] [train], epoch: 10/50, iter: 400/834, loss: 0.36592, top1: 0.52609, throughput: 1315.66 | 2022-04-10 23:24:20.069 [rank:4] [train], epoch: 10/50, iter: 400/834, loss: 0.36796, top1: 0.52385, throughput: 1315.67 | 2022-04-10 23:24:20.069 [rank:7] [train], epoch: 10/50, iter: 400/834, loss: 0.36564, top1: 0.53521, throughput: 1315.59 | 2022-04-10 23:24:20.070 [rank:3] [train], epoch: 10/50, iter: 400/834, loss: 0.36561, top1: 0.52771, throughput: 1315.36 | 2022-04-10 23:24:20.073 [rank:0] [train], epoch: 10/50, iter: 400/834, loss: 0.36924, top1: 0.52224, throughput: 1315.36 | 2022-04-10 23:24:20.073 [rank:5] [train], epoch: 10/50, iter: 500/834, loss: 0.36725, top1: 0.52562, throughput: 1296.80 | 2022-04-10 23:24:34.874 [rank:0] [train], epoch: 10/50, iter: 500/834, loss: 0.36608, top1: 0.52703, throughput: 1297.04 | 2022-04-10 23:24:34.876 [rank:6] [train], epoch: 10/50, iter: 500/834, loss: 0.36568, top1: 0.53010, throughput: 1296.81 | 2022-04-10 23:24:34.876 [rank:2] [train], epoch: 10/50, iter: 500/834, loss: 0.36821, top1: 0.52531, throughput: 1296.27 | 2022-04-10 23:24:34.881 [rank:3] [train], epoch: 10/50, iter: 500/834, loss: 0.36797, top1: 
0.52812, throughput: 1296.93 | 2022-04-10 23:24:34.877 [rank:4] [train], epoch: 10/50, iter: 500/834, loss: 0.36753, top1: 0.52839, throughput: 1296.63 | 2022-04-10 23:24:34.877 [rank:1] [train], epoch: 10/50, iter: 500/834, loss: 0.36560, top1: 0.53042, throughput: 1296.37 | 2022-04-10 23:24:34.880 [rank:7] [train], epoch: 10/50, iter: 500/834, loss: 0.36926, top1: 0.52635, throughput: 1296.59 | 2022-04-10 23:24:34.878 [rank:4] [train], epoch: 10/50, iter: 600/834, loss: 0.36767, top1: 0.52406, throughput: 1312.27 | 2022-04-10 23:24:49.508 [rank:5] [train], epoch: 10/50, iter: 600/834, loss: 0.36785, top1: 0.52391, throughput: 1312.12 | 2022-04-10 23:24:49.507 [rank:1] [train], epoch: 10/50, iter: 600/834, loss: 0.36554, top1: 0.52760, throughput: 1312.48 | 2022-04-10 23:24:49.509 [rank:6] [train], epoch: 10/50, iter: 600/834, loss: 0.36810, top1: 0.52719, throughput: 1312.21 | 2022-04-10 23:24:49.508 [rank:3] [train], epoch: 10/50, iter: 600/834, loss: 0.36384, top1: 0.53661, throughput: 1311.92 | 2022-04-10 23:24:49.512 [rank:7] [train], epoch: 10/50, iter: 600/834, loss: 0.36648, top1: 0.53099, throughput: 1312.09 | 2022-04-10 23:24:49.511 [rank:0] [train], epoch: 10/50, iter: 600/834, loss: 0.36906, top1: 0.52656, throughput: 1311.95 | 2022-04-10 23:24:49.510 [rank:2] [train], epoch: 10/50, iter: 600/834, loss: 0.36858, top1: 0.52536, throughput: 1312.56 | 2022-04-10 23:24:49.508 [rank:2] [train], epoch: 10/50, iter: 700/834, loss: 0.36390, top1: 0.53490, throughput: 1312.90 | 2022-04-10 23:25:04.133 [rank:4] [train], epoch: 10/50, iter: 700/834, loss: 0.36809, top1: 0.52260, throughput: 1312.84 | 2022-04-10 23:25:04.132 [rank:5] [train], epoch: 10/50, iter: 700/834, loss: 0.36674, top1: 0.52729, throughput: 1312.81 | 2022-04-10 23:25:04.132 [rank:6] [train], epoch: 10/50, iter: 700/834, loss: 0.37156, top1: 0.51990, throughput: 1312.84 | 2022-04-10 23:25:04.132 [rank:3] [train], epoch: 10/50, iter: 700/834, loss: 0.36827, top1: 0.52755, throughput: 1313.08 | 
2022-04-10 23:25:04.134 [rank:0] [train], epoch: 10/50, iter: 700/834, loss: 0.36708, top1: 0.52286, throughput: 1312.99 | 2022-04-10 23:25:04.134 [rank:1] [train], epoch: 10/50, iter: 700/834, loss: 0.36590, top1: 0.53281, throughput: 1312.67 | 2022-04-10 23:25:04.136 [rank:7] [train], epoch: 10/50, iter: 700/834, loss: 0.36883, top1: 0.52604, throughput: 1313.05 | 2022-04-10 23:25:04.133 [rank:3] [train], epoch: 10/50, iter: 800/834, loss: 0.36514, top1: 0.53255, throughput: 1301.00 | 2022-04-10 23:25:18.892 [rank:6] [train], epoch: 10/50, iter: 800/834, loss: 0.36420, top1: 0.53505, throughput: 1300.98 | 2022-04-10 23:25:18.891 [rank:0] [train], epoch: 10/50, iter: 800/834, loss: 0.36964, top1: 0.52339, throughput: 1301.06 | 2022-04-10 23:25:18.891 [rank:4] [train], epoch: 10/50, iter: 800/834, loss: 0.36588, top1: 0.53063, throughput: 1301.01 | 2022-04-10 23:25:18.890 [rank:5] [train], epoch: 10/50, iter: 800/834, loss: 0.36361, top1: 0.53214, throughput: 1300.96 | 2022-04-10 23:25:18.891 [rank:1] [train], epoch: 10/50, iter: 800/834, loss: 0.36609, top1: 0.53297, throughput: 1301.23 | 2022-04-10 23:25:18.891 [rank:7] [train], epoch: 10/50, iter: 800/834, loss: 0.36551, top1: 0.53333, throughput: 1301.07 | 2022-04-10 23:25:18.890 [rank:2] [train], epoch: 10/50, iter: 800/834, loss: 0.36754, top1: 0.52417, throughput: 1300.93 | 2022-04-10 23:25:18.891 [rank:4] [train], epoch: 10/50, iter: 834/834, loss: 0.36797, top1: 0.52313, throughput: 1312.78 | 2022-04-10 23:25:23.863 [rank:5] [train], epoch: 10/50, iter: 834/834, loss: 0.36561, top1: 0.53309, throughput: 1312.77 | 2022-04-10 23:25:23.863 [rank:1] [train], epoch: 10/50, iter: 834/834, loss: 0.36459, top1: 0.53385, throughput: 1312.66 | 2022-04-10 23:25:23.864 [rank:2] [train], epoch: 10/50, iter: 834/834, loss: 0.36706, top1: 0.53416, throughput: 1312.52 | 2022-04-10 23:25:23.865 [rank:6] [train], epoch: 10/50, iter: 834/834, loss: 0.36755, top1: 0.52696, throughput: 1312.25 | 2022-04-10 23:25:23.865 
[rank:3] [train], epoch: 10/50, iter: 834/834, loss: 0.36605, top1: 0.52420, throughput: 1312.51 | 2022-04-10 23:25:23.866 [rank:0] [train], epoch: 10/50, iter: 834/834, loss: 0.36139, top1: 0.53431, throughput: 1312.02 | 2022-04-10 23:25:23.866 [rank:7] [train], epoch: 10/50, iter: 834/834, loss: 0.36160, top1: 0.53906, throughput: 1311.97 | 2022-04-10 23:25:23.866 [rank:0] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.53632, throughput: 579.68 | 2022-04-10 23:25:34.648 [rank:7] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.53712, throughput: 577.99 | 2022-04-10 23:25:34.679 [rank:2] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.52992, throughput: 572.38 | 2022-04-10 23:25:34.784 [rank:3] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.51808, throughput: 570.64 | 2022-04-10 23:25:34.818 [rank:6] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.52864, throughput: 570.41 | 2022-04-10 23:25:34.822 [rank:5] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.52576, throughput: 569.99 | 2022-04-10 23:25:34.829 [rank:4] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.52112, throughput: 567.09 | 2022-04-10 23:25:34.884 [rank:1] [eval], epoch: 10/50, iter: 125/125, loss: 0.00000, top1: 0.52880, throughput: 562.82 | 2022-04-10 23:25:34.969 [rank:6] [train], epoch: 11/50, iter: 100/834, loss: 0.35944, top1: 0.54490, throughput: 1298.50 | 2022-04-10 23:25:49.608 [rank:4] [train], epoch: 11/50, iter: 100/834, loss: 0.36044, top1: 0.53964, throughput: 1303.87 | 2022-04-10 23:25:49.609 [rank:3] [train], epoch: 11/50, iter: 100/834, loss: 0.36176, top1: 0.53635, throughput: 1297.92 | 2022-04-10 23:25:49.611 [rank:1] [train], epoch: 11/50, iter: 100/834, loss: 0.35892, top1: 0.53786, throughput: 1311.26 | 2022-04-10 23:25:49.611 [rank:0] [train], epoch: 11/50, iter: 100/834, loss: 0.35876, top1: 0.54151, throughput: 1283.21 | 2022-04-10 23:25:49.611 [rank:5] [train], epoch: 11/50, iter: 100/834, 
loss: 0.35984, top1: 0.54255, throughput: 1298.79 | 2022-04-10 23:25:49.612 [rank:2] [train], epoch: 11/50, iter: 100/834, loss: 0.36193, top1: 0.53729, throughput: 1294.94 | 2022-04-10 23:25:49.611 [rank:7] [train], epoch: 11/50, iter: 100/834, loss: 0.35870, top1: 0.54297, throughput: 1285.77 | 2022-04-10 23:25:49.612 [rank:6] [train], epoch: 11/50, iter: 200/834, loss: 0.36242, top1: 0.53781, throughput: 1315.37 | 2022-04-10 23:26:04.205 [rank:4] [train], epoch: 11/50, iter: 200/834, loss: 0.36170, top1: 0.53385, throughput: 1315.48 | 2022-04-10 23:26:04.205 [rank:5] [train], epoch: 11/50, iter: 200/834, loss: 0.36167, top1: 0.53672, throughput: 1315.55 | 2022-04-10 23:26:04.206 [rank:3] [train], epoch: 11/50, iter: 200/834, loss: 0.36010, top1: 0.54000, throughput: 1315.27 | 2022-04-10 23:26:04.209 [rank:7] [train], epoch: 11/50, iter: 200/834, loss: 0.36320, top1: 0.53406, throughput: 1315.50 | 2022-04-10 23:26:04.207 [rank:0] [train], epoch: 11/50, iter: 200/834, loss: 0.36042, top1: 0.53833, throughput: 1315.34 | 2022-04-10 23:26:04.208 [rank:1] [train], epoch: 11/50, iter: 200/834, loss: 0.36211, top1: 0.53729, throughput: 1315.02 | 2022-04-10 23:26:04.212 [rank:2] [train], epoch: 11/50, iter: 200/834, loss: 0.35574, top1: 0.54849, throughput: 1315.00 | 2022-04-10 23:26:04.212 [rank:5] [train], epoch: 11/50, iter: 300/834, loss: 0.36092, top1: 0.53927, throughput: 1313.96 | 2022-04-10 23:26:18.819 [rank:4] [train], epoch: 11/50, iter: 300/834, loss: 0.36004, top1: 0.53927, throughput: 1313.65 | 2022-04-10 23:26:18.821 [rank:3] [train], epoch: 11/50, iter: 300/834, loss: 0.36210, top1: 0.53599, throughput: 1313.79 | 2022-04-10 23:26:18.823 [rank:2] [train], epoch: 11/50, iter: 300/834, loss: 0.36452, top1: 0.53125, throughput: 1314.26 | 2022-04-10 23:26:18.821 [rank:6] [train], epoch: 11/50, iter: 300/834, loss: 0.36337, top1: 0.53479, throughput: 1313.66 | 2022-04-10 23:26:18.821 [rank:0] [train], epoch: 11/50, iter: 300/834, loss: 0.36014, top1: 0.53969, 
throughput: 1313.79 | 2022-04-10 23:26:18.822 [rank:7] [train], epoch: 11/50, iter: 300/834, loss: 0.36457, top1: 0.53250, throughput: 1313.82 | 2022-04-10 23:26:18.821 [rank:1] [train], epoch: 11/50, iter: 300/834, loss: 0.36379, top1: 0.53812, throughput: 1314.06 | 2022-04-10 23:26:18.823 [rank:4] [train], epoch: 11/50, iter: 400/834, loss: 0.35897, top1: 0.53766, throughput: 1316.11 | 2022-04-10 23:26:33.409 [rank:6] [train], epoch: 11/50, iter: 400/834, loss: 0.36255, top1: 0.53781, throughput: 1316.00 | 2022-04-10 23:26:33.411 [rank:1] [train], epoch: 11/50, iter: 400/834, loss: 0.36316, top1: 0.53526, throughput: 1316.23 | 2022-04-10 23:26:33.410 [rank:5] [train], epoch: 11/50, iter: 400/834, loss: 0.35921, top1: 0.54151, throughput: 1315.72 | 2022-04-10 23:26:33.411 [rank:0] [train], epoch: 11/50, iter: 400/834, loss: 0.36253, top1: 0.53312, throughput: 1316.08 | 2022-04-10 23:26:33.411 [rank:2] [train], epoch: 11/50, iter: 400/834, loss: 0.36440, top1: 0.53474, throughput: 1315.94 | 2022-04-10 23:26:33.411 [rank:7] [train], epoch: 11/50, iter: 400/834, loss: 0.36425, top1: 0.53354, throughput: 1316.04 | 2022-04-10 23:26:33.410 [rank:3] [train], epoch: 11/50, iter: 400/834, loss: 0.36587, top1: 0.53026, throughput: 1316.12 | 2022-04-10 23:26:33.411 [rank:4] [train], epoch: 11/50, iter: 500/834, loss: 0.36385, top1: 0.53255, throughput: 1304.62 | 2022-04-10 23:26:48.126 [rank:2] [train], epoch: 11/50, iter: 500/834, loss: 0.36305, top1: 0.53151, throughput: 1304.84 | 2022-04-10 23:26:48.126 [rank:1] [train], epoch: 11/50, iter: 500/834, loss: 0.36404, top1: 0.53484, throughput: 1304.74 | 2022-04-10 23:26:48.126 [rank:5] [train], epoch: 11/50, iter: 500/834, loss: 0.36137, top1: 0.54016, throughput: 1304.87 | 2022-04-10 23:26:48.125 [rank:0] [train], epoch: 11/50, iter: 500/834, loss: 0.36139, top1: 0.53818, throughput: 1304.76 | 2022-04-10 23:26:48.126 [rank:6] [train], epoch: 11/50, iter: 500/834, loss: 0.36063, top1: 0.53667, throughput: 1304.72 | 
2022-04-10 23:26:48.126 [rank:3] [train], epoch: 11/50, iter: 500/834, loss: 0.36081, top1: 0.53958, throughput: 1304.57[rank:7] [train], epoch: 11/50, iter: 500/834, loss: 0.36081, top1: 0.53859, throughput: 1304.60 | 2022-04-10 23:26:48.128 | 2022-04-10 23:26:48.129 [rank:6] [train], epoch: 11/50, iter: 600/834, loss: 0.35959, top1: 0.53870, throughput: 1314.51 | 2022-04-10 23:27:02.732 [rank:2] [train], epoch: 11/50, iter: 600/834, loss: 0.35889, top1: 0.54396, throughput: 1314.57 | 2022-04-10 23:27:02.731 [rank:5] [train], epoch: 11/50, iter: 600/834, loss: 0.36329, top1: 0.53589, throughput: 1314.53 | 2022-04-10 23:27:02.731 [rank:4] [train], epoch: 11/50, iter: 600/834, loss: 0.35843, top1: 0.54734, throughput: 1314.60 | 2022-04-10 23:27:02.731 [rank:7] [train], epoch: 11/50, iter: 600/834, loss: 0.35896, top1: 0.53745, throughput: 1314.64 | 2022-04-10 23:27:02.732 [rank:1] [train], epoch: 11/50, iter: 600/834, loss: 0.36305, top1: 0.53354, throughput: 1314.30 | 2022-04-10 23:27:02.734 [rank:0] [train], epoch: 11/50, iter: 600/834, loss: 0.36309, top1: 0.53578, throughput: 1314.32 | 2022-04-10 23:27:02.734 [rank:3] [train], epoch: 11/50, iter: 600/834, loss: 0.36159, top1: 0.53651, throughput: 1314.57 | 2022-04-10 23:27:02.734 [rank:2] [train], epoch: 11/50, iter: 700/834, loss: 0.36449, top1: 0.53109, throughput: 1316.44 | 2022-04-10 23:27:17.316 [rank:6] [train], epoch: 11/50, iter: 700/834, loss: 0.35776, top1: 0.54005, throughput: 1316.62 | 2022-04-10 23:27:17.315 [rank:5] [train], epoch: 11/50, iter: 700/834, loss: 0.36121, top1: 0.53609, throughput: 1316.52 | 2022-04-10 23:27:17.315 [rank:1] [train], epoch: 11/50, iter: 700/834, loss: 0.36215, top1: 0.53521, throughput: 1316.75 | 2022-04-10 23:27:17.316 [rank:4] [train], epoch: 11/50, iter: 700/834, loss: 0.36230, top1: 0.53729, throughput: 1316.49[rank:3] [train], epoch: 11/50, iter: 700/834, loss: 0.35846, top1: 0.54448, throughput: 1316.60 | 2022-04-10 23:27:17.316 | 2022-04-10 23:27:17.317 [rank:0] 
[train], epoch: 11/50, iter: 700/834, loss: 0.36153, top1: 0.53792, throughput: 1316.75 | 2022-04-10 23:27:17.316 [rank:7] [train], epoch: 11/50, iter: 700/834, loss: 0.36189, top1: 0.53927, throughput: 1316.37 | 2022-04-10 23:27:17.318 [rank:3] [train], epoch: 11/50, iter: 800/834, loss: 0.36107, top1: 0.53859, throughput: 1315.97 | 2022-04-10 23:27:31.907 [rank:5] [train], epoch: 11/50, iter: 800/834, loss: 0.36246, top1: 0.53682, throughput: 1316.07 | 2022-04-10 23:27:31.904 [rank:6] [train], epoch: 11/50, iter: 800/834, loss: 0.36239, top1: 0.53375, throughput: 1315.96 | 2022-04-10 23:27:31.905 [rank:7] [train], epoch: 11/50, iter: 800/834, loss: 0.36273, top1: 0.53568, throughput: 1316.05 | 2022-04-10 23:27:31.907 [rank:2] [train], epoch: 11/50, iter: 800/834, loss: 0.36607, top1: 0.52958, throughput: 1315.89 | 2022-04-10 23:27:31.907 [rank:4] [train], epoch: 11/50, iter: 800/834, loss: 0.36179, top1: 0.53797, throughput: 1315.83 | 2022-04-10 23:27:31.907 [rank:1] [train], epoch: 11/50, iter: 800/834, loss: 0.36104, top1: 0.54292, throughput: 1315.82 | 2022-04-10 23:27:31.907 [rank:0] [train], epoch: 11/50, iter: 800/834, loss: 0.36276, top1: 0.53437, throughput: 1315.87 | 2022-04-10 23:27:31.907 [rank:5] [train], epoch: 11/50, iter: 834/834, loss: 0.36262, top1: 0.53248, throughput: 1312.34 | 2022-04-10 23:27:36.879 [rank:4] [train], epoch: 11/50, iter: 834/834, loss: 0.36135, top1: 0.53370, throughput: 1312.80 | 2022-04-10 23:27:36.880 [rank:6] [train], epoch: 11/50, iter: 834/834, loss: 0.35985, top1: 0.54396, throughput: 1312.57 | 2022-04-10 23:27:36.879 [rank:2] [train], epoch: 11/50, iter: 834/834, loss: 0.35779, top1: 0.54458, throughput: 1312.71 | 2022-04-10 23:27:36.880 [rank:1] [train], epoch: 11/50, iter: 834/834, loss: 0.35687, top1: 0.55116, throughput: 1312.42 | 2022-04-10 23:27:36.881 [rank:0] [train], epoch: 11/50, iter: 834/834, loss: 0.35905, top1: 0.53646, throughput: 1312.18 | 2022-04-10 23:27:36.882 [rank:3] [train], epoch: 11/50, iter: 
834/834, loss: 0.35683, top1: 0.54596, throughput: 1312.32 | 2022-04-10 23:27:36.882 [rank:7] [train], epoch: 11/50, iter: 834/834, loss: 0.35633, top1: 0.53998, throughput: 1311.89 | 2022-04-10 23:27:36.883 [rank:0] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.53600, throughput: 572.62 | 2022-04-10 23:27:47.796 [rank:7] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.52192, throughput: 571.70 | 2022-04-10 23:27:47.815 [rank:2] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.51248, throughput: 567.30 | 2022-04-10 23:27:47.897 [rank:3] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.51392, throughput: 567.38 | 2022-04-10 23:27:47.897 [rank:6] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.52768, throughput: 565.31 | 2022-04-10 23:27:47.935 [rank:4] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.52320, throughput: 564.88 | 2022-04-10 23:27:47.944 [rank:1] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.53040, throughput: 564.36 | 2022-04-10 23:27:47.956 [rank:5] [eval], epoch: 11/50, iter: 125/125, loss: 0.00000, top1: 0.51616, throughput: 558.88 | 2022-04-10 23:27:48.062 [rank:5] [train], epoch: 12/50, iter: 100/834, loss: 0.35169, top1: 0.55859, throughput: 1320.59 | 2022-04-10 23:28:02.601 [rank:6] [train], epoch: 12/50, iter: 100/834, loss: 0.35697, top1: 0.54349, throughput: 1309.07 | 2022-04-10 23:28:02.602 [rank:2] [train], epoch: 12/50, iter: 100/834, loss: 0.35445, top1: 0.54844, throughput: 1305.70 | 2022-04-10 23:28:02.602 [rank:7] [train], epoch: 12/50, iter: 100/834, loss: 0.35612, top1: 0.54844, throughput: 1298.43 | 2022-04-10 23:28:02.602 [rank:3] [train], epoch: 12/50, iter: 100/834, loss: 0.35685, top1: 0.54354, throughput: 1305.60 | 2022-04-10 23:28:02.603 [rank:1] [train], epoch: 12/50, iter: 100/834, loss: 0.35378, top1: 0.55365, throughput: 1310.72 | 2022-04-10 23:28:02.604 [rank:4] [train], epoch: 12/50, iter: 100/834, loss: 0.35751, top1: 0.54427, 
throughput: 1309.74 | 2022-04-10 23:28:02.603 [rank:0] [train], epoch: 12/50, iter: 100/834, loss: 0.35735, top1: 0.54984, throughput: 1296.58 | 2022-04-10 23:28:02.605 [rank:4] [train], epoch: 12/50, iter: 200/834, loss: 0.35432, top1: 0.55281, throughput: 1314.97 | 2022-04-10 23:28:17.204 [rank:6] [train], epoch: 12/50, iter: 200/834, loss: 0.35781, top1: 0.54255, throughput: 1314.68 | 2022-04-10 23:28:17.206 [rank:7] [train], epoch: 12/50, iter: 200/834, loss: 0.35706, top1: 0.54797, throughput: 1314.84 | 2022-04-10 23:28:17.205 [rank:0] [train], epoch: 12/50, iter: 200/834, loss: 0.35170, top1: 0.55714, throughput: 1315.01 | 2022-04-10 23:28:17.205 [rank:3] [train], epoch: 12/50, iter: 200/834, loss: 0.35944, top1: 0.53828, throughput: 1314.74 | 2022-04-10 23:28:17.207 [rank:1] [train], epoch: 12/50, iter: 200/834, loss: 0.35843, top1: 0.54229, throughput: 1314.96 | 2022-04-10 23:28:17.206 [rank:2] [train], epoch: 12/50, iter: 200/834, loss: 0.35535, top1: 0.55031, throughput: 1314.57 | 2022-04-10 23:28:17.207 [rank:5] [train], epoch: 12/50, iter: 200/834, loss: 0.35794, top1: 0.54016, throughput: 1314.60 | 2022-04-10 23:28:17.206 [rank:5] [train], epoch: 12/50, iter: 300/834, loss: 0.35566, top1: 0.55073, throughput: 1314.56 | 2022-04-10 23:28:31.811 [rank:2] [train], epoch: 12/50, iter: 300/834, loss: 0.35834, top1: 0.54318, throughput: 1314.61 | 2022-04-10 23:28:31.812 [rank:6] [train], epoch: 12/50, iter: 300/834, loss: 0.35731, top1: 0.54781, throughput: 1314.45 | 2022-04-10 23:28:31.813 [rank:0] [train], epoch: 12/50, iter: 300/834, loss: 0.35389, top1: 0.55135, throughput: 1314.37 | 2022-04-10 23:28:31.813 [rank:3] [train], epoch: 12/50, iter: 300/834, loss: 0.35863, top1: 0.54214, throughput: 1314.37 | 2022-04-10 23:28:31.815 [rank:7] [train], epoch: 12/50, iter: 300/834, loss: 0.35679, top1: 0.54495, throughput: 1314.36 | 2022-04-10 23:28:31.813 [rank:4] [train], epoch: 12/50, iter: 300/834, loss: 0.35760, top1: 0.54750, throughput: 1314.24 | 
2022-04-10 23:28:31.814 [rank:1] [train], epoch: 12/50, iter: 300/834, loss: 0.36009, top1: 0.54646, throughput: 1314.26 | 2022-04-10 23:28:31.815 [rank:6] [train], epoch: 12/50, iter: 400/834, loss: 0.35772, top1: 0.54380, throughput: 1314.85 | 2022-04-10 23:28:46.415 [rank:2] [train], epoch: 12/50, iter: 400/834, loss: 0.35841, top1: 0.53958, throughput: 1314.76 | 2022-04-10 23:28:46.416 [rank:3] [train], epoch: 12/50, iter: 400/834, loss: 0.35345, top1: 0.54969, throughput: 1314.88 | 2022-04-10 23:28:46.417 [rank:5] [train], epoch: 12/50, iter: 400/834, loss: 0.35296, top1: 0.55182, throughput: 1314.72 | 2022-04-10 23:28:46.415 [rank:1] [train], epoch: 12/50, iter: 400/834, loss: 0.35543, top1: 0.54547, throughput: 1314.76 | 2022-04-10 23:28:46.418 [rank:4] [train], epoch: 12/50, iter: 400/834, loss: 0.35604, top1: 0.54682, throughput: 1314.74 | 2022-04-10 23:28:46.417 [rank:0] [train], epoch: 12/50, iter: 400/834, loss: 0.35563, top1: 0.54813, throughput: 1314.73 | 2022-04-10 23:28:46.417 [rank:7] [train], epoch: 12/50, iter: 400/834, loss: 0.35682, top1: 0.54359, throughput: 1314.51 | 2022-04-10 23:28:46.419 [rank:4] [train], epoch: 12/50, iter: 500/834, loss: 0.35509, top1: 0.54844, throughput: 1311.40 | 2022-04-10 23:29:01.058 [rank:2] [train], epoch: 12/50, iter: 500/834, loss: 0.35585, top1: 0.54974, throughput: 1311.36 | 2022-04-10 23:29:01.057 [rank:6] [train], epoch: 12/50, iter: 500/834, loss: 0.35701, top1: 0.54724, throughput: 1311.32 | 2022-04-10 23:29:01.057 [rank:5] [train], epoch: 12/50, iter: 500/834, loss: 0.35584, top1: 0.54849, throughput: 1311.16 | 2022-04-10 23:29:01.059 [rank:7] [train], epoch: 12/50, iter: 500/834, loss: 0.35714, top1: 0.54755, throughput: 1311.55 | 2022-04-10 23:29:01.058 [rank:1] [train], epoch: 12/50, iter: 500/834, loss: 0.35901, top1: 0.54146, throughput: 1311.29 | 2022-04-10 23:29:01.060 [rank:0] [train], epoch: 12/50, iter: 500/834, loss: 0.35893, top1: 0.54745, throughput: 1311.33 | 2022-04-10 23:29:01.058 
[rank:3] [train], epoch: 12/50, iter: 500/834, loss: 0.35805, top1: 0.54078, throughput: 1311.25 | 2022-04-10 23:29:01.059 [rank:5] [train], epoch: 12/50, iter: 600/834, loss: 0.35825, top1: 0.54151, throughput: 1314.32 | 2022-04-10 23:29:15.667 [rank:6] [train], epoch: 12/50, iter: 600/834, loss: 0.35630, top1: 0.54510, throughput: 1314.20 | 2022-04-10 23:29:15.667 [rank:2] [train], epoch: 12/50, iter: 600/834, loss: 0.35665, top1: 0.54745, throughput: 1314.21 | 2022-04-10 23:29:15.667 [rank:1] [train], epoch: 12/50, iter: 600/834, loss: 0.36089, top1: 0.53911, throughput: 1314.44 | 2022-04-10 23:29:15.667 [rank:4] [train], epoch: 12/50, iter: 600/834, loss: 0.35897, top1: 0.53870, throughput: 1314.29 | 2022-04-10 23:29:15.667 [rank:0] [train], epoch: 12/50, iter: 600/834, loss: 0.35699, top1: 0.54682, throughput: 1314.15 | 2022-04-10 23:29:15.669 [rank:7] [train], epoch: 12/50, iter: 600/834, loss: 0.35635, top1: 0.54583, throughput: 1314.07 | 2022-04-10 23:29:15.669 [rank:3] [train], epoch: 12/50, iter: 600/834, loss: 0.35857, top1: 0.54276, throughput: 1314.05 | 2022-04-10 23:29:15.671 [rank:4] [train], epoch: 12/50, iter: 700/834, loss: 0.35897, top1: 0.53812, throughput: 1310.01 | 2022-04-10 23:29:30.323 [rank:2] [train], epoch: 12/50, iter: 700/834, loss: 0.35882, top1: 0.54589, throughput: 1309.93 | 2022-04-10 23:29:30.324 [rank:5] [train], epoch: 12/50, iter: 700/834, loss: 0.35708, top1: 0.54505, throughput: 1309.82 | 2022-04-10 23:29:30.326 [rank:6] [train], epoch: 12/50, iter: 700/834, loss: 0.35853, top1: 0.53724, throughput: 1309.93 | 2022-04-10 23:29:30.324 [rank:3] [train], epoch: 12/50, iter: 700/834, loss: 0.35693, top1: 0.54365, throughput: 1309.87 | 2022-04-10 23:29:30.329 [rank:7] [train], epoch: 12/50, iter: 700/834, loss: 0.35590, top1: 0.54979, throughput: 1310.03 | 2022-04-10 23:29:30.325 [rank:1] [train], epoch: 12/50, iter: 700/834, loss: 0.35526, top1: 0.55172, throughput: 1309.64 | 2022-04-10 23:29:30.327 [rank:0] [train], epoch: 12/50, 
iter: 700/834, loss: 0.35582, top1: 0.54188, throughput: 1309.79 | 2022-04-10 23:29:30.327 [rank:2] [train], epoch: 12/50, iter: 800/834, loss: 0.35767, top1: 0.54302, throughput: 1315.77 | 2022-04-10 23:29:44.916 [rank:6] [train], epoch: 12/50, iter: 800/834, loss: 0.35776, top1: 0.54464, throughput: 1315.72 | 2022-04-10 23:29:44.917 [rank:3] [train], epoch: 12/50, iter: 800/834, loss: 0.36224, top1: 0.53344, throughput: 1316.10 | 2022-04-10 23:29:44.917 [rank:4] [train], epoch: 12/50, iter: 800/834, loss: 0.35450, top1: 0.54620, throughput: 1315.76 | 2022-04-10 23:29:44.916 [rank:5] [train], epoch: 12/50, iter: 800/834, loss: 0.35638, top1: 0.54385, throughput: 1315.87 | 2022-04-10 23:29:44.917 [rank:0] [train], epoch: 12/50, iter: 800/834, loss: 0.35797, top1: 0.54260, throughput: 1315.96 | 2022-04-10 23:29:44.918 [rank:7] [train], epoch: 12/50, iter: 800/834, loss: 0.35840, top1: 0.53917, throughput: 1315.77 | 2022-04-10 23:29:44.918 [rank:1] [train], epoch: 12/50, iter: 800/834, loss: 0.35607, top1: 0.55276, throughput: 1315.81 | 2022-04-10 23:29:44.919 [rank:5] [train], epoch: 12/50, iter: 834/834, loss: 0.35568, top1: 0.55362, throughput: 1314.90 | 2022-04-10 23:29:49.881 [rank:6] [train], epoch: 12/50, iter: 834/834, loss: 0.35833, top1: 0.54642, throughput: 1314.97 | 2022-04-10 23:29:49.881 [rank:4] [train], epoch: 12/50, iter: 834/834, loss: 0.35696, top1: 0.54979, throughput: 1314.32 | 2022-04-10 23:29:49.882 [rank:2] [train], epoch: 12/50, iter: 834/834, loss: 0.36451, top1: 0.53048, throughput: 1314.51 | 2022-04-10 23:29:49.882 [rank:1] [train], epoch: 12/50, iter: 834/834, loss: 0.35533, top1: 0.54534, throughput: 1315.04 | 2022-04-10 23:29:49.883 [rank:0] [train], epoch: 12/50, iter: 834/834, loss: 0.36086, top1: 0.53569, throughput: 1314.63 | 2022-04-10 23:29:49.883 [rank:3] [train], epoch: 12/50, iter: 834/834, loss: 0.35426, top1: 0.55239, throughput: 1314.35 | 2022-04-10 23:29:49.884 [rank:7] [train], epoch: 12/50, iter: 834/834, loss: 0.35420, 
top1: 0.55515, throughput: 1314.48 | 2022-04-10 23:29:49.884 [rank:7] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.55424, throughput: 565.34 | 2022-04-10 23:30:00.939 [rank:0] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.54464, throughput: 564.98 | 2022-04-10 23:30:00.945 [rank:1] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.54688, throughput: 560.32 | 2022-04-10 23:30:01.038 [rank:3] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.53808, throughput: 556.84 | 2022-04-10 23:30:01.108 [rank:5] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.53648, throughput: 555.53 | 2022-04-10 23:30:01.132 [rank:6] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.54896, throughput: 554.52 | 2022-04-10 23:30:01.152 [rank:2] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.54160, throughput: 554.45 | 2022-04-10 23:30:01.155 [rank:4] [eval], epoch: 12/50, iter: 125/125, loss: 0.00000, top1: 0.53776, throughput: 548.06 | 2022-04-10 23:30:01.286 [rank:4] [train], epoch: 13/50, iter: 100/834, loss: 0.34843, top1: 0.55693, throughput: 1311.72 | 2022-04-10 23:30:15.924 [rank:2] [train], epoch: 13/50, iter: 100/834, loss: 0.35293, top1: 0.55510, throughput: 1299.93 | 2022-04-10 23:30:15.925 [rank:6] [train], epoch: 13/50, iter: 100/834, loss: 0.35222, top1: 0.55130, throughput: 1299.64 | 2022-04-10 23:30:15.926 [rank:5] [train], epoch: 13/50, iter: 100/834, loss: 0.35324, top1: 0.55615, throughput: 1297.86 | 2022-04-10 23:30:15.925 [rank:7] [train], epoch: 13/50, iter: 100/834, loss: 0.34569, top1: 0.56307, throughput: 1281.24 | 2022-04-10 23:30:15.925 [rank:0] [train], epoch: 13/50, iter: 100/834, loss: 0.35197, top1: 0.55464, throughput: 1281.71 | 2022-04-10 23:30:15.925 [rank:3] [train], epoch: 13/50, iter: 100/834, loss: 0.34893, top1: 0.56094, throughput: 1295.36 | 2022-04-10 23:30:15.930 [rank:1] [train], epoch: 13/50, iter: 100/834, loss: 0.34845, top1: 0.55807, throughput: 1289.28 | 2022-04-10 
23:30:15.930 [rank:5] [train], epoch: 13/50, iter: 200/834, loss: 0.35221, top1: 0.55359, throughput: 1313.38 | 2022-04-10 23:30:30.544 [rank:6] [train], epoch: 13/50, iter: 200/834, loss: 0.35109, top1: 0.55750, throughput: 1313.20 | 2022-04-10 23:30:30.546 [rank:4] [train], epoch: 13/50, iter: 200/834, loss: 0.35080, top1: 0.55271, throughput: 1313.16 | 2022-04-10 23:30:30.545 [rank:3] [train], epoch: 13/50, iter: 200/834, loss: 0.35231, top1: 0.55255, throughput: 1313.64 | 2022-04-10 23:30:30.546 [rank:1] [train], epoch: 13/50, iter: 200/834, loss: 0.35107, top1: 0.56031, throughput: 1313.51 | 2022-04-10 23:30:30.547 [rank:0] [train], epoch: 13/50, iter: 200/834, loss: 0.35451, top1: 0.54943, throughput: 1313.26 | 2022-04-10 23:30:30.546 [rank:2] [train], epoch: 13/50, iter: 200/834, loss: 0.35395, top1: 0.55214, throughput: 1313.03 | 2022-04-10 23:30:30.548 [rank:7] [train], epoch: 13/50, iter: 200/834, loss: 0.34998, top1: 0.55943, throughput: 1313.02 | 2022-04-10 23:30:30.547 [rank:5] [train], epoch: 13/50, iter: 300/834, loss: 0.35412, top1: 0.54745, throughput: 1315.46 | 2022-04-10 23:30:45.140 [rank:4] [train], epoch: 13/50, iter: 300/834, loss: 0.35350, top1: 0.55146, throughput: 1315.48 | 2022-04-10 23:30:45.140 [rank:2] [train], epoch: 13/50, iter: 300/834, loss: 0.35252, top1: 0.55391, throughput: 1315.71 | 2022-04-10 23:30:45.141 [rank:1] [train], epoch: 13/50, iter: 300/834, loss: 0.35522, top1: 0.54781, throughput: 1315.47 | 2022-04-10 23:30:45.142 [rank:3] [train], epoch: 13/50, iter: 300/834, loss: 0.35578, top1: 0.54635, throughput: 1315.38 | 2022-04-10 23:30:45.143 [rank:6] [train], epoch: 13/50, iter: 300/834, loss: 0.35424, top1: 0.55115, throughput: 1315.43 | 2022-04-10 23:30:45.142 [rank:0] [train], epoch: 13/50, iter: 300/834, loss: 0.35712, top1: 0.54172, throughput: 1315.36 | 2022-04-10 23:30:45.142 [rank:7] [train], epoch: 13/50, iter: 300/834, loss: 0.35229, top1: 0.55427, throughput: 1315.42 | 2022-04-10 23:30:45.143 [rank:4] [train], 
epoch: 13/50, iter: 400/834, loss: 0.35495, top1: 0.54661, throughput: 1314.20 | 2022-04-10 23:30:59.750 [rank:7] [train], epoch: 13/50, iter: 400/834, loss: 0.35295, top1: 0.55760, throughput: 1314.48 | 2022-04-10 23:30:59.750 [rank:6] [train], epoch: 13/50, iter: 400/834, loss: 0.35662, top1: 0.54672, throughput: 1314.06 | 2022-04-10 23:30:59.754 [rank:1] [train], epoch: 13/50, iter: 400/834, loss: 0.35393, top1: 0.55047, throughput: 1314.18 | 2022-04-10 23:30:59.752 [rank:2] [train], epoch: 13/50, iter: 400/834, loss: 0.35206, top1: 0.55089, throughput: 1314.05 | 2022-04-10 23:30:59.752 [rank:3] [train], epoch: 13/50, iter: 400/834, loss: 0.35324, top1: 0.55240, throughput: 1314.06 | 2022-04-10 23:30:59.754 [rank:5] [train], epoch: 13/50, iter: 400/834, loss: 0.35319, top1: 0.55234, throughput: 1313.86 | 2022-04-10 23:30:59.753 [rank:0] [train], epoch: 13/50, iter: 400/834, loss: 0.35312, top1: 0.55573, throughput: 1314.19 | 2022-04-10 23:30:59.752 [rank:2] [train], epoch: 13/50, iter: 500/834, loss: 0.35206, top1: 0.55156, throughput: 1315.66 | 2022-04-10 23:31:14.345 [rank:5] [train], epoch: 13/50, iter: 500/834, loss: 0.35521, top1: 0.54839, throughput: 1315.83 | 2022-04-10 23:31:14.345 [rank:6] [train], epoch: 13/50, iter: 500/834, loss: 0.35289, top1: 0.55547, throughput: 1315.96 | 2022-04-10 23:31:14.344 [rank:4] [train], epoch: 13/50, iter: 500/834, loss: 0.35259, top1: 0.54875, throughput: 1315.60 | 2022-04-10 23:31:14.344 [rank:1] [train], epoch: 13/50, iter: 500/834, loss: 0.35262, top1: 0.54979, throughput: 1315.72 | 2022-04-10 23:31:14.345 [rank:3] [train], epoch: 13/50, iter: 500/834, loss: 0.35130, top1: 0.55510, throughput: 1315.83 | 2022-04-10 23:31:14.345 [rank:7] [train], epoch: 13/50, iter: 500/834, loss: 0.35401, top1: 0.54891, throughput: 1315.52 | 2022-04-10 23:31:14.345 [rank:0] [train], epoch: 13/50, iter: 500/834, loss: 0.35622, top1: 0.54583, throughput: 1315.51 | 2022-04-10 23:31:14.347 [rank:4] [train], epoch: 13/50, iter: 600/834, 
loss: 0.35192, top1: 0.55047, throughput: 1313.60 | 2022-04-10 23:31:28.960 [rank:6] [train], epoch: 13/50, iter: 600/834, loss: 0.35490, top1: 0.54521, throughput: 1313.69 | 2022-04-10 23:31:28.959 [rank:2] [train], epoch: 13/50, iter: 600/834, loss: 0.34941, top1: 0.56089, throughput: 1313.78 | 2022-04-10 23:31:28.960 [rank:5] [train], epoch: 13/50, iter: 600/834, loss: 0.35505, top1: 0.54953, throughput: 1313.76 | 2022-04-10 23:31:28.959 [rank:3] [train], epoch: 13/50, iter: 600/834, loss: 0.35265, top1: 0.55625, throughput: 1313.65 | 2022-04-10 23:31:28.961 [rank:1] [train], epoch: 13/50, iter: 600/834, loss: 0.35274, top1: 0.55083, throughput: 1313.66 | 2022-04-10 23:31:28.961 [rank:0] [train], epoch: 13/50, iter: 600/834, loss: 0.35178, top1: 0.55297, throughput: 1313.86 | 2022-04-10 23:31:28.961 [rank:7] [train], epoch: 13/50, iter: 600/834, loss: 0.35423, top1: 0.54984, throughput: 1313.67 | 2022-04-10 23:31:28.961 [rank:2] [train], epoch: 13/50, iter: 700/834, loss: 0.35419, top1: 0.55229, throughput: 1306.05 | 2022-04-10 23:31:43.660 [rank:3] [train], epoch: 13/50, iter: 700/834, loss: 0.35472, top1: 0.55042, throughput: 1305.96 | 2022-04-10 23:31:43.663 [rank:7] [train], epoch: 13/50, iter: 700/834, loss: 0.35490, top1: 0.54964, throughput: 1306.01 | 2022-04-10 23:31:43.662 [rank:5] [train], epoch: 13/50, iter: 700/834, loss: 0.35611, top1: 0.54599, throughput: 1305.94 | 2022-04-10 23:31:43.661 [rank:4] [train], epoch: 13/50, iter: 700/834, loss: 0.35373, top1: 0.54885, throughput: 1305.99 | 2022-04-10 23:31:43.662 [rank:6] [train], epoch: 13/50, iter: 700/834, loss: 0.35388, top1: 0.55531, throughput: 1305.85 | 2022-04-10 23:31:43.662 [rank:1] [train], epoch: 13/50, iter: 700/834, loss: 0.35427, top1: 0.55479, throughput: 1305.72 | 2022-04-10 23:31:43.665 [rank:0] [train], epoch: 13/50, iter: 700/834, loss: 0.35162, top1: 0.55563, throughput: 1305.86 | 2022-04-10 23:31:43.664 [rank:4] [train], epoch: 13/50, iter: 800/834, loss: 0.35481, top1: 0.54703, 
throughput: 1314.50 | 2022-04-10 23:31:58.268 [rank:1] [train], epoch: 13/50, iter: 800/834, loss: 0.35351, top1: 0.55146, throughput: 1314.67 | 2022-04-10 23:31:58.270 [rank:3] [train], epoch: 13/50, iter: 800/834, loss: 0.35344, top1: 0.54969, throughput: 1314.37 | 2022-04-10 23:31:58.271 [rank:7] [train], epoch: 13/50, iter: 800/834, loss: 0.35232, top1: 0.55380, throughput: 1314.25 | 2022-04-10 23:31:58.271 [rank:2] [train], epoch: 13/50, iter: 800/834, loss: 0.35255, top1: 0.55458, throughput: 1313.99 | 2022-04-10 23:31:58.272 [rank:6] [train], epoch: 13/50, iter: 800/834, loss: 0.35406, top1: 0.55063, throughput: 1314.02 | 2022-04-10 23:31:58.274 [rank:5] [train], epoch: 13/50, iter: 800/834, loss: 0.35268, top1: 0.55443, throughput: 1313.97 | 2022-04-10 23:31:58.273 [rank:0] [train], epoch: 13/50, iter: 800/834, loss: 0.35574, top1: 0.55062, throughput: 1314.25 | 2022-04-10 23:31:58.273 [rank:5] [train], epoch: 13/50, iter: 834/834, loss: 0.35148, top1: 0.55545, throughput: 1309.98[rank:6] [train], epoch: 13/50, iter: 834/834, loss: 0.35016, top1: 0.56526, throughput: 1310.44 | 2022-04-10 23:32:03.257| 2022-04-10 23:32:03.255 [rank:2] [train], epoch: 13/50, iter: 834/834, loss: 0.35399, top1: 0.55254, throughput: 1309.95 | 2022-04-10 23:32:03.256 [rank:7] [train], epoch: 13/50, iter: 834/834, loss: 0.35645, top1: 0.55392, throughput: 1309.60 | 2022-04-10 23:32:03.256 [rank:4] [train], epoch: 13/50, iter: 834/834, loss: 0.35655, top1: 0.54917, throughput: 1308.48 | 2022-04-10 23:32:03.257 [rank:3] [train], epoch: 13/50, iter: 834/834, loss: 0.35533, top1: 0.54994, throughput: 1309.02 | 2022-04-10 23:32:03.258 [rank:1] [train], epoch: 13/50, iter: 834/834, loss: 0.36040, top1: 0.53753, throughput: 1308.44 | 2022-04-10 23:32:03.259 [rank:0] [train], epoch: 13/50, iter: 834/834, loss: 0.35536, top1: 0.54504, throughput: 1309.18 | 2022-04-10 23:32:03.259 [rank:7] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.56048, throughput: 581.79 | 2022-04-10 
23:32:13.998 [rank:0] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.56016, throughput: 581.10 | 2022-04-10 23:32:14.015 [rank:6] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.55200, throughput: 578.72 | 2022-04-10 23:32:14.055 [rank:3] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.53904, throughput: 576.53 | 2022-04-10 23:32:14.098 [rank:4] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.55696, throughput: 575.27 | 2022-04-10 23:32:14.121 [rank:2] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.54288, throughput: 573.76 | 2022-04-10 23:32:14.149 [rank:1] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.55632, throughput: 569.55 | 2022-04-10 23:32:14.232 [rank:5] [eval], epoch: 13/50, iter: 125/125, loss: 0.00000, top1: 0.54368, throughput: 566.74 | 2022-04-10 23:32:14.285 [rank:6] [train], epoch: 14/50, iter: 100/834, loss: 0.35042, top1: 0.55641, throughput: 1299.39 | 2022-04-10 23:32:28.831 [rank:4] [train], epoch: 14/50, iter: 100/834, loss: 0.34602, top1: 0.56599, throughput: 1305.24 | 2022-04-10 23:32:28.831 [rank:2] [train], epoch: 14/50, iter: 100/834, loss: 0.34841, top1: 0.56203, throughput: 1307.63 | 2022-04-10 23:32:28.832 [rank:3] [train], epoch: 14/50, iter: 100/834, loss: 0.34930, top1: 0.55943, throughput: 1303.03 | 2022-04-10 23:32:28.833 [rank:1] [train], epoch: 14/50, iter: 100/834, loss: 0.34868, top1: 0.56026, throughput: 1315.04 | 2022-04-10 23:32:28.833 [rank:5] [train], epoch: 14/50, iter: 100/834, loss: 0.34495, top1: 0.56464, throughput: 1319.74 | 2022-04-10 23:32:28.833 [rank:0] [train], epoch: 14/50, iter: 100/834, loss: 0.34720, top1: 0.56193, throughput: 1295.69 | 2022-04-10 23:32:28.833 [rank:7] [train], epoch: 14/50, iter: 100/834, loss: 0.34832, top1: 0.56385, throughput: 1294.15 | 2022-04-10 23:32:28.834 [rank:6] [train], epoch: 14/50, iter: 200/834, loss: 0.34853, top1: 0.56401, throughput: 1315.12 | 2022-04-10 23:32:43.431 [rank:4] [train], epoch: 14/50, 
iter: 200/834, loss: 0.34918, top1: 0.56177, throughput: 1315.12 | 2022-04-10 23:32:43.431 [rank:5] [train], epoch: 14/50, iter: 200/834, loss: 0.34779, top1: 0.56062, throughput: 1315.25 | 2022-04-10 23:32:43.431 [rank:7] [train], epoch: 14/50, iter: 200/834, loss: 0.35111, top1: 0.55484, throughput: 1315.26 | 2022-04-10 23:32:43.432 [rank:1] [train], epoch: 14/50, iter: 200/834, loss: 0.34889, top1: 0.56177, throughput: 1314.97[rank:2] [train], epoch: 14/50, iter: 200/834, loss: 0.34852, top1: 0.55964, throughput: 1314.95 | 2022-04-10 23:32:43.433| 2022-04-10 23:32:43.434 [rank:3] [train], epoch: 14/50, iter: 200/834, loss: 0.34735, top1: 0.56255, throughput: 1314.97 | 2022-04-10 23:32:43.434 [rank:0] [train], epoch: 14/50, iter: 200/834, loss: 0.34561, top1: 0.56609, throughput: 1314.87 | 2022-04-10 23:32:43.435 [rank:2] [train], epoch: 14/50, iter: 300/834, loss: 0.34835, top1: 0.56062, throughput: 1309.50 | 2022-04-10 23:32:58.095 [rank:6] [train], epoch: 14/50, iter: 300/834, loss: 0.34974, top1: 0.55714, throughput: 1309.28 | 2022-04-10 23:32:58.095 [rank:5] [train], epoch: 14/50, iter: 300/834, loss: 0.35094, top1: 0.55635, throughput: 1309.20 | 2022-04-10 23:32:58.097 [rank:3] [train], epoch: 14/50, iter: 300/834, loss: 0.35215, top1: 0.55073, throughput: 1309.49 | 2022-04-10 23:32:58.096 [rank:7] [train], epoch: 14/50, iter: 300/834, loss: 0.34792, top1: 0.56630, throughput: 1309.40 | 2022-04-10 23:32:58.095 [rank:4] [train], epoch: 14/50, iter: 300/834, loss: 0.35326, top1: 0.55552, throughput: 1309.36 | 2022-04-10 23:32:58.094 [rank:0] [train], epoch: 14/50, iter: 300/834, loss: 0.34919, top1: 0.56078, throughput: 1309.58 | 2022-04-10 23:32:58.096 [rank:1] [train], epoch: 14/50, iter: 300/834, loss: 0.34897, top1: 0.56089, throughput: 1309.43 | 2022-04-10 23:32:58.097 [rank:6] [train], epoch: 14/50, iter: 400/834, loss: 0.34894, top1: 0.56036, throughput: 1315.33 | 2022-04-10 23:33:12.692 [rank:5] [train], epoch: 14/50, iter: 400/834, loss: 0.34949, 
top1: 0.55964, throughput: 1315.48 | 2022-04-10 23:33:12.692 [rank:2] [train], epoch: 14/50, iter: 400/834, loss: 0.35159, top1: 0.55635, throughput: 1315.26 | 2022-04-10 23:33:12.693 [rank:1] [train], epoch: 14/50, iter: 400/834, loss: 0.34729, top1: 0.55969, throughput: 1315.22 | 2022-04-10 23:33:12.695 [rank:4] [train], epoch: 14/50, iter: 400/834, loss: 0.34737, top1: 0.56203, throughput: 1315.07 | 2022-04-10 23:33:12.694 [rank:3] [train], epoch: 14/50, iter: 400/834, loss: 0.34882, top1: 0.55875, throughput: 1315.11 | 2022-04-10 23:33:12.696 [rank:0] [train], epoch: 14/50, iter: 400/834, loss: 0.35159, top1: 0.55172, throughput: 1315.31 | 2022-04-10 23:33:12.694 [rank:7] [train], epoch: 14/50, iter: 400/834, loss: 0.35338, top1: 0.55036, throughput: 1315.10 | 2022-04-10 23:33:12.695 [rank:4] [train], epoch: 14/50, iter: 500/834, loss: 0.34855, top1: 0.56005, throughput: 1316.65 | 2022-04-10 23:33:27.277 [rank:5] [train], epoch: 14/50, iter: 500/834, loss: 0.35093, top1: 0.55547, throughput: 1316.47 | 2022-04-10 23:33:27.276 [rank:6] [train], epoch: 14/50, iter: 500/834, loss: 0.34942, top1: 0.55625, throughput: 1316.45 | 2022-04-10 23:33:27.277 [rank:1] [train], epoch: 14/50, iter: 500/834, loss: 0.34987, top1: 0.55839, throughput: 1316.59 | 2022-04-10 23:33:27.278 [rank:2] [train], epoch: 14/50, iter: 500/834, loss: 0.35148, top1: 0.55474, throughput: 1316.47 | 2022-04-10 23:33:27.278 [rank:3] [train], epoch: 14/50, iter: 500/834, loss: 0.34830, top1: 0.56365, throughput: 1316.64 | 2022-04-10 23:33:27.279 [rank:0] [train], epoch: 14/50, iter: 500/834, loss: 0.34951, top1: 0.55625, throughput: 1316.49 | 2022-04-10 23:33:27.278 [rank:7] [train], epoch: 14/50, iter: 500/834, loss: 0.35147, top1: 0.55729, throughput: 1316.48 | 2022-04-10 23:33:27.279 [rank:5] [train], epoch: 14/50, iter: 600/834, loss: 0.35015, top1: 0.55276, throughput: 1316.12 | 2022-04-10 23:33:41.865 [rank:6] [train], epoch: 14/50, iter: 600/834, loss: 0.35005, top1: 0.55729, throughput: 
1316.02 | 2022-04-10 23:33:41.866 [rank:0] [train], epoch: 14/50, iter: 600/834, loss: 0.35145, top1: 0.55411, throughput: 1316.17 | 2022-04-10 23:33:41.866 [rank:2] [train], epoch: 14/50, iter: 600/834, loss: 0.35139, top1: 0.55302, throughput: 1315.74 | 2022-04-10 23:33:41.870 [rank:4] [train], epoch: 14/50, iter: 600/834, loss: 0.34958, top1: 0.56328, throughput: 1315.96 | 2022-04-10 23:33:41.867 [rank:1] [train], epoch: 14/50, iter: 600/834, loss: 0.34827, top1: 0.56286, throughput: 1315.86 | 2022-04-10 23:33:41.869 [rank:7] [train], epoch: 14/50, iter: 600/834, loss: 0.34816, top1: 0.56172, throughput: 1316.09 | 2022-04-10 23:33:41.868 [rank:3] [train], epoch: 14/50, iter: 600/834, loss: 0.34805, top1: 0.56406, throughput: 1315.65 | 2022-04-10 23:33:41.872 [rank:4] [train], epoch: 14/50, iter: 700/834, loss: 0.35007, top1: 0.56016, throughput: 1315.48 | 2022-04-10 23:33:56.462 [rank:6] [train], epoch: 14/50, iter: 700/834, loss: 0.35014, top1: 0.55240, throughput: 1315.41 | 2022-04-10 23:33:56.463 [rank:2] [train], epoch: 14/50, iter: 700/834, loss: 0.35129, top1: 0.55240, throughput: 1315.73 | 2022-04-10 23:33:56.463 [rank:3] [train], epoch: 14/50, iter: 700/834, loss: 0.34763, top1: 0.56474, throughput: 1315.78 | 2022-04-10 23:33:56.464 [rank:0] [train], epoch: 14/50, iter: 700/834, loss: 0.34879, top1: 0.56078, throughput: 1315.30 | 2022-04-10 23:33:56.463 [rank:5] [train], epoch: 14/50, iter: 700/834, loss: 0.35084, top1: 0.55380, throughput: 1315.14 | 2022-04-10 23:33:56.464 [rank:7] [train], epoch: 14/50, iter: 700/834, loss: 0.34922, top1: 0.55589, throughput: 1315.35 | 2022-04-10 23:33:56.465 [rank:1] [train], epoch: 14/50, iter: 700/834, loss: 0.35141, top1: 0.55557, throughput: 1315.35 | 2022-04-10 23:33:56.466 [rank:4] [train], epoch: 14/50, iter: 800/834, loss: 0.34951, top1: 0.56250, throughput: 1314.81 | 2022-04-10 23:34:11.065 [rank:2] [train], epoch: 14/50, iter: 800/834, loss: 0.34834, top1: 0.56120, throughput: 1314.57 | 2022-04-10 
23:34:11.069 [rank:5] [train], epoch: 14/50, iter: 800/834, loss: 0.34802, top1: 0.56099, throughput: 1314.93 | 2022-04-10 23:34:11.065 [rank:3] [train], epoch: 14/50, iter: 800/834, loss: 0.34974, top1: 0.55885, throughput: 1314.75 | 2022-04-10 23:34:11.068 [rank:6] [train], epoch: 14/50, iter: 800/834, loss: 0.34671, top1: 0.56479, throughput: 1314.75 | 2022-04-10 23:34:11.066 [rank:7] [train], epoch: 14/50, iter: 800/834, loss: 0.35018, top1: 0.55250, throughput: 1314.92 | 2022-04-10 23:34:11.067 [rank:1] [train], epoch: 14/50, iter: 800/834, loss: 0.34912, top1: 0.56125, throughput: 1314.83 | 2022-04-10 23:34:11.069 [rank:0] [train], epoch: 14/50, iter: 800/834, loss: 0.35034, top1: 0.55833, throughput: 1314.77 | 2022-04-10 23:34:11.066 [rank:6] [train], epoch: 14/50, iter: 834/834, loss: 0.34995, top1: 0.56158, throughput: 1311.13 | 2022-04-10 23:34:16.045 [rank:5] [train], epoch: 14/50, iter: 834/834, loss: 0.35238, top1: 0.55407, throughput: 1310.94 | 2022-04-10 23:34:16.045 [rank:0] [train], epoch: 14/50, iter: 834/834, loss: 0.35394, top1: 0.54534, throughput: 1311.09 | 2022-04-10 23:34:16.045 [rank:2] [train], epoch: 14/50, iter: 834/834, loss: 0.35388, top1: 0.55070, throughput: 1311.35 | 2022-04-10 23:34:16.047 [rank:4] [train], epoch: 14/50, iter: 834/834, loss: 0.34410, top1: 0.56250, throughput: 1310.43 | 2022-04-10 23:34:16.047 [rank:3] [train], epoch: 14/50, iter: 834/834, loss: 0.34858, top1: 0.56127, throughput: 1311.03 | 2022-04-10 23:34:16.047 [rank:1] [train], epoch: 14/50, iter: 834/834, loss: 0.34782, top1: 0.55591, throughput: 1311.11 | 2022-04-10 23:34:16.048 [rank:7] [train], epoch: 14/50, iter: 834/834, loss: 0.35041, top1: 0.55453, throughput: 1310.41 | 2022-04-10 23:34:16.048 [rank:0] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.56624, throughput: 578.01 | 2022-04-10 23:34:26.858 [rank:7] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.56432, throughput: 574.11 | 2022-04-10 23:34:26.935 [rank:6] [eval], 
epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.56592, throughput: 570.04 | 2022-04-10 23:34:27.009 [rank:2] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.56432, throughput: 569.66 | 2022-04-10 23:34:27.018 [rank:5] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.55232, throughput: 566.71 | 2022-04-10 23:34:27.074 [rank:3] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.55232, throughput: 566.77 | 2022-04-10 23:34:27.075 [rank:4] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.55536, throughput: 566.51 | 2022-04-10 23:34:27.079 [rank:1] [eval], epoch: 14/50, iter: 125/125, loss: 0.00000, top1: 0.56384, throughput: 563.18 | 2022-04-10 23:34:27.146 [rank:5] [train], epoch: 15/50, iter: 100/834, loss: 0.33972, top1: 0.58099, throughput: 1314.68 | 2022-04-10 23:34:41.678 [rank:6] [train], epoch: 15/50, iter: 100/834, loss: 0.34427, top1: 0.56766, throughput: 1308.81 | 2022-04-10 23:34:41.679 [rank:1] [train], epoch: 15/50, iter: 100/834, loss: 0.34487, top1: 0.56927, throughput: 1320.97 | 2022-04-10 23:34:41.680 [rank:2] [train], epoch: 15/50, iter: 100/834, loss: 0.34216, top1: 0.57078, throughput: 1309.58 | 2022-04-10 23:34:41.679 [rank:4] [train], epoch: 15/50, iter: 100/834, loss: 0.34259, top1: 0.57099, throughput: 1314.95 | 2022-04-10 23:34:41.681 [rank:3] [train], epoch: 15/50, iter: 100/834, loss: 0.34378, top1: 0.56896, throughput: 1314.33 | 2022-04-10 23:34:41.683 [rank:7] [train], epoch: 15/50, iter: 100/834, loss: 0.34129, top1: 0.57229, throughput: 1302.06 | 2022-04-10 23:34:41.681 [rank:0] [train], epoch: 15/50, iter: 100/834, loss: 0.34080, top1: 0.57036, throughput: 1295.24 | 2022-04-10 23:34:41.682 [rank:6] [train], epoch: 15/50, iter: 200/834, loss: 0.34209, top1: 0.57385, throughput: 1316.04 | 2022-04-10 23:34:56.268 [rank:5] [train], epoch: 15/50, iter: 200/834, loss: 0.34625, top1: 0.56333, throughput: 1316.06 | 2022-04-10 23:34:56.267 [rank:4] [train], epoch: 15/50, iter: 200/834, loss: 
0.34730, top1: 0.56271, throughput: 1316.32 | 2022-04-10 23:34:56.267 [rank:2] [train], epoch: 15/50, iter: 200/834, loss: 0.34302, top1: 0.57208, throughput: 1316.07 | 2022-04-10 23:34:56.268 [rank:3] [train], epoch: 15/50, iter: 200/834, loss: 0.34632, top1: 0.56422, throughput: 1316.31 | 2022-04-10 23:34:56.269 [rank:0] [train], epoch: 15/50, iter: 200/834, loss: 0.34387, top1: 0.56776, throughput: 1316.33 | 2022-04-10 23:34:56.268 [rank:1] [train], epoch: 15/50, iter: 200/834, loss: 0.34258, top1: 0.57083, throughput: 1316.12 | 2022-04-10 23:34:56.269 [rank:7] [train], epoch: 15/50, iter: 200/834, loss: 0.34810, top1: 0.56563, throughput: 1316.11 | 2022-04-10 23:34:56.269 [rank:6] [train], epoch: 15/50, iter: 300/834, loss: 0.34641, top1: 0.56307, throughput: 1310.77 | 2022-04-10 23:35:10.916 [rank:4] [train], epoch: 15/50, iter: 300/834, loss: 0.34650, top1: 0.56328, throughput: 1310.63 | 2022-04-10 23:35:10.916 [rank:2] [train], epoch: 15/50, iter: 300/834, loss: 0.34640, top1: 0.56375, throughput: 1310.66 | 2022-04-10 23:35:10.917 [rank:1] [train], epoch: 15/50, iter: 300/834, loss: 0.34749, top1: 0.56313, throughput: 1310.65 | 2022-04-10 23:35:10.918 [rank:3] [train], epoch: 15/50, iter: 300/834, loss: 0.34565, top1: 0.56344, throughput: 1310.55 | 2022-04-10 23:35:10.919 [rank:5] [train], epoch: 15/50, iter: 300/834, loss: 0.34646, top1: 0.56495, throughput: 1310.43 | 2022-04-10 23:35:10.919 [rank:0] [train], epoch: 15/50, iter: 300/834, loss: 0.34593, top1: 0.56656, throughput: 1310.53 | 2022-04-10 23:35:10.918 [rank:7] [train], epoch: 15/50, iter: 300/834, loss: 0.34727, top1: 0.56651, throughput: 1310.67 | 2022-04-10 23:35:10.918 [rank:6] [train], epoch: 15/50, iter: 400/834, loss: 0.34586, top1: 0.56490, throughput: 1313.15 | 2022-04-10 23:35:25.537 [rank:1] [train], epoch: 15/50, iter: 400/834, loss: 0.34538, top1: 0.56307, throughput: 1313.22 | 2022-04-10 23:35:25.538 [rank:4] [train], epoch: 15/50, iter: 400/834, loss: 0.34988, top1: 0.55396, 
throughput: 1313.07 | 2022-04-10 23:35:25.538 [rank:2] [train], epoch: 15/50, iter: 400/834, loss: 0.34474, top1: 0.56682, throughput: 1313.12 | 2022-04-10 23:35:25.539 [rank:5] [train], epoch: 15/50, iter: 400/834, loss: 0.34536, top1: 0.56380, throughput: 1313.15 | 2022-04-10 23:35:25.540 [rank:0] [train], epoch: 15/50, iter: 400/834, loss: 0.34554, top1: 0.56568, throughput: 1313.16 | 2022-04-10 23:35:25.540 [rank:3] [train], epoch: 15/50, iter: 400/834, loss: 0.34699, top1: 0.56141, throughput: 1313.10 | 2022-04-10 23:35:25.541 [rank:7] [train], epoch: 15/50, iter: 400/834, loss: 0.34272, top1: 0.57531, throughput: 1313.10 | 2022-04-10 23:35:25.540 [rank:5] [train], epoch: 15/50, iter: 500/834, loss: 0.34825, top1: 0.55755, throughput: 1314.68 | 2022-04-10 23:35:40.144 [rank:2] [train], epoch: 15/50, iter: 500/834, loss: 0.34779, top1: 0.56115, throughput: 1314.53 | 2022-04-10 23:35:40.145 [rank:4] [train], epoch: 15/50, iter: 500/834, loss: 0.34785, top1: 0.56214, throughput: 1314.42 | 2022-04-10 23:35:40.146 [rank:3] [train], epoch: 15/50, iter: 500/834, loss: 0.34895, top1: 0.55708, throughput: 1314.55 | 2022-04-10 23:35:40.147 [rank:6] [train], epoch: 15/50, iter: 500/834, loss: 0.34793, top1: 0.56036, throughput: 1314.33 | 2022-04-10 23:35:40.146 [rank:7] [train], epoch: 15/50, iter: 500/834, loss: 0.34935, top1: 0.55979, throughput: 1314.58 | 2022-04-10 23:35:40.145 [rank:0] [train], epoch: 15/50, iter: 500/834, loss: 0.34686, top1: 0.55896, throughput: 1314.52[rank:1] [train], epoch: 15/50, iter: 500/834, loss: 0.34519, top1: 0.56078, throughput: 1314.29 | 2022-04-10 23:35:40.146| 2022-04-10 23:35:40.147 [rank:4] [train], epoch: 15/50, iter: 600/834, loss: 0.34603, top1: 0.56109, throughput: 1316.90 | 2022-04-10 23:35:54.725 [rank:6] [train], epoch: 15/50, iter: 600/834, loss: 0.34976, top1: 0.56094, throughput: 1316.92 | 2022-04-10 23:35:54.725 [rank:3] [train], epoch: 15/50, iter: 600/834, loss: 0.34841, top1: 0.55776, throughput: 1316.83 | 2022-04-10 
23:35:54.727 [rank:5] [train], epoch: 15/50, iter: 600/834, loss: 0.34689, top1: 0.56625, throughput: 1316.65 | 2022-04-10 23:35:54.727 [rank:7] [train], epoch: 15/50, iter: 600/834, loss: 0.34612, top1: 0.56932, throughput: 1316.68 | 2022-04-10 23:35:54.727 [rank:1] [train], epoch: 15/50, iter: 600/834, loss: 0.34601, top1: 0.56651, throughput: 1316.80 | 2022-04-10 23:35:54.728 [rank:2] [train], epoch: 15/50, iter: 600/834, loss: 0.34413, top1: 0.56870, throughput: 1316.62 | 2022-04-10 23:35:54.728 [rank:0] [train], epoch: 15/50, iter: 600/834, loss: 0.34456, top1: 0.56708, throughput: 1316.68 | 2022-04-10 23:35:54.728 [rank:2] [train], epoch: 15/50, iter: 700/834, loss: 0.34591, top1: 0.56370, throughput: 1315.51 | 2022-04-10 23:36:09.323 [rank:5] [train], epoch: 15/50, iter: 700/834, loss: 0.34772, top1: 0.55922, throughput: 1315.48 | 2022-04-10 23:36:09.322 [rank:7] [train], epoch: 15/50, iter: 700/834, loss: 0.34589, top1: 0.56448, throughput: 1315.42 | 2022-04-10 23:36:09.324 [rank:1] [train], epoch: 15/50, iter: 700/834, loss: 0.34899, top1: 0.56448, throughput: 1315.45 | 2022-04-10 23:36:09.324 [rank:4] [train], epoch: 15/50, iter: 700/834, loss: 0.34825, top1: 0.55932, throughput: 1315.28 | 2022-04-10 23:36:09.323 [rank:6] [train], epoch: 15/50, iter: 700/834, loss: 0.34973, top1: 0.56057, throughput: 1315.17 | 2022-04-10 23:36:09.324 [rank:3] [train], epoch: 15/50, iter: 700/834, loss: 0.34803, top1: 0.56557, throughput: 1315.30 | 2022-04-10 23:36:09.325 [rank:0] [train], epoch: 15/50, iter: 700/834, loss: 0.34841, top1: 0.56417, throughput: 1315.22 | 2022-04-10 23:36:09.326 [rank:6] [train], epoch: 15/50, iter: 800/834, loss: 0.35060, top1: 0.55401, throughput: 1303.92 | 2022-04-10 23:36:24.049 [rank:3] [train], epoch: 15/50, iter: 800/834, loss: 0.34661, top1: 0.56437, throughput: 1303.85 | 2022-04-10 23:36:24.050 [rank:2] [train], epoch: 15/50, iter: 800/834, loss: 0.34548, top1: 0.56333, throughput: 1303.70 | 2022-04-10 23:36:24.050 [rank:7] [train], 
epoch: 15/50, iter: 800/834, loss: 0.34841, top1: 0.55854, throughput: 1303.75 | 2022-04-10 23:36:24.050 [rank:4] [train], epoch: 15/50, iter: 800/834, loss: 0.34611, top1: 0.56333, throughput: 1303.72 | 2022-04-10 23:36:24.050 [rank:5] [train], epoch: 15/50, iter: 800/834, loss: 0.34792, top1: 0.56104, throughput: 1303.46 | 2022-04-10 23:36:24.052 [rank:0] [train], epoch: 15/50, iter: 800/834, loss: 0.34754, top1: 0.55776, throughput: 1303.99 | 2022-04-10 23:36:24.050 [rank:1] [train], epoch: 15/50, iter: 800/834, loss: 0.34388, top1: 0.56656, throughput: 1303.61 | 2022-04-10 23:36:24.052 [rank:5] [train], epoch: 15/50, iter: 834/834, loss: 0.34378, top1: 0.56893, throughput: 1311.59 | 2022-04-10 23:36:29.029 [rank:4] [train], epoch: 15/50, iter: 834/834, loss: 0.34199, top1: 0.57001, throughput: 1310.89 | 2022-04-10 23:36:29.030 [rank:0] [train], epoch: 15/50, iter: 834/834, loss: 0.34514, top1: 0.56204, throughput: 1310.96 | 2022-04-10 23:36:29.030 [rank:1] [train], epoch: 15/50, iter: 834/834, loss: 0.34362, top1: 0.57200, throughput: 1311.26 | 2022-04-10 23:36:29.030 [rank:6] [train], epoch: 15/50, iter: 834/834, loss: 0.34786, top1: 0.56449, throughput: 1310.31 | 2022-04-10 23:36:29.031 [rank:7] [train], epoch: 15/50, iter: 834/834, loss: 0.34502, top1: 0.56955, throughput: 1310.79 | 2022-04-10 23:36:29.031 [rank:3] [train], epoch: 15/50, iter: 834/834, loss: 0.34326, top1: 0.58104, throughput: 1310.63 | 2022-04-10 23:36:29.031 [rank:2] [train], epoch: 15/50, iter: 834/834, loss: 0.35071, top1: 0.55193, throughput: 1310.53 | 2022-04-10 23:36:29.031 [rank:0] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.57376, throughput: 587.49 | 2022-04-10 23:36:39.668 [rank:7] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.57824, throughput: 584.36 | 2022-04-10 23:36:39.726 [rank:1] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.57104, throughput: 580.20 | 2022-04-10 23:36:39.803 [rank:6] [eval], epoch: 15/50, iter: 125/125, loss: 
0.00000, top1: 0.57072, throughput: 576.98 | 2022-04-10 23:36:39.863 [rank:3] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.55888, throughput: 576.90 | 2022-04-10 23:36:39.865 [rank:2] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.55856, throughput: 576.23 | 2022-04-10 23:36:39.878 [rank:4] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.56272, throughput: 573.93 | 2022-04-10 23:36:39.919 [rank:5] [eval], epoch: 15/50, iter: 125/125, loss: 0.00000, top1: 0.55776, throughput: 570.55 | 2022-04-10 23:36:39.984 [rank:4] [train], epoch: 16/50, iter: 100/834, loss: 0.34104, top1: 0.57594, throughput: 1305.43 | 2022-04-10 23:36:54.627 [rank:1] [train], epoch: 16/50, iter: 100/834, loss: 0.34139, top1: 0.56917, throughput: 1295.01 | 2022-04-10 23:36:54.629 [rank:5] [train], epoch: 16/50, iter: 100/834, loss: 0.33870, top1: 0.57589, throughput: 1311.10 | 2022-04-10 23:36:54.628 [rank:6] [train], epoch: 16/50, iter: 100/834, loss: 0.33604, top1: 0.58302, throughput: 1300.26 | 2022-04-10 23:36:54.629 [rank:2] [train], epoch: 16/50, iter: 100/834, loss: 0.34004, top1: 0.57359, throughput: 1301.51 | 2022-04-10 23:36:54.630 [rank:7] [train], epoch: 16/50, iter: 100/834, loss: 0.34389, top1: 0.57583, throughput: 1288.27 | 2022-04-10 23:36:54.630 [rank:3] [train], epoch: 16/50, iter: 100/834, loss: 0.34196, top1: 0.57427, throughput: 1300.38 | 2022-04-10 23:36:54.630 [rank:0] [train], epoch: 16/50, iter: 100/834, loss: 0.33975, top1: 0.57708, throughput: 1283.22 | 2022-04-10 23:36:54.631 [rank:6] [train], epoch: 16/50, iter: 200/834, loss: 0.34123, top1: 0.57219, throughput: 1314.63 | 2022-04-10 23:37:09.234 [rank:5] [train], epoch: 16/50, iter: 200/834, loss: 0.34377, top1: 0.57193, throughput: 1314.53 | 2022-04-10 23:37:09.234 [rank:1] [train], epoch: 16/50, iter: 200/834, loss: 0.34161, top1: 0.57625, throughput: 1314.43 | 2022-04-10 23:37:09.236 [rank:3] [train], epoch: 16/50, iter: 200/834, loss: 0.34171, top1: 0.56833, throughput: 
1314.44 | 2022-04-10 23:37:09.237 [rank:2] [train], epoch: 16/50, iter: 200/834, loss: 0.34114, top1: 0.57109, throughput: 1314.50 | 2022-04-10 23:37:09.236 [rank:7] [train], epoch: 16/50, iter: 200/834, loss: 0.33975, top1: 0.57703, throughput: 1314.48 | 2022-04-10 23:37:09.236 [rank:4] [train], epoch: 16/50, iter: 200/834, loss: 0.34226, top1: 0.57510, throughput: 1314.24 | 2022-04-10 23:37:09.236 [rank:0] [train], epoch: 16/50, iter: 200/834, loss: 0.34513, top1: 0.56323, throughput: 1314.47 | 2022-04-10 23:37:09.237 [rank:5] [train], epoch: 16/50, iter: 300/834, loss: 0.33891, top1: 0.57328, throughput: 1315.61 | 2022-04-10 23:37:23.828 [rank:6] [train], epoch: 16/50, iter: 300/834, loss: 0.33956, top1: 0.57781, throughput: 1315.59 | 2022-04-10 23:37:23.828 [rank:7] [train], epoch: 16/50, iter: 300/834, loss: 0.34269, top1: 0.57271, throughput: 1315.72 | 2022-04-10 23:37:23.829 [rank:2] [train], epoch: 16/50, iter: 300/834, loss: 0.34063, top1: 0.57401, throughput: 1315.71 | 2022-04-10 23:37:23.829 [rank:3] [train], epoch: 16/50, iter: 300/834, loss: 0.34149, top1: 0.57156, throughput: 1315.66 | 2022-04-10 23:37:23.830 [rank:4] [train], epoch: 16/50, iter: 300/834, loss: 0.34177, top1: 0.57354, throughput: 1315.67 | 2022-04-10 23:37:23.830 [rank:1] [train], epoch: 16/50, iter: 300/834, loss: 0.34081, top1: 0.57823, throughput: 1315.62 | 2022-04-10 23:37:23.830 [rank:0] [train], epoch: 16/50, iter: 300/834, loss: 0.34152, top1: 0.57578, throughput: 1315.53 | 2022-04-10 23:37:23.832 [rank:6] [train], epoch: 16/50, iter: 400/834, loss: 0.33967, top1: 0.57437, throughput: 1313.19 | 2022-04-10 23:37:38.449 [rank:5] [train], epoch: 16/50, iter: 400/834, loss: 0.34244, top1: 0.57255, throughput: 1313.15 | 2022-04-10 23:37:38.449 [rank:4] [train], epoch: 16/50, iter: 400/834, loss: 0.34372, top1: 0.56745, throughput: 1313.33 | 2022-04-10 23:37:38.449 [rank:2] [train], epoch: 16/50, iter: 400/834, loss: 0.34517, top1: 0.56771, throughput: 1313.08 | 2022-04-10 
23:37:38.451 [rank:0] [train], epoch: 16/50, iter: 400/834, loss: 0.34376, top1: 0.57057, throughput: 1313.27 | 2022-04-10 23:37:38.452 [rank:7] [train], epoch: 16/50, iter: 400/834, loss: 0.33953, top1: 0.57771, throughput: 1313.10 | 2022-04-10 23:37:38.451 [rank:1] [train], epoch: 16/50, iter: 400/834, loss: 0.34244, top1: 0.57458, throughput: 1312.76 | 2022-04-10 23:37:38.455 [rank:3] [train], epoch: 16/50, iter: 400/834, loss: 0.34193, top1: 0.57417, throughput: 1312.80 | 2022-04-10 23:37:38.456 [rank:4] [train], epoch: 16/50, iter: 500/834, loss: 0.34290, top1: 0.56568, throughput: 1316.36 | 2022-04-10 23:37:53.035 [rank:2] [train], epoch: 16/50, iter: 500/834, loss: 0.34507, top1: 0.56953, throughput: 1316.38 | 2022-04-10 23:37:53.036 [rank:6] [train], epoch: 16/50, iter: 500/834, loss: 0.34625, top1: 0.56146, throughput: 1316.41 | 2022-04-10 23:37:53.035 [rank:5] [train], epoch: 16/50, iter: 500/834, loss: 0.34540, top1: 0.56448, throughput: 1316.40 | 2022-04-10 23:37:53.034 [rank:1] [train], epoch: 16/50, iter: 500/834, loss: 0.34248, top1: 0.56854, throughput: 1316.76 | 2022-04-10 23:37:53.037 [rank:3] [train], epoch: 16/50, iter: 500/834, loss: 0.34495, top1: 0.56458, throughput: 1316.50 | 2022-04-10 23:37:53.040 [rank:7] [train], epoch: 16/50, iter: 500/834, loss: 0.34130, top1: 0.57266, throughput: 1316.38 | 2022-04-10 23:37:53.036 [rank:0] [train], epoch: 16/50, iter: 500/834, loss: 0.34389, top1: 0.56802, throughput: 1316.25 | 2022-04-10 23:37:53.039 [rank:6] [train], epoch: 16/50, iter: 600/834, loss: 0.34255, top1: 0.57365, throughput: 1315.46 | 2022-04-10 23:38:07.630 [rank:5] [train], epoch: 16/50, iter: 600/834, loss: 0.34444, top1: 0.56375, throughput: 1315.28 | 2022-04-10 23:38:07.632 [rank:7] [train], epoch: 16/50, iter: 600/834, loss: 0.34482, top1: 0.56599, throughput: 1315.57 | 2022-04-10 23:38:07.631 [rank:3] [train], epoch: 16/50, iter: 600/834, loss: 0.34521, top1: 0.56130, throughput: 1315.68 | 2022-04-10 23:38:07.633[rank:1] [train], 
epoch: 16/50, iter: 600/834, loss: 0.34403, top1: 0.56833, throughput: 1315.36 | 2022-04-10 23:38:07.633 [rank:4] [train], epoch: 16/50, iter: 600/834, loss: 0.34323, top1: 0.56875, throughput: 1315.31 | 2022-04-10 23:38:07.632 [rank:0] [train], epoch: 16/50, iter: 600/834, loss: 0.34624, top1: 0.56677, throughput: 1315.68 | 2022-04-10 23:38:07.632 [rank:2] [train], epoch: 16/50, iter: 600/834, loss: 0.34512, top1: 0.56505, throughput: 1315.40 | 2022-04-10 23:38:07.633 [rank:2] [train], epoch: 16/50, iter: 700/834, loss: 0.34416, top1: 0.56844, throughput: 1308.51 | 2022-04-10 23:38:22.306 [rank:3] [train], epoch: 16/50, iter: 700/834, loss: 0.34376, top1: 0.56990, throughput: 1308.36 | 2022-04-10 23:38:22.308 [rank:5] [train], epoch: 16/50, iter: 700/834, loss: 0.34513, top1: 0.56365, throughput: 1308.57 | 2022-04-10 23:38:22.305 [rank:4] [train], epoch: 16/50, iter: 700/834, loss: 0.34322, top1: 0.56625, throughput: 1308.48 | 2022-04-10 23:38:22.306 [rank:7] [train], epoch: 16/50, iter: 700/834, loss: 0.34420, top1: 0.56651, throughput: 1308.02 | 2022-04-10 23:38:22.309 [rank:6] [train], epoch: 16/50, iter: 700/834, loss: 0.34393, top1: 0.56771, throughput: 1308.18 | 2022-04-10 23:38:22.307 [rank:0] [train], epoch: 16/50, iter: 700/834, loss: 0.34569, top1: 0.56411, throughput: 1308.39 | 2022-04-10 23:38:22.307 [rank:1] [train], epoch: 16/50, iter: 700/834, loss: 0.34514, top1: 0.56802, throughput: 1308.46 | 2022-04-10 23:38:22.307 [rank:2] [train], epoch: 16/50, iter: 800/834, loss: 0.34273, top1: 0.56740, throughput: 1315.98 | 2022-04-10 23:38:36.896 [rank:7] [train], epoch: 16/50, iter: 800/834, loss: 0.34467, top1: 0.56953, throughput: 1316.20 | 2022-04-10 23:38:36.897 [rank:4] [train], epoch: 16/50, iter: 800/834, loss: 0.34533, top1: 0.56406, throughput: 1316.00 | 2022-04-10 23:38:36.895 [rank:5] [train], epoch: 16/50, iter: 800/834, loss: 0.34671, top1: 0.56276, throughput: 1315.89 | 2022-04-10 23:38:36.896 [rank:1] [train], epoch: 16/50, iter: 800/834, 
loss: 0.34185, top1: 0.57167, throughput: 1315.97 | 2022-04-10 23:38:36.897 [rank:3] [train], epoch: 16/50, iter: 800/834, loss: 0.34405, top1: 0.56651, throughput: 1316.04 | 2022-04-10 23:38:36.897 [rank:0] [train], epoch: 16/50, iter: 800/834, loss: 0.34343, top1: 0.56646, throughput: 1315.91 | 2022-04-10 23:38:36.897 [rank:6] [train], epoch: 16/50, iter: 800/834, loss: 0.34474, top1: 0.56979, throughput: 1316.01 | 2022-04-10 23:38:36.897 [rank:5] [train], epoch: 16/50, iter: 834/834, loss: 0.34236, top1: 0.56801, throughput: 1313.74 | 2022-04-10 23:38:41.865 [rank:1] [train], epoch: 16/50, iter: 834/834, loss: 0.34700, top1: 0.56587, throughput: 1314.07 | 2022-04-10 23:38:41.865 [rank:2] [train], epoch: 16/50, iter: 834/834, loss: 0.34617, top1: 0.56756, throughput: 1313.69 | 2022-04-10 23:38:41.865 [rank:0] [train], epoch: 16/50, iter: 834/834, loss: 0.34053, top1: 0.57154, throughput: 1313.96 | 2022-04-10 23:38:41.865 [rank:7] [train], epoch: 16/50, iter: 834/834, loss: 0.34250, top1: 0.57062, throughput: 1313.86 | 2022-04-10 23:38:41.865 [rank:6] [train], epoch: 16/50, iter: 834/834, loss: 0.34430, top1: 0.56924, throughput: 1313.67 | 2022-04-10 23:38:41.866 [rank:3] [train], epoch: 16/50, iter: 834/834, loss: 0.34392, top1: 0.56495, throughput: 1313.60 | 2022-04-10 23:38:41.867 [rank:4] [train], epoch: 16/50, iter: 834/834, loss: 0.34514, top1: 0.56602, throughput: 1312.88 | 2022-04-10 23:38:41.868 [rank:7] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.57584, throughput: 584.70 | 2022-04-10 23:38:52.555 [rank:0] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.58192, throughput: 583.53 | 2022-04-10 23:38:52.576 [rank:3] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.57392, throughput: 580.88 | 2022-04-10 23:38:52.626 [rank:2] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.56720, throughput: 580.24 | 2022-04-10 23:38:52.636 [rank:4] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.57600, 
throughput: 580.32 | 2022-04-10 23:38:52.637 [rank:6] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.57552, throughput: 577.98 | 2022-04-10 23:38:52.679 [rank:1] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.58432, throughput: 571.54 | 2022-04-10 23:38:52.800 [rank:5] [eval], epoch: 16/50, iter: 125/125, loss: 0.00000, top1: 0.57056, throughput: 566.39 | 2022-04-10 23:38:52.899 [rank:5] [train], epoch: 17/50, iter: 100/834, loss: 0.33945, top1: 0.57318, throughput: 1320.69 | 2022-04-10 23:39:07.437 [rank:6] [train], epoch: 17/50, iter: 100/834, loss: 0.33710, top1: 0.58182, throughput: 1301.02 | 2022-04-10 23:39:07.437 [rank:4] [train], epoch: 17/50, iter: 100/834, loss: 0.33715, top1: 0.58005, throughput: 1297.34 | 2022-04-10 23:39:07.437 [rank:1] [train], epoch: 17/50, iter: 100/834, loss: 0.33769, top1: 0.57750, throughput: 1311.57 | 2022-04-10 23:39:07.439 [rank:3] [train], epoch: 17/50, iter: 100/834, loss: 0.33825, top1: 0.57859, throughput: 1295.98 | 2022-04-10 23:39:07.441 [rank:7] [train], epoch: 17/50, iter: 100/834, loss: 0.34067, top1: 0.57328, throughput: 1290.05 | 2022-04-10 23:39:07.438 [rank:0] [train], epoch: 17/50, iter: 100/834, loss: 0.33332, top1: 0.58901, throughput: 1291.73 | 2022-04-10 23:39:07.440 [rank:2] [train], epoch: 17/50, iter: 100/834, loss: 0.33606, top1: 0.58161, throughput: 1297.04 | 2022-04-10 23:39:07.439 [rank:6] [train], epoch: 17/50, iter: 200/834, loss: 0.33970, top1: 0.57813, throughput: 1316.12 | 2022-04-10 23:39:22.025 [rank:5] [train], epoch: 17/50, iter: 200/834, loss: 0.34054, top1: 0.57380, throughput: 1315.96 | 2022-04-10 23:39:22.027 [rank:4] [train], epoch: 17/50, iter: 200/834, loss: 0.34147, top1: 0.57323, throughput: 1315.92 | 2022-04-10 23:39:22.028 [rank:2] [train], epoch: 17/50, iter: 200/834, loss: 0.33842, top1: 0.57698, throughput: 1316.22 | 2022-04-10 23:39:22.027 [rank:1] [train], epoch: 17/50, iter: 200/834, loss: 0.33959, top1: 0.58255, throughput: 1316.08 | 2022-04-10 
23:39:22.028 [rank:0] [train], epoch: 17/50, iter: 200/834, loss: 0.33883, top1: 0.57969, throughput: 1316.21 | 2022-04-10 23:39:22.027 [rank:3] [train], epoch: 17/50, iter: 200/834, loss: 0.33942, top1: 0.57672, throughput: 1316.20 | 2022-04-10 23:39:22.029 [rank:7] [train], epoch: 17/50, iter: 200/834, loss: 0.33507, top1: 0.58302, throughput: 1315.97 | 2022-04-10 23:39:22.028 [rank:5] [train], epoch: 17/50, iter: 300/834, loss: 0.34084, top1: 0.57615, throughput: 1316.42 | 2022-04-10 23:39:36.612 [rank:6] [train], epoch: 17/50, iter: 300/834, loss: 0.33844, top1: 0.57885, throughput: 1316.23 | 2022-04-10 23:39:36.613 [rank:4] [train], epoch: 17/50, iter: 300/834, loss: 0.33541, top1: 0.58255, throughput: 1316.24 | 2022-04-10 23:39:36.615 [rank:3] [train], epoch: 17/50, iter: 300/834, loss: 0.33942, top1: 0.57672, throughput: 1316.29 | 2022-04-10 23:39:36.615 [rank:0] [train], epoch: 17/50, iter: 300/834, loss: 0.34360, top1: 0.57161, throughput: 1316.22 | 2022-04-10 23:39:36.615 [rank:1] [train], epoch: 17/50, iter: 300/834, loss: 0.33972, top1: 0.57526, throughput: 1316.13 | 2022-04-10 23:39:36.616 [rank:7] [train], epoch: 17/50, iter: 300/834, loss: 0.33961, top1: 0.57432, throughput: 1316.04 | 2022-04-10 23:39:36.617 [rank:2] [train], epoch: 17/50, iter: 300/834, loss: 0.34214, top1: 0.57380, throughput: 1315.99 | 2022-04-10 23:39:36.616 [rank:6] [train], epoch: 17/50, iter: 400/834, loss: 0.34069, top1: 0.57781, throughput: 1314.09 | 2022-04-10 23:39:51.223 [rank:5] [train], epoch: 17/50, iter: 400/834, loss: 0.34101, top1: 0.56797, throughput: 1314.01 | 2022-04-10 23:39:51.224 [rank:1] [train], epoch: 17/50, iter: 400/834, loss: 0.34281, top1: 0.56573, throughput: 1314.31 | 2022-04-10 23:39:51.225 [rank:3] [train], epoch: 17/50, iter: 400/834, loss: 0.33647, top1: 0.58130, throughput: 1314.16 | 2022-04-10 23:39:51.225 [rank:7] [train], epoch: 17/50, iter: 400/834, loss: 0.33981, top1: 0.57682, throughput: 1314.36 | 2022-04-10 23:39:51.225 [rank:2] [train], 
epoch: 17/50, iter: 400/834, loss: 0.34362, top1: 0.57318, throughput: 1314.33 | 2022-04-10 23:39:51.225 [rank:4] [train], epoch: 17/50, iter: 400/834, loss: 0.34165, top1: 0.57432, throughput: 1314.08 | 2022-04-10 23:39:51.226 [rank:0] [train], epoch: 17/50, iter: 400/834, loss: 0.34352, top1: 0.57172, throughput: 1314.08 | 2022-04-10 23:39:51.225 [rank:4] [train], epoch: 17/50, iter: 500/834, loss: 0.33960, top1: 0.57552, throughput: 1301.26 | 2022-04-10 23:40:05.981 [rank:5] [train], epoch: 17/50, iter: 500/834, loss: 0.34376, top1: 0.56818, throughput: 1300.93 | 2022-04-10 23:40:05.983 [rank:3] [train], epoch: 17/50, iter: 500/834, loss: 0.33954, top1: 0.57807, throughput: 1301.04 | 2022-04-10 23:40:05.983 [rank:7] [train], epoch: 17/50, iter: 500/834, loss: 0.34051, top1: 0.57151, throughput: 1301.11 | 2022-04-10 23:40:05.982 [rank:1] [train], epoch: 17/50, iter: 500/834, loss: 0.34057, top1: 0.57130, throughput: 1300.98 | 2022-04-10 23:40:05.983 [rank:6] [train], epoch: 17/50, iter: 500/834, loss: 0.34222, top1: 0.57458, throughput: 1300.87 | 2022-04-10 23:40:05.983 [rank:0] [train], epoch: 17/50, iter: 500/834, loss: 0.34007, top1: 0.57521, throughput: 1301.10 | 2022-04-10 23:40:05.982 [rank:2] [train], epoch: 17/50, iter: 500/834, loss: 0.33989, top1: 0.57839, throughput: 1301.08 | 2022-04-10 23:40:05.982 [rank:6] [train], epoch: 17/50, iter: 600/834, loss: 0.34218, top1: 0.56620, throughput: 1311.76 | 2022-04-10 23:40:20.620 [rank:2] [train], epoch: 17/50, iter: 600/834, loss: 0.33905, top1: 0.58146, throughput: 1311.49 | 2022-04-10 23:40:20.621 [rank:3] [train], epoch: 17/50, iter: 600/834, loss: 0.34004, top1: 0.57734, throughput: 1311.55 | 2022-04-10 23:40:20.622 [rank:4] [train], epoch: 17/50, iter: 600/834, loss: 0.34035, top1: 0.57609, throughput: 1311.32 | 2022-04-10 23:40:20.622 [rank:1] [train], epoch: 17/50, iter: 600/834, loss: 0.33927, top1: 0.57552, throughput: 1311.37 | 2022-04-10 23:40:20.624 [rank:0] [train], epoch: 17/50, iter: 600/834, 
loss: 0.34163, top1: 0.57260, throughput: 1311.51 | 2022-04-10 23:40:20.622 [rank:5] [train], epoch: 17/50, iter: 600/834, loss: 0.34197, top1: 0.56839, throughput: 1311.07 | 2022-04-10 23:40:20.627 [rank:7] [train], epoch: 17/50, iter: 600/834, loss: 0.33942, top1: 0.57260, throughput: 1310.84 | 2022-04-10 23:40:20.629 [rank:3] [train], epoch: 17/50, iter: 700/834, loss: 0.34274, top1: 0.57464, throughput: 1317.81 | 2022-04-10 23:40:35.191 [rank:5] [train], epoch: 17/50, iter: 700/834, loss: 0.34090, top1: 0.57349, throughput: 1318.50 | 2022-04-10 23:40:35.189 [rank:4] [train], epoch: 17/50, iter: 700/834, loss: 0.34086, top1: 0.57333, throughput: 1317.93 | 2022-04-10 23:40:35.191 [rank:1] [train], epoch: 17/50, iter: 700/834, loss: 0.34268, top1: 0.57260, throughput: 1318.11 [rank:6] [train], epoch: 17/50, iter: 700/834, loss: 0.33870, top1: 0.57646, throughput: 1317.62| 2022-04-10 23:40:35.190 | 2022-04-10 23:40:35.191 [rank:2] [train], epoch: 17/50, iter: 700/834, loss: 0.34118, top1: 0.57417, throughput: 1317.89 | 2022-04-10 23:40:35.190 [rank:7] [train], epoch: 17/50, iter: 700/834, loss: 0.34116, top1: 0.57411, throughput: 1318.53 | 2022-04-10 23:40:35.190 [rank:0] [train], epoch: 17/50, iter: 700/834, loss: 0.34234, top1: 0.57068, throughput: 1317.69 | 2022-04-10 23:40:35.193 [rank:2] [train], epoch: 17/50, iter: 800/834, loss: 0.34402, top1: 0.56953, throughput: 1311.00 [rank:7] [train], epoch: 17/50, iter: 800/834, loss: 0.33912, top1: 0.57792, throughput: 1311.20| 2022-04-10 23:40:49.835 | 2022-04-10 23:40:49.833 [rank:4] [train], epoch: 17/50, iter: 800/834, loss: 0.33618, top1: 0.58302, throughput: 1311.20[rank:5] [train], epoch: 17/50, iter: 800/834, loss: 0.34081, top1: 0.57135, throughput: 1311.02 | 2022-04-10 23:40:49.834 | 2022-04-10 23:40:49.834 [rank:3] [train], epoch: 17/50, iter: 800/834, loss: 0.34179, top1: 0.56984, throughput: 1311.16 | 2022-04-10 23:40:49.835 [rank:1] [train], epoch: 17/50, iter: 800/834, loss: 0.34134, top1: 0.57260, 
throughput: 1310.99 | 2022-04-10 23:40:49.836 [rank:6] [train], epoch: 17/50, iter: 800/834, loss: 0.33989, top1: 0.57594, throughput: 1311.09 | 2022-04-10 23:40:49.836 [rank:0] [train], epoch: 17/50, iter: 800/834, loss: 0.33848, top1: 0.58109, throughput: 1311.16 | 2022-04-10 23:40:49.836 [rank:5] [train], epoch: 17/50, iter: 834/834, loss: 0.34436, top1: 0.56679, throughput: 1309.59 | 2022-04-10 23:40:54.819 [rank:4] [train], epoch: 17/50, iter: 834/834, loss: 0.34815, top1: 0.55744, throughput: 1309.31 | 2022-04-10 23:40:54.820 [rank:6] [train], epoch: 17/50, iter: 834/834, loss: 0.34136, top1: 0.57521, throughput: 1309.75 | 2022-04-10 23:40:54.820 [rank:3] [train], epoch: 17/50, iter: 834/834, loss: 0.34069, top1: 0.57047, throughput: 1309.16 | 2022-04-10 23:40:54.821 [rank:0] [train], epoch: 17/50, iter: 834/834, loss: 0.34550, top1: 0.56434, throughput: 1309.05[rank:7] [train], epoch: 17/50, iter: 834/834, loss: 0.33882, top1: 0.57858, throughput: 1308.62 | 2022-04-10 23:40:54.822 | 2022-04-10 23:40:54.823 [rank:2] [train], epoch: 17/50, iter: 834/834, loss: 0.33726, top1: 0.57935, throughput: 1309.26 | 2022-04-10 23:40:54.821 [rank:1] [train], epoch: 17/50, iter: 834/834, loss: 0.34253, top1: 0.56756, throughput: 1308.84 | 2022-04-10 23:40:54.823 [rank:7] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.59056, throughput: 577.38 | 2022-04-10 23:41:05.646 [rank:0] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.59120, throughput: 576.90 | 2022-04-10 23:41:05.657 [rank:1] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.58704, throughput: 574.95 | 2022-04-10 23:41:05.694 [rank:2] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.57536, throughput: 571.34 | 2022-04-10 23:41:05.761 [rank:3] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.58448, throughput: 569.25 | 2022-04-10 23:41:05.801 [rank:6] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.58272, throughput: 568.82 | 2022-04-10 
23:41:05.807 [rank:5] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.57696, throughput: 566.86 | 2022-04-10 23:41:05.845 [rank:4] [eval], epoch: 17/50, iter: 125/125, loss: 0.00000, top1: 0.58304, throughput: 561.17 | 2022-04-10 23:41:05.957 [rank:5] [train], epoch: 18/50, iter: 100/834, loss: 0.33482, top1: 0.58786, throughput: 1307.65 | 2022-04-10 23:41:20.528 [rank:4] [train], epoch: 18/50, iter: 100/834, loss: 0.33798, top1: 0.57776, throughput: 1317.67 | 2022-04-10 23:41:20.528 [rank:0] [train], epoch: 18/50, iter: 100/834, loss: 0.33342, top1: 0.58688, throughput: 1291.06 | 2022-04-10 23:41:20.528 [rank:6] [train], epoch: 18/50, iter: 100/834, loss: 0.33137, top1: 0.59583, throughput: 1304.33 | 2022-04-10 23:41:20.528 [rank:1] [train], epoch: 18/50, iter: 100/834, loss: 0.33343, top1: 0.59135, throughput: 1294.22 | 2022-04-10 23:41:20.529 [rank:3] [train], epoch: 18/50, iter: 100/834, loss: 0.33152, top1: 0.58974, throughput: 1303.56 | 2022-04-10 23:41:20.529 [rank:7] [train], epoch: 18/50, iter: 100/834, loss: 0.33337, top1: 0.58828, throughput: 1290.12 | 2022-04-10 23:41:20.529 [rank:2] [train], epoch: 18/50, iter: 100/834, loss: 0.33336, top1: 0.58839, throughput: 1299.93 | 2022-04-10 23:41:20.531 [rank:4] [train], epoch: 18/50, iter: 200/834, loss: 0.33635, top1: 0.58354, throughput: 1315.33 | 2022-04-10 23:41:35.125 [rank:5] [train], epoch: 18/50, iter: 200/834, loss: 0.33681, top1: 0.58099, throughput: 1315.30 | 2022-04-10 23:41:35.125 [rank:6] [train], epoch: 18/50, iter: 200/834, loss: 0.33400, top1: 0.58396, throughput: 1315.04 | 2022-04-10 23:41:35.128 [rank:0] [train], epoch: 18/50, iter: 200/834, loss: 0.33580, top1: 0.58286, throughput: 1315.24 | 2022-04-10 23:41:35.126 [rank:3] [train], epoch: 18/50, iter: 200/834, loss: 0.33408, top1: 0.58484, throughput: 1315.26 | 2022-04-10 23:41:35.127 [rank:1] [train], epoch: 18/50, iter: 200/834, loss: 0.33529, top1: 0.58276, throughput: 1315.20 | 2022-04-10 23:41:35.127 [rank:2] [train], 
epoch: 18/50, iter: 200/834, loss: 0.33620, top1: 0.58474, throughput: 1315.31 | 2022-04-10 23:41:35.128 [rank:7] [train], epoch: 18/50, iter: 200/834, loss: 0.33579, top1: 0.58177, throughput: 1314.94 | 2022-04-10 23:41:35.130 [rank:6] [train], epoch: 18/50, iter: 300/834, loss: 0.33697, top1: 0.57984, throughput: 1302.74 | 2022-04-10 23:41:49.866 [rank:4] [train], epoch: 18/50, iter: 300/834, loss: 0.33600, top1: 0.58396, throughput: 1302.50 | 2022-04-10 23:41:49.866 [rank:5] [train], epoch: 18/50, iter: 300/834, loss: 0.33299, top1: 0.59115, throughput: 1302.28 | 2022-04-10 23:41:49.868 [rank:2] [train], epoch: 18/50, iter: 300/834, loss: 0.33904, top1: 0.57807, throughput: 1302.69 | 2022-04-10 23:41:49.867 [rank:3] [train], epoch: 18/50, iter: 300/834, loss: 0.33641, top1: 0.58375, throughput: 1302.22 | 2022-04-10 23:41:49.871 [rank:0] [train], epoch: 18/50, iter: 300/834, loss: 0.33754, top1: 0.57724, throughput: 1302.32 | 2022-04-10 23:41:49.869 [rank:1] [train], epoch: 18/50, iter: 300/834, loss: 0.33814, top1: 0.57844, throughput: 1302.45 | 2022-04-10 23:41:49.869 [rank:7] [train], epoch: 18/50, iter: 300/834, loss: 0.33788, top1: 0.57687, throughput: 1302.62 | 2022-04-10 23:41:49.870 [rank:6] [train], epoch: 18/50, iter: 400/834, loss: 0.33771, top1: 0.57917, throughput: 1314.22 | 2022-04-10 23:42:04.476 [rank:5] [train], epoch: 18/50, iter: 400/834, loss: 0.33617, top1: 0.58620, throughput: 1314.45 | 2022-04-10 23:42:04.475 [rank:2] [train], epoch: 18/50, iter: 400/834, loss: 0.33884, top1: 0.58000, throughput: 1314.16 | 2022-04-10 23:42:04.477 [rank:7] [train], epoch: 18/50, iter: 400/834, loss: 0.33734, top1: 0.58073, throughput: 1314.43 | 2022-04-10 23:42:04.477 [rank:1] [train], epoch: 18/50, iter: 400/834, loss: 0.33901, top1: 0.58391, throughput: 1314.18 | 2022-04-10 23:42:04.479 [rank:4] [train], epoch: 18/50, iter: 400/834, loss: 0.33665, top1: 0.58234, throughput: 1313.99 | 2022-04-10 23:42:04.478 [rank:3] [train], epoch: 18/50, iter: 400/834, 
loss: 0.33619, top1: 0.58000, throughput: 1314.27 | 2022-04-10 23:42:04.480 [rank:0] [train], epoch: 18/50, iter: 400/834, loss: 0.33833, top1: 0.57802, throughput: 1314.17 | 2022-04-10 23:42:04.479 [rank:5] [train], epoch: 18/50, iter: 500/834, loss: 0.33708, top1: 0.58297, throughput: 1314.80 | 2022-04-10 23:42:19.078 [rank:4] [train], epoch: 18/50, iter: 500/834, loss: 0.33934, top1: 0.57526, throughput: 1315.06 | 2022-04-10 23:42:19.078 [rank:6] [train], epoch: 18/50, iter: 500/834, loss: 0.33740, top1: 0.58203, throughput: 1314.87 | 2022-04-10 23:42:19.078 [rank:1] [train], epoch: 18/50, iter: 500/834, loss: 0.33785, top1: 0.58036, throughput: 1315.01 | 2022-04-10 23:42:19.079 [rank:2] [train], epoch: 18/50, iter: 500/834, loss: 0.33711, top1: 0.58130, throughput: 1314.85 | 2022-04-10 23:42:19.079 [rank:3] [train], epoch: 18/50, iter: 500/834, loss: 0.33651, top1: 0.57714, throughput: 1314.96 | 2022-04-10 23:42:19.082 [rank:0] [train], epoch: 18/50, iter: 500/834, loss: 0.33983, top1: 0.57776, throughput: 1314.91 | 2022-04-10 23:42:19.081 [rank:7] [train], epoch: 18/50, iter: 500/834, loss: 0.33947, top1: 0.57698, throughput: 1314.70 | 2022-04-10 23:42:19.081 [rank:4] [train], epoch: 18/50, iter: 600/834, loss: 0.33971, top1: 0.57958, throughput: 1313.54 | 2022-04-10 23:42:33.695 [rank:6] [train], epoch: 18/50, iter: 600/834, loss: 0.34070, top1: 0.57448, throughput: 1313.56 | 2022-04-10 23:42:33.695 [rank:2] [train], epoch: 18/50, iter: 600/834, loss: 0.33768, top1: 0.57917, throughput: 1313.40 | 2022-04-10 23:42:33.698 [rank:5] [train], epoch: 18/50, iter: 600/834, loss: 0.33734, top1: 0.58312, throughput: 1313.37 | 2022-04-10 23:42:33.697 [rank:3] [train], epoch: 18/50, iter: 600/834, loss: 0.33743, top1: 0.58193, throughput: 1313.58 | 2022-04-10 23:42:33.698 [rank:7] [train], epoch: 18/50, iter: 600/834, loss: 0.34087, top1: 0.57589, throughput: 1313.72 | 2022-04-10 23:42:33.696 [rank:1] [train], epoch: 18/50, iter: 600/834, loss: 0.33806, top1: 0.57958, 
throughput: 1313.46 | 2022-04-10 23:42:33.697 [rank:0] [train], epoch: 18/50, iter: 600/834, loss: 0.33681, top1: 0.58443, throughput: 1313.49 | 2022-04-10 23:42:33.699 [rank:5] [train], epoch: 18/50, iter: 700/834, loss: 0.34122, top1: 0.57583, throughput: 1316.68 | 2022-04-10 23:42:48.279 [rank:7] [train], epoch: 18/50, iter: 700/834, loss: 0.33842, top1: 0.57880, throughput: 1316.50 | 2022-04-10 23:42:48.280 [rank:6] [train], epoch: 18/50, iter: 700/834, loss: 0.33989, top1: 0.57224, throughput: 1316.38 | 2022-04-10 23:42:48.280 [rank:1] [train], epoch: 18/50, iter: 700/834, loss: 0.33848, top1: 0.57828, throughput: 1316.57 | 2022-04-10 23:42:48.281 [rank:3] [train], epoch: 18/50, iter: 700/834, loss: 0.33820, top1: 0.57547, throughput: 1316.55 | 2022-04-10 23:42:48.282 [rank:4] [train], epoch: 18/50, iter: 700/834, loss: 0.33803, top1: 0.57828, throughput: 1316.30 | 2022-04-10 23:42:48.281 [rank:0] [train], epoch: 18/50, iter: 700/834, loss: 0.33885, top1: 0.57656, throughput: 1316.55 | 2022-04-10 23:42:48.282 [rank:2] [train], epoch: 18/50, iter: 700/834, loss: 0.33843, top1: 0.58141, throughput: 1316.48 | 2022-04-10 23:42:48.282 [rank:2] [train], epoch: 18/50, iter: 800/834, loss: 0.33981, top1: 0.57281, throughput: 1318.20 | 2022-04-10 23:43:02.848 [rank:5] [train], epoch: 18/50, iter: 800/834, loss: 0.34017, top1: 0.57714, throughput: 1317.99 | 2022-04-10 23:43:02.847 [rank:6] [train], epoch: 18/50, iter: 800/834, loss: 0.33596, top1: 0.58255, throughput: 1318.05 | 2022-04-10 23:43:02.847 [rank:3] [train], epoch: 18/50, iter: 800/834, loss: 0.33552, top1: 0.57974, throughput: 1317.88 | 2022-04-10 23:43:02.850 [rank:1] [train], epoch: 18/50, iter: 800/834, loss: 0.33790, top1: 0.57776, throughput: 1317.81 | 2022-04-10 23:43:02.850 [rank:0] [train], epoch: 18/50, iter: 800/834, loss: 0.33648, top1: 0.58172, throughput: 1318.09 | 2022-04-10 23:43:02.849 [rank:7] [train], epoch: 18/50, iter: 800/834, loss: 0.33500, top1: 0.58484, throughput: 1317.79 | 
2022-04-10 23:43:02.850 [rank:4] [train], epoch: 18/50, iter: 800/834, loss: 0.33556, top1: 0.58401, throughput: 1318.00 | 2022-04-10 23:43:02.849 [rank:5] [train], epoch: 18/50, iter: 834/834, loss: 0.33828, top1: 0.57384, throughput: 1289.20 | 2022-04-10 23:43:07.910 [rank:7] [train], epoch: 18/50, iter: 834/834, loss: 0.34110, top1: 0.57261, throughput: 1289.92 | 2022-04-10 23:43:07.911 [rank:6] [train], epoch: 18/50, iter: 834/834, loss: 0.34141, top1: 0.57552, throughput: 1289.24 | 2022-04-10 23:43:07.910 [rank:0] [train], epoch: 18/50, iter: 834/834, loss: 0.33766, top1: 0.58961, throughput: 1289.34 | 2022-04-10 23:43:07.912 [rank:2] [train], epoch: 18/50, iter: 834/834, loss: 0.33861, top1: 0.58150, throughput: 1288.90 | 2022-04-10 23:43:07.912 [rank:4] [train], epoch: 18/50, iter: 834/834, loss: 0.34199, top1: 0.57935, throughput: 1289.34 | 2022-04-10 23:43:07.912 [rank:1] [train], epoch: 18/50, iter: 834/834, loss: 0.33361, top1: 0.58732, throughput: 1289.44 | 2022-04-10 23:43:07.913 [rank:3] [train], epoch: 18/50, iter: 834/834, loss: 0.33526, top1: 0.58609, throughput: 1289.40 | 2022-04-10 23:43:07.913 [rank:7] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.60048, throughput: 584.82 | 2022-04-10 23:43:18.598 [rank:0] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.59552, throughput: 581.30 | 2022-04-10 23:43:18.663 [rank:2] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.58656, throughput: 579.63 | 2022-04-10 23:43:18.695 [rank:4] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.59568, throughput: 577.41 | 2022-04-10 23:43:18.736 [rank:6] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.58816, throughput: 577.04 | 2022-04-10 23:43:18.741 [rank:3] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.58368, throughput: 576.73 | 2022-04-10 23:43:18.750 [rank:5] [eval], epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.58256, throughput: 566.20 | 2022-04-10 23:43:18.949 [rank:1] [eval], 
epoch: 18/50, iter: 125/125, loss: 0.00000, top1: 0.59248, throughput: 564.38 | 2022-04-10 23:43:18.987 [rank:5] [train], epoch: 19/50, iter: 100/834, loss: 0.33141, top1: 0.59146, throughput: 1316.24 | 2022-04-10 23:43:33.536 [rank:1] [train], epoch: 19/50, iter: 100/834, loss: 0.32768, top1: 0.60078, throughput: 1319.61 | 2022-04-10 23:43:33.537 [rank:4] [train], epoch: 19/50, iter: 100/834, loss: 0.33155, top1: 0.59286, throughput: 1297.28 | 2022-04-10 23:43:33.536 [rank:2] [train], epoch: 19/50, iter: 100/834, loss: 0.33208, top1: 0.58865, throughput: 1293.70 | 2022-04-10 23:43:33.536 [rank:6] [train], epoch: 19/50, iter: 100/834, loss: 0.33136, top1: 0.59526, throughput: 1297.67 | 2022-04-10 23:43:33.537 [rank:0] [train], epoch: 19/50, iter: 100/834, loss: 0.32958, top1: 0.59839, throughput: 1290.83 | 2022-04-10 23:43:33.538 [rank:3] [train], epoch: 19/50, iter: 100/834, loss: 0.33515, top1: 0.58474, throughput: 1298.10 | 2022-04-10 23:43:33.541 [rank:7] [train], epoch: 19/50, iter: 100/834, loss: 0.33286, top1: 0.58771, throughput: 1284.82 | 2022-04-10 23:43:33.542 [rank:5] [train], epoch: 19/50, iter: 200/834, loss: 0.33198, top1: 0.59245, throughput: 1318.89 | 2022-04-10 23:43:48.094 [rank:6] [train], epoch: 19/50, iter: 200/834, loss: 0.33460, top1: 0.58401, throughput: 1318.84 | 2022-04-10 23:43:48.096 [rank:2] [train], epoch: 19/50, iter: 200/834, loss: 0.33450, top1: 0.58568, throughput: 1318.71 | 2022-04-10 23:43:48.096 [rank:3] [train], epoch: 19/50, iter: 200/834, loss: 0.33000, top1: 0.59344, throughput: 1319.06 | 2022-04-10 23:43:48.097 [rank:4] [train], epoch: 19/50, iter: 200/834, loss: 0.33616, top1: 0.58490, throughput: 1318.82 | 2022-04-10 23:43:48.095 [rank:7] [train], epoch: 19/50, iter: 200/834, loss: 0.33026, top1: 0.59365, throughput: 1319.19 | 2022-04-10 23:43:48.096 [rank:1] [train], epoch: 19/50, iter: 200/834, loss: 0.33178, top1: 0.59297, throughput: 1318.77 | 2022-04-10 23:43:48.096 [rank:0] [train], epoch: 19/50, iter: 200/834, 
loss: 0.32917, top1: 0.59464, throughput: 1318.77 | 2022-04-10 23:43:48.097 [rank:5] [train], epoch: 19/50, iter: 300/834, loss: 0.33532, top1: 0.58302, throughput: 1316.37 | 2022-04-10 23:44:02.679 [rank:4] [train], epoch: 19/50, iter: 300/834, loss: 0.33223, top1: 0.58781, throughput: 1316.46 | 2022-04-10 23:44:02.679 [rank:7] [train], epoch: 19/50, iter: 300/834, loss: 0.33105, top1: 0.59339, throughput: 1316.50 | 2022-04-10 23:44:02.680 [rank:3] [train], epoch: 19/50, iter: 300/834, loss: 0.33037, top1: 0.59510, throughput: 1316.42 | 2022-04-10 23:44:02.682 [rank:1] [train], epoch: 19/50, iter: 300/834, loss: 0.33381, top1: 0.59177, throughput: 1316.22 | 2022-04-10 23:44:02.683 [rank:0] [train], epoch: 19/50, iter: 300/834, loss: 0.33162, top1: 0.59047, throughput: 1316.47 | 2022-04-10 23:44:02.681 [rank:2] [train], epoch: 19/50, iter: 300/834, loss: 0.33559, top1: 0.58594, throughput: 1316.32 | 2022-04-10 23:44:02.682 [rank:6] [train], epoch: 19/50, iter: 300/834, loss: 0.33238, top1: 0.59141, throughput: 1316.33 | 2022-04-10 23:44:02.681 [rank:6] [train], epoch: 19/50, iter: 400/834, loss: 0.33035, top1: 0.59479, throughput: 1314.06 | 2022-04-10 23:44:17.293 [rank:0] [train], epoch: 19/50, iter: 400/834, loss: 0.33607, top1: 0.58276, throughput: 1313.97 | 2022-04-10 23:44:17.293 [rank:2] [train], epoch: 19/50, iter: 400/834, loss: 0.33450, top1: 0.58422, throughput: 1314.05 | 2022-04-10 23:44:17.293 [rank:4] [train], epoch: 19/50, iter: 400/834, loss: 0.33421, top1: 0.58724, throughput: 1313.84 | 2022-04-10 23:44:17.293 [rank:5] [train], epoch: 19/50, iter: 400/834, loss: 0.33381, top1: 0.58401, throughput: 1313.67 | 2022-04-10 23:44:17.295 [rank:1] [train], epoch: 19/50, iter: 400/834, loss: 0.33870, top1: 0.57552, throughput: 1313.86 | 2022-04-10 23:44:17.296 [rank:3] [train], epoch: 19/50, iter: 400/834, loss: 0.33211, top1: 0.59490, throughput: 1313.86 | 2022-04-10 23:44:17.295 [rank:7] [train], epoch: 19/50, iter: 400/834, loss: 0.33283, top1: 0.58849, 
throughput: 1313.81 | 2022-04-10 23:44:17.294 [rank:6] [train], epoch: 19/50, iter: 500/834, loss: 0.33596, top1: 0.58365, throughput: 1316.60 | 2022-04-10 23:44:31.876 [rank:5] [train], epoch: 19/50, iter: 500/834, loss: 0.33504, top1: 0.58995, throughput: 1316.46 | 2022-04-10 23:44:31.879 [rank:2] [train], epoch: 19/50, iter: 500/834, loss: 0.33218, top1: 0.58682, throughput: 1316.64 | 2022-04-10 23:44:31.876 [rank:1] [train], epoch: 19/50, iter: 500/834, loss: 0.33588, top1: 0.57969, throughput: 1316.75 | 2022-04-10 23:44:31.878 [rank:3] [train], epoch: 19/50, iter: 500/834, loss: 0.33806, top1: 0.58245, throughput: 1316.63 | 2022-04-10 23:44:31.878 [rank:0] [train], epoch: 19/50, iter: 500/834, loss: 0.33333, top1: 0.58297, throughput: 1316.37 | 2022-04-10 23:44:31.879 [rank:4] [train], epoch: 19/50, iter: 500/834, loss: 0.33339, top1: 0.59255, throughput: 1316.45 | 2022-04-10 23:44:31.878 [rank:7] [train], epoch: 19/50, iter: 500/834, loss: 0.33735, top1: 0.58156, throughput: 1316.30 | 2022-04-10 23:44:31.880 [rank:2] [train], epoch: 19/50, iter: 600/834, loss: 0.33420, top1: 0.58792, throughput: 1315.19 | 2022-04-10 23:44:46.475 [rank:6] [train], epoch: 19/50, iter: 600/834, loss: 0.33446, top1: 0.58714, throughput: 1315.21 | 2022-04-10 23:44:46.474 [rank:5] [train], epoch: 19/50, iter: 600/834, loss: 0.33760, top1: 0.58182, throughput: 1315.59 | 2022-04-10 23:44:46.473 [rank:4] [train], epoch: 19/50, iter: 600/834, loss: 0.33375, top1: 0.59062, throughput: 1315.40 | 2022-04-10 23:44:46.474 [rank:3] [train], epoch: 19/50, iter: 600/834, loss: 0.33610, top1: 0.58266, throughput: 1315.11 | 2022-04-10 23:44:46.477 [rank:1] [train], epoch: 19/50, iter: 600/834, loss: 0.33556, top1: 0.58016, throughput: 1315.15 | 2022-04-10 23:44:46.477 [rank:7] [train], epoch: 19/50, iter: 600/834, loss: 0.33421, top1: 0.58521, throughput: 1315.30 | 2022-04-10 23:44:46.478 [rank:0] [train], epoch: 19/50, iter: 600/834, loss: 0.33675, top1: 0.58104, throughput: 1315.13 | 
2022-04-10 23:44:46.478 [rank:1] [train], epoch: 19/50, iter: 700/834, loss: 0.33451, top1: 0.58318, throughput: 1317.57 | 2022-04-10 23:45:01.049 [rank:5] [train], epoch: 19/50, iter: 700/834, loss: 0.33625, top1: 0.58323, throughput: 1317.36 | 2022-04-10 23:45:01.048 [rank:6] [train], epoch: 19/50, iter: 700/834, loss: 0.33472, top1: 0.58891, throughput: 1317.47 | 2022-04-10 23:45:01.048 [rank:4] [train], epoch: 19/50, iter: 700/834, loss: 0.33527, top1: 0.58760, throughput: 1317.51 | 2022-04-10 23:45:01.047 [rank:2] [train], epoch: 19/50, iter: 700/834, loss: 0.33391, top1: 0.58443, throughput: 1317.50 | 2022-04-10 23:45:01.048 [rank:0] [train], epoch: 19/50, iter: 700/834, loss: 0.33508, top1: 0.58208, throughput: 1317.67 | 2022-04-10 23:45:01.049 [rank:3] [train], epoch: 19/50, iter: 700/834, loss: 0.33421, top1: 0.58604, throughput: 1317.34 | 2022-04-10 23:45:01.052 [rank:7] [train], epoch: 19/50, iter: 700/834, loss: 0.33662, top1: 0.58229, throughput: 1317.77 | 2022-04-10 23:45:01.048 [rank:2] [train], epoch: 19/50, iter: 800/834, loss: 0.33417, top1: 0.58797, throughput: 1314.40 | 2022-04-10 23:45:15.655 [rank:4] [train], epoch: 19/50, iter: 800/834, loss: 0.33403, top1: 0.58807, throughput: 1314.44 | 2022-04-10 23:45:15.654 [rank:5] [train], epoch: 19/50, iter: 800/834, loss: 0.33175, top1: 0.59562, throughput: 1314.45 | 2022-04-10 23:45:15.655 [rank:3] [train], epoch: 19/50, iter: 800/834, loss: 0.33577, top1: 0.58307, throughput: 1314.70 | 2022-04-10 23:45:15.656 [rank:6] [train], epoch: 19/50, iter: 800/834, loss: 0.33271, top1: 0.59349, throughput: 1314.45 | 2022-04-10 23:45:15.654 [rank:7] [train], epoch: 19/50, iter: 800/834, loss: 0.33494, top1: 0.58583, throughput: 1314.21 | 2022-04-10 23:45:15.657 [rank:0] [train], epoch: 19/50, iter: 800/834, loss: 0.33846, top1: 0.57667, throughput: 1314.27 | 2022-04-10 23:45:15.658 [rank:1] [train], epoch: 19/50, iter: 800/834, loss: 0.33474, top1: 0.58130, throughput: 1314.20 | 2022-04-10 23:45:15.659 
[rank:2] [train], epoch: 19/50, iter: 834/834, loss: 0.33247, top1: 0.59023, throughput: 1313.06 | 2022-04-10 23:45:20.627 [rank:5] [train], epoch: 19/50, iter: 834/834, loss: 0.33611, top1: 0.58793, throughput: 1312.43 | 2022-04-10 23:45:20.629 [rank:4] [train], epoch: 19/50, iter: 834/834, loss: 0.33348, top1: 0.58563, throughput: 1312.61 | 2022-04-10 23:45:20.627 [rank:1] [train], epoch: 19/50, iter: 834/834, loss: 0.33227, top1: 0.58349, throughput: 1313.67 | 2022-04-10 23:45:20.628 [rank:6] [train], epoch: 19/50, iter: 834/834, loss: 0.33699, top1: 0.57613, throughput: 1312.36 | 2022-04-10 23:45:20.629 [rank:7] [train], epoch: 19/50, iter: 834/834, loss: 0.33351, top1: 0.58931, throughput: 1312.97 | 2022-04-10 23:45:20.629 [rank:0] [train], epoch: 19/50, iter: 834/834, loss: 0.33435, top1: 0.58487, throughput: 1312.38 | 2022-04-10 23:45:20.632 [rank:3] [train], epoch: 19/50, iter: 834/834, loss: 0.34013, top1: 0.56694, throughput: 1311.70 | 2022-04-10 23:45:20.633 [rank:7] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.59312, throughput: 597.88 | 2022-04-10 23:45:31.083 [rank:6] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.60000, throughput: 590.30 | 2022-04-10 23:45:31.216 [rank:2] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.59232, throughput: 590.01 | 2022-04-10 23:45:31.220 [rank:4] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.58576, throughput: 588.90 | 2022-04-10 23:45:31.240 [rank:3] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.59344, throughput: 588.40 | 2022-04-10 23:45:31.255 [rank:0] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.60048, throughput: 582.97 | 2022-04-10 23:45:31.353 [rank:1] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.59760, throughput: 581.01 | 2022-04-10 23:45:31.385 [rank:5] [eval], epoch: 19/50, iter: 125/125, loss: 0.00000, top1: 0.57776, throughput: 578.77 | 2022-04-10 23:45:31.428 [rank:6] [train], epoch: 20/50, iter: 100/834, 
loss: 0.32850, top1: 0.59750, throughput: 1292.77 | 2022-04-10 23:45:46.068 [rank:4] [train], epoch: 20/50, iter: 100/834, loss: 0.33107, top1: 0.59323, throughput: 1294.84 | 2022-04-10 23:45:46.068 [rank:5] [train], epoch: 20/50, iter: 100/834, loss: 0.33125, top1: 0.59240, throughput: 1311.31 | 2022-04-10 23:45:46.069 [rank:3] [train], epoch: 20/50, iter: 100/834, loss: 0.32943, top1: 0.59375, throughput: 1295.86 | 2022-04-10 23:45:46.072 [rank:1] [train], epoch: 20/50, iter: 100/834, loss: 0.33087, top1: 0.59547, throughput: 1307.38 | 2022-04-10 23:45:46.071 [rank:0] [train], epoch: 20/50, iter: 100/834, loss: 0.32831, top1: 0.59818, throughput: 1304.64 | 2022-04-10 23:45:46.070 [rank:7] [train], epoch: 20/50, iter: 100/834, loss: 0.32803, top1: 0.59703, throughput: 1281.01 | 2022-04-10 23:45:46.071 [rank:2] [train], epoch: 20/50, iter: 100/834, loss: 0.32704, top1: 0.59599, throughput: 1292.79 | 2022-04-10 23:45:46.071 [rank:7] [train], epoch: 20/50, iter: 200/834, loss: 0.32780, top1: 0.59979, throughput: 1312.61 | 2022-04-10 23:46:00.699 [rank:5] [train], epoch: 20/50, iter: 200/834, loss: 0.32875, top1: 0.59359, throughput: 1312.54 | 2022-04-10 23:46:00.697 [rank:2] [train], epoch: 20/50, iter: 200/834, loss: 0.32619, top1: 0.59943, throughput: 1312.66 | 2022-04-10 23:46:00.698 [rank:4] [train], epoch: 20/50, iter: 200/834, loss: 0.32766, top1: 0.59953, throughput: 1312.44 | 2022-04-10 23:46:00.698 [rank:6] [train], epoch: 20/50, iter: 200/834, loss: 0.33220, top1: 0.58807, throughput: 1312.25 | 2022-04-10 23:46:00.700 [rank:1] [train], epoch: 20/50, iter: 200/834, loss: 0.32834, top1: 0.59917, throughput: 1312.36 | 2022-04-10 23:46:00.701 [rank:3] [train], epoch: 20/50, iter: 200/834, loss: 0.33038, top1: 0.59260, throughput: 1312.03 | 2022-04-10 23:46:00.705 [rank:0] [train], epoch: 20/50, iter: 200/834, loss: 0.33009, top1: 0.59521, throughput: 1311.95 | 2022-04-10 23:46:00.705 [rank:6] [train], epoch: 20/50, iter: 300/834, loss: 0.33146, top1: 0.58891, 
throughput: 1315.67 | 2022-04-10 23:46:15.293 [rank:4] [train], epoch: 20/50, iter: 300/834, loss: 0.33022, top1: 0.59286, throughput: 1315.39 | 2022-04-10 23:46:15.294 [rank:3] [train], epoch: 20/50, iter: 300/834, loss: 0.33080, top1: 0.59745, throughput: 1315.97 | 2022-04-10 23:46:15.295 [rank:2] [train], epoch: 20/50, iter: 300/834, loss: 0.33098, top1: 0.59297, throughput: 1315.44 | 2022-04-10 23:46:15.294 [rank:7] [train], epoch: 20/50, iter: 300/834, loss: 0.33298, top1: 0.58990, throughput: 1315.48 | 2022-04-10 23:46:15.294 [rank:5] [train], epoch: 20/50, iter: 300/834, loss: 0.33301, top1: 0.58875, throughput: 1315.33 | 2022-04-10 23:46:15.294 [rank:1] [train], epoch: 20/50, iter: 300/834, loss: 0.32895, top1: 0.59703, throughput: 1315.39 | 2022-04-10 23:46:15.298 [rank:0] [train], epoch: 20/50, iter: 300/834, loss: 0.33110, top1: 0.59385, throughput: 1315.72 | 2022-04-10 23:46:15.297 [rank:6] [train], epoch: 20/50, iter: 400/834, loss: 0.32935, top1: 0.59458, throughput: 1316.02 | 2022-04-10 23:46:29.882 [rank:3] [train], epoch: 20/50, iter: 400/834, loss: 0.33057, top1: 0.59297, throughput: 1316.07 | 2022-04-10 23:46:29.884 [rank:4] [train], epoch: 20/50, iter: 400/834, loss: 0.33012, top1: 0.59630, throughput: 1316.05 | 2022-04-10 23:46:29.883 [rank:5] [train], epoch: 20/50, iter: 400/834, loss: 0.33222, top1: 0.58896, throughput: 1316.05 | 2022-04-10 23:46:29.884 [rank:0] [train], epoch: 20/50, iter: 400/834, loss: 0.33412, top1: 0.58641, throughput: 1316.29 | 2022-04-10 23:46:29.884 [rank:2] [train], epoch: 20/50, iter: 400/834, loss: 0.33494, top1: 0.58776, throughput: 1315.87 | 2022-04-10 23:46:29.885 [rank:7] [train], epoch: 20/50, iter: 400/834, loss: 0.33160, top1: 0.58760, throughput: 1315.93 | 2022-04-10 23:46:29.884 [rank:1] [train], epoch: 20/50, iter: 400/834, loss: 0.32966, top1: 0.59500, throughput: 1316.32 | 2022-04-10 23:46:29.884 [rank:6] [train], epoch: 20/50, iter: 500/834, loss: 0.33060, top1: 0.59344, throughput: 1309.84 | 
2022-04-10 23:46:44.541 [rank:4] [train], epoch: 20/50, iter: 500/834, loss: 0.32902, top1: 0.59667, throughput: 1309.88 | 2022-04-10 23:46:44.541 [rank:2] [train], epoch: 20/50, iter: 500/834, loss: 0.33152, top1: 0.59401, throughput: 1309.98 | 2022-04-10 23:46:44.542 [rank:3] [train], epoch: 20/50, iter: 500/834, loss: 0.33297, top1: 0.58948, throughput: 1309.73 | 2022-04-10 23:46:44.544 [rank:5] [train], epoch: 20/50, iter: 500/834, loss: 0.33274, top1: 0.58740, throughput: 1309.76 | 2022-04-10 23:46:44.543 [rank:0] [train], epoch: 20/50, iter: 500/834, loss: 0.33357, top1: 0.58833, throughput: 1309.87 | 2022-04-10 23:46:44.542 [rank:1] [train], epoch: 20/50, iter: 500/834, loss: 0.32907, top1: 0.58995, throughput: 1309.65[rank:7] [train], epoch: 20/50, iter: 500/834, loss: 0.32964, top1: 0.59354, throughput: 1309.80 | 2022-04-10 23:46:44.543 | 2022-04-10 23:46:44.544 [rank:2] [train], epoch: 20/50, iter: 600/834, loss: 0.33020, top1: 0.59052, throughput: 1311.65 | 2022-04-10 23:46:59.180 [rank:5] [train], epoch: 20/50, iter: 600/834, loss: 0.33070, top1: 0.59130, throughput: 1311.82 | 2022-04-10 23:46:59.179 [rank:4] [train], epoch: 20/50, iter: 600/834, loss: 0.32950, top1: 0.59490, throughput: 1311.63 | 2022-04-10 23:46:59.179 [rank:1] [train], epoch: 20/50, iter: 600/834, loss: 0.33201, top1: 0.59214, throughput: 1311.85 | 2022-04-10 23:46:59.180 [rank:0] [train], epoch: 20/50, iter: 600/834, loss: 0.33511, top1: 0.58260, throughput: 1311.59 | 2022-04-10 23:46:59.180 [rank:6] [train], epoch: 20/50, iter: 600/834, loss: 0.33012, top1: 0.59432, throughput: 1311.57 | 2022-04-10 23:46:59.180 [rank:7] [train], epoch: 20/50, iter: 600/834, loss: 0.33065, top1: 0.59370, throughput: 1311.61 | 2022-04-10 23:46:59.182 [rank:3] [train], epoch: 20/50, iter: 600/834, loss: 0.32975, top1: 0.59542, throughput: 1311.41 | 2022-04-10 23:46:59.184 [rank:4] [train], epoch: 20/50, iter: 700/834, loss: 0.33124, top1: 0.58964, throughput: 1316.04 | 2022-04-10 23:47:13.768 [rank:6] 
[train], epoch: 20/50, iter: 700/834, loss: 0.33666, top1: 0.57917, throughput: 1315.92 | 2022-04-10 23:47:13.770 [rank:1] [train], epoch: 20/50, iter: 700/834, loss: 0.33305, top1: 0.58661, throughput: 1315.99 | 2022-04-10 23:47:13.770 [rank:5] [train], epoch: 20/50, iter: 700/834, loss: 0.33078, top1: 0.59458, throughput: 1315.89 | 2022-04-10 23:47:13.770 [rank:2] [train], epoch: 20/50, iter: 700/834, loss: 0.33150, top1: 0.59453, throughput: 1315.88 | 2022-04-10 23:47:13.771 [rank:7] [train], epoch: 20/50, iter: 700/834, loss: 0.32944, top1: 0.59443, throughput: 1315.96 | 2022-04-10 23:47:13.772 [rank:3] [train], epoch: 20/50, iter: 700/834, loss: 0.33058, top1: 0.59380, throughput: 1316.10 | 2022-04-10 23:47:13.773 [rank:0] [train], epoch: 20/50, iter: 700/834, loss: 0.33214, top1: 0.59005, throughput: 1315.81 | 2022-04-10 23:47:13.772 [rank:6] [train], epoch: 20/50, iter: 800/834, loss: 0.33241, top1: 0.58865, throughput: 1318.17 | 2022-04-10 23:47:28.336 [rank:3] [train], epoch: 20/50, iter: 800/834, loss: 0.33271, top1: 0.58766, throughput: 1318.11 | 2022-04-10 23:47:28.339 [rank:5] [train], epoch: 20/50, iter: 800/834, loss: 0.33299, top1: 0.58854, throughput: 1318.11 | 2022-04-10 23:47:28.336 [rank:2] [train], epoch: 20/50, iter: 800/834, loss: 0.33092, top1: 0.59016, throughput: 1317.96[rank:4] [train], epoch: 20/50, iter: 800/834, loss: 0.33233, top1: 0.58927, throughput: 1317.93 | 2022-04-10 23:47:28.337 | 2022-04-10 23:47:28.339 [rank:1] [train], epoch: 20/50, iter: 800/834, loss: 0.33366, top1: 0.58682, throughput: 1317.96 | 2022-04-10 23:47:28.338 [rank:0] [train], epoch: 20/50, iter: 800/834, loss: 0.33205, top1: 0.59219, throughput: 1318.22 | 2022-04-10 23:47:28.337 [rank:7] [train], epoch: 20/50, iter: 800/834, loss: 0.33539, top1: 0.58641, throughput: 1318.14 | 2022-04-10 23:47:28.338 [rank:4] [train], epoch: 20/50, iter: 834/834, loss: 0.33113, top1: 0.59467, throughput: 1307.39 | 2022-04-10 23:47:33.330 [rank:5] [train], epoch: 20/50, iter: 
834/834, loss: 0.33147, top1: 0.58992, throughput: 1307.23 | 2022-04-10 23:47:33.330 [rank:7] [train], epoch: 20/50, iter: 834/834, loss: 0.33340, top1: 0.58624, throughput: 1307.39 | 2022-04-10 23:47:33.331 [rank:0] [train], epoch: 20/50, iter: 834/834, loss: 0.33549, top1: 0.57981, throughput: 1307.20 | 2022-04-10 23:47:33.331 [rank:3] [train], epoch: 20/50, iter: 834/834, loss: 0.33580, top1: 0.58272, throughput: 1307.48 | 2022-04-10 23:47:33.332 [rank:2] [train], epoch: 20/50, iter: 834/834, loss: 0.33497, top1: 0.58670, throughput: 1307.12 | 2022-04-10 23:47:33.333 [rank:1] [train], epoch: 20/50, iter: 834/834, loss: 0.32931, top1: 0.59268, throughput: 1306.92 | 2022-04-10 23:47:33.333 [rank:6] [train], epoch: 20/50, iter: 834/834, loss: 0.32940, top1: 0.59498, throughput: 1306.13 | 2022-04-10 23:47:33.334 [rank:7] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.59168, throughput: 587.93 | 2022-04-10 23:47:43.961 [rank:0] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.60048, throughput: 587.53 | 2022-04-10 23:47:43.969 [rank:6] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.59648, throughput: 584.30 | 2022-04-10 23:47:44.030 [rank:2] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.58896, throughput: 583.34 | 2022-04-10 23:47:44.047 [rank:3] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.59152, throughput: 579.24 | 2022-04-10 23:47:44.122 [rank:4] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.59376, throughput: 578.97 | 2022-04-10 23:47:44.125 [rank:5] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.58336, throughput: 577.40 | 2022-04-10 23:47:44.154 [rank:1] [eval], epoch: 20/50, iter: 125/125, loss: 0.00000, top1: 0.60224, throughput: 575.20 | 2022-04-10 23:47:44.198 [rank:4] [train], epoch: 21/50, iter: 100/834, loss: 0.32581, top1: 0.60099, throughput: 1308.07 | 2022-04-10 23:47:58.803 [rank:5] [train], epoch: 21/50, iter: 100/834, loss: 0.32740, top1: 0.60005, 
throughput: 1310.92 | 2022-04-10 23:47:58.800 [rank:1] [train], epoch: 21/50, iter: 100/834, loss: 0.32342, top1: 0.60365, throughput: 1314.63 | 2022-04-10 23:47:58.803 [rank:7] [train], epoch: 21/50, iter: 100/834, loss: 0.31955, top1: 0.61052, throughput: 1293.68 | 2022-04-10 23:47:58.803 [rank:6] [train], epoch: 21/50, iter: 100/834, loss: 0.32918, top1: 0.59604, throughput: 1299.74 | 2022-04-10 23:47:58.802 [rank:2] [train], epoch: 21/50, iter: 100/834, loss: 0.32567, top1: 0.60026, throughput: 1301.10 | 2022-04-10 23:47:58.804 [rank:0] [train], epoch: 21/50, iter: 100/834, loss: 0.32848, top1: 0.59339, throughput: 1294.32 | 2022-04-10 23:47:58.803 [rank:3] [train], epoch: 21/50, iter: 100/834, loss: 0.32178, top1: 0.60734, throughput: 1307.59 | 2022-04-10 23:47:58.806 [rank:4] [train], epoch: 21/50, iter: 200/834, loss: 0.32955, top1: 0.59141, throughput: 1314.19 | 2022-04-10 23:48:13.413 [rank:6] [train], epoch: 21/50, iter: 200/834, loss: 0.32921, top1: 0.59464, throughput: 1313.96 | 2022-04-10 23:48:13.415 [rank:2] [train], epoch: 21/50, iter: 200/834, loss: 0.32508, top1: 0.60229, throughput: 1314.13 | 2022-04-10 23:48:13.414 [rank:5] [train], epoch: 21/50, iter: 200/834, loss: 0.32555, top1: 0.59849, throughput: 1313.92 | 2022-04-10 23:48:13.413 [rank:3] [train], epoch: 21/50, iter: 200/834, loss: 0.32774, top1: 0.59870, throughput: 1314.06 | 2022-04-10 23:48:13.417 [rank:7] [train], epoch: 21/50, iter: 200/834, loss: 0.32954, top1: 0.59750, throughput: 1314.02 | 2022-04-10 23:48:13.414 [rank:1] [train], epoch: 21/50, iter: 200/834, loss: 0.32991, top1: 0.59245, throughput: 1313.96 | 2022-04-10 23:48:13.416 [rank:0] [train], epoch: 21/50, iter: 200/834, loss: 0.32722, top1: 0.59729, throughput: 1313.92 | 2022-04-10 23:48:13.416 [rank:5] [train], epoch: 21/50, iter: 300/834, loss: 0.32716, top1: 0.59661, throughput: 1315.35 | 2022-04-10 23:48:28.010 [rank:2] [train], epoch: 21/50, iter: 300/834, loss: 0.32358, top1: 0.60781, throughput: 1315.32 | 
2022-04-10 23:48:28.011 [rank:6] [train], epoch: 21/50, iter: 300/834, loss: 0.32845, top1: 0.59802, throughput: 1315.29 | 2022-04-10 23:48:28.012 [rank:4] [train], epoch: 21/50, iter: 300/834, loss: 0.32868, top1: 0.59406, throughput: 1315.14 | 2022-04-10 23:48:28.012 [rank:0] [train], epoch: 21/50, iter: 300/834, loss: 0.32909, top1: 0.59672, throughput: 1315.39 | 2022-04-10 23:48:28.012 [rank:1] [train], epoch: 21/50, iter: 300/834, loss: 0.32614, top1: 0.59964, throughput: 1315.29 | 2022-04-10 23:48:28.013 [rank:3] [train], epoch: 21/50, iter: 300/834, loss: 0.32645, top1: 0.60026, throughput: 1315.40 | 2022-04-10 23:48:28.013 [rank:7] [train], epoch: 21/50, iter: 300/834, loss: 0.32833, top1: 0.59344, throughput: 1315.19 | 2022-04-10 23:48:28.013 [rank:6] [train], epoch: 21/50, iter: 400/834, loss: 0.32914, top1: 0.59594, throughput: 1314.83 | 2022-04-10 23:48:42.615 [rank:2] [train], epoch: 21/50, iter: 400/834, loss: 0.33087, top1: 0.59109, throughput: 1314.69 | 2022-04-10 23:48:42.616 [rank:1] [train], epoch: 21/50, iter: 400/834, loss: 0.32905, top1: 0.60021, throughput: 1314.79 | 2022-04-10 23:48:42.616 [rank:4] [train], epoch: 21/50, iter: 400/834, loss: 0.32894, top1: 0.59714, throughput: 1314.77 | 2022-04-10 23:48:42.615 [rank:3] [train], epoch: 21/50, iter: 400/834, loss: 0.32889, top1: 0.59719, throughput: 1314.77 | 2022-04-10 23:48:42.617 [rank:0] [train], epoch: 21/50, iter: 400/834, loss: 0.32654, top1: 0.59964, throughput: 1314.70 | 2022-04-10 23:48:42.616 [rank:5] [train], epoch: 21/50, iter: 400/834, loss: 0.32993, top1: 0.59474, throughput: 1314.49 | 2022-04-10 23:48:42.617 [rank:7] [train], epoch: 21/50, iter: 400/834, loss: 0.32792, top1: 0.59927, throughput: 1314.78 | 2022-04-10 23:48:42.616 [rank:5] [train], epoch: 21/50, iter: 500/834, loss: 0.32626, top1: 0.59958, throughput: 1314.17 | 2022-04-10 23:48:57.227 [rank:2] [train], epoch: 21/50, iter: 500/834, loss: 0.32731, top1: 0.59323, throughput: 1313.95 | 2022-04-10 23:48:57.228 
[rank:6] [train], epoch: 21/50, iter: 500/834, loss: 0.32985, top1: 0.59557, throughput: 1313.99 | 2022-04-10 23:48:57.227 [rank:4] [train], epoch: 21/50, iter: 500/834, loss: 0.32762, top1: 0.60135, throughput: 1313.99 [rank:3] [train], epoch: 21/50, iter: 500/834, loss: 0.32781, top1: 0.59771, throughput: 1313.91| 2022-04-10 23:48:57.227 | 2022-04-10 23:48:57.229 [rank:7] [train], epoch: 21/50, iter: 500/834, loss: 0.32780, top1: 0.60135, throughput: 1313.86 | 2022-04-10 23:48:57.230 [rank:1] [train], epoch: 21/50, iter: 500/834, loss: 0.32729, top1: 0.59964, throughput: 1313.87[rank:0] [train], epoch: 21/50, iter: 500/834, loss: 0.32854, top1: 0.59536, throughput: 1314.00 | 2022-04-10 23:48:57.229 | 2022-04-10 23:48:57.228 [rank:4] [train], epoch: 21/50, iter: 600/834, loss: 0.33127, top1: 0.59188, throughput: 1315.58 | 2022-04-10 23:49:11.822 [rank:6] [train], epoch: 21/50, iter: 600/834, loss: 0.32659, top1: 0.60068, throughput: 1315.55 | 2022-04-10 23:49:11.822 [rank:2] [train], epoch: 21/50, iter: 600/834, loss: 0.32732, top1: 0.59828, throughput: 1315.45 | 2022-04-10 23:49:11.824 [rank:5] [train], epoch: 21/50, iter: 600/834, loss: 0.32837, top1: 0.59646, throughput: 1315.34 | 2022-04-10 23:49:11.824 [rank:3] [train], epoch: 21/50, iter: 600/834, loss: 0.32891, top1: 0.59526, throughput: 1315.48 | 2022-04-10 23:49:11.825 [rank:1] [train], epoch: 21/50, iter: 600/834, loss: 0.32899, top1: 0.59542, throughput: 1315.40 | 2022-04-10 23:49:11.826 [rank:7] [train], epoch: 21/50, iter: 600/834, loss: 0.33049, top1: 0.59208, throughput: 1315.64 | 2022-04-10 23:49:11.823 [rank:0] [train], epoch: 21/50, iter: 600/834, loss: 0.32965, top1: 0.59438, throughput: 1315.40 | 2022-04-10 23:49:11.825 [rank:2] [train], epoch: 21/50, iter: 700/834, loss: 0.32792, top1: 0.59271, throughput: 1315.66 | 2022-04-10 23:49:26.417 [rank:5] [train], epoch: 21/50, iter: 700/834, loss: 0.32974, top1: 0.59505, throughput: 1315.70 | 2022-04-10 23:49:26.416 [rank:6] [train], epoch: 21/50, 
iter: 700/834, loss: 0.32937, top1: 0.59474, throughput: 1315.47 | 2022-04-10 23:49:26.417 [rank:4] [train], epoch: 21/50, iter: 700/834, loss: 0.32673, top1: 0.60250, throughput: 1315.35 | 2022-04-10 23:49:26.419 [rank:3] [train], epoch: 21/50, iter: 700/834, loss: 0.33020, top1: 0.59281, throughput: 1315.55 | 2022-04-10 23:49:26.420 [rank:1] [train], epoch: 21/50, iter: 700/834, loss: 0.33143, top1: 0.59583, throughput: 1315.51 | 2022-04-10 23:49:26.421 [rank:0] [train], epoch: 21/50, iter: 700/834, loss: 0.33288, top1: 0.58703, throughput: 1315.64 | 2022-04-10 23:49:26.418 [rank:7] [train], epoch: 21/50, iter: 700/834, loss: 0.33025, top1: 0.58937, throughput: 1315.34 | 2022-04-10 23:49:26.420 [rank:3] [train], epoch: 21/50, iter: 800/834, loss: 0.32810, top1: 0.59797, throughput: 1315.65 | 2022-04-10 23:49:41.013 [rank:4] [train], epoch: 21/50, iter: 800/834, loss: 0.32829, top1: 0.59724, throughput: 1315.56 | 2022-04-10 23:49:41.013 [rank:2] [train], epoch: 21/50, iter: 800/834, loss: 0.32898, top1: 0.59589, throughput: 1315.45 | 2022-04-10 23:49:41.013 [rank:7] [train], epoch: 21/50, iter: 800/834, loss: 0.32859, top1: 0.59786, throughput: 1315.75 | 2022-04-10 23:49:41.013 [rank:6] [train], epoch: 21/50, iter: 800/834, loss: 0.32759, top1: 0.59667, throughput: 1315.35 | 2022-04-10 23:49:41.014 [rank:5] [train], epoch: 21/50, iter: 800/834, loss: 0.32855, top1: 0.59120, throughput: 1315.13 | 2022-04-10 23:49:41.016 [rank:0] [train], epoch: 21/50, iter: 800/834, loss: 0.32751, top1: 0.59458, throughput: 1315.41 | 2022-04-10 23:49:41.014 [rank:1] [train], epoch: 21/50, iter: 800/834, loss: 0.32830, top1: 0.59807, throughput: 1315.47 | 2022-04-10 23:49:41.016 [rank:5] [train], epoch: 21/50, iter: 834/834, loss: 0.32963, top1: 0.59559, throughput: 1314.28[rank:6] [train], epoch: 21/50, iter: 834/834, loss: 0.32741, top1: 0.59452, throughput: 1314.07 | 2022-04-10 23:49:45.982| 2022-04-10 23:49:45.983 [rank:4] [train], epoch: 21/50, iter: 834/834, loss: 0.32570, 
top1: 0.60248, throughput: 1313.86 | 2022-04-10 23:49:45.982 [rank:2] [train], epoch: 21/50, iter: 834/834, loss: 0.32552, top1: 0.59988, throughput: 1313.75 | 2022-04-10 23:49:45.982 [rank:7] [train], epoch: 21/50, iter: 834/834, loss: 0.33123, top1: 0.59559, throughput: 1313.50 | 2022-04-10 23:49:45.983 [rank:0] [train], epoch: 21/50, iter: 834/834, loss: 0.32634, top1: 0.59850, throughput: 1313.82 | 2022-04-10 23:49:45.983 [rank:1] [train], epoch: 21/50, iter: 834/834, loss: 0.33081, top1: 0.58762, throughput: 1314.23 | 2022-04-10 23:49:45.984 [rank:3] [train], epoch: 21/50, iter: 834/834, loss: 0.32688, top1: 0.59635, throughput: 1313.34 | 2022-04-10 23:49:45.984 [rank:7] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.61808, throughput: 595.39 | 2022-04-10 23:49:56.480 [rank:0] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.62848, throughput: 595.34 | 2022-04-10 23:49:56.481 [rank:6] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.62048, throughput: 589.05 | 2022-04-10 23:49:56.592 [rank:3] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.60864, throughput: 586.18 | 2022-04-10 23:49:56.646 [rank:2] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.60560, throughput: 585.67 | 2022-04-10 23:49:56.653 [rank:4] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.61792, throughput: 582.17 | 2022-04-10 23:49:56.717 [rank:5] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.60848, throughput: 581.22 | 2022-04-10 23:49:56.736 [rank:1] [eval], epoch: 21/50, iter: 125/125, loss: 0.00000, top1: 0.61888, throughput: 579.73 | 2022-04-10 23:49:56.764 [rank:4] [train], epoch: 22/50, iter: 100/834, loss: 0.32271, top1: 0.60958, throughput: 1315.36 | 2022-04-10 23:50:11.314 [rank:5] [train], epoch: 22/50, iter: 100/834, loss: 0.32468, top1: 0.60375, throughput: 1316.96 | 2022-04-10 23:50:11.315 [rank:6] [train], epoch: 22/50, iter: 100/834, loss: 0.32457, top1: 0.60526, throughput: 1303.98 | 2022-04-10 
23:50:11.316 [rank:3] [train], epoch: 22/50, iter: 100/834, loss: 0.31815, top1: 0.61573, throughput: 1308.74 | 2022-04-10 23:50:11.317 [rank:1] [train], epoch: 22/50, iter: 100/834, loss: 0.32187, top1: 0.61005, throughput: 1319.34 | 2022-04-10 23:50:11.317 [rank:2] [train], epoch: 22/50, iter: 100/834, loss: 0.32142, top1: 0.61365, throughput: 1309.42 | 2022-04-10 23:50:11.316 [rank:7] [train], epoch: 22/50, iter: 100/834, loss: 0.32582, top1: 0.60276, throughput: 1294.19 | 2022-04-10 23:50:11.316 [rank:0] [train], epoch: 22/50, iter: 100/834, loss: 0.32509, top1: 0.60036, throughput: 1294.21 | 2022-04-10 23:50:11.317 [rank:6] [train], epoch: 22/50, iter: 200/834, loss: 0.32604, top1: 0.60010, throughput: 1313.77 | 2022-04-10 23:50:25.931 [rank:4] [train], epoch: 22/50, iter: 200/834, loss: 0.32444, top1: 0.60313, throughput: 1313.57 | 2022-04-10 23:50:25.931 [rank:2] [train], epoch: 22/50, iter: 200/834, loss: 0.32246, top1: 0.60630, throughput: 1313.67 | 2022-04-10 23:50:25.932 [rank:7] [train], epoch: 22/50, iter: 200/834, loss: 0.32581, top1: 0.60323, throughput: 1313.59 | 2022-04-10 23:50:25.932 [rank:5] [train], epoch: 22/50, iter: 200/834, loss: 0.32405, top1: 0.60344, throughput: 1313.49 | 2022-04-10 23:50:25.933 [rank:3] [train], epoch: 22/50, iter: 200/834, loss: 0.32356, top1: 0.60385, throughput: 1313.55 | 2022-04-10 23:50:25.933 [rank:1] [train], epoch: 22/50, iter: 200/834, loss: 0.32391, top1: 0.60401, throughput: 1313.53 | 2022-04-10 23:50:25.934 [rank:0] [train], epoch: 22/50, iter: 200/834, loss: 0.32150, top1: 0.60932, throughput: 1313.06 | 2022-04-10 23:50:25.939 [rank:5] [train], epoch: 22/50, iter: 300/834, loss: 0.32305, top1: 0.61099, throughput: 1316.45 | 2022-04-10 23:50:40.517 [rank:4] [train], epoch: 22/50, iter: 300/834, loss: 0.32326, top1: 0.60635, throughput: 1316.30 | 2022-04-10 23:50:40.517 [rank:3] [train], epoch: 22/50, iter: 300/834, loss: 0.32620, top1: 0.59771, throughput: 1316.30 | 2022-04-10 23:50:40.520 [rank:6] [train], 
epoch: 22/50, iter: 300/834, loss: 0.32629, top1: 0.60271, throughput: 1316.10 | 2022-04-10 23:50:40.519 [rank:7] [train], epoch: 22/50, iter: 300/834, loss: 0.32302, top1: 0.60542, throughput: 1316.16 | 2022-04-10 23:50:40.520 [rank:0] [train], epoch: 22/50, iter: 300/834, loss: 0.32556, top1: 0.60047, throughput: 1316.72 | 2022-04-10 23:50:40.521 [rank:2] [train], epoch: 22/50, iter: 300/834, loss: 0.32343, top1: 0.60620, throughput: 1315.90 | 2022-04-10 23:50:40.523 [rank:1] [train], epoch: 22/50, iter: 300/834, loss: 0.32547, top1: 0.60073, throughput: 1316.03 | 2022-04-10 23:50:40.524 [rank:6] [train], epoch: 22/50, iter: 400/834, loss: 0.32348, top1: 0.60250, throughput: 1315.88 | 2022-04-10 23:50:55.110 [rank:4] [train], epoch: 22/50, iter: 400/834, loss: 0.32669, top1: 0.60302, throughput: 1315.80 | 2022-04-10 23:50:55.109 [rank:1] [train], epoch: 22/50, iter: 400/834, loss: 0.32518, top1: 0.60052, throughput: 1316.31 | 2022-04-10 23:50:55.110 [rank:5] [train], epoch: 22/50, iter: 400/834, loss: 0.32377, top1: 0.60620, throughput: 1315.62 | 2022-04-10 23:50:55.111 [rank:2] [train], epoch: 22/50, iter: 400/834, loss: 0.32545, top1: 0.60661, throughput: 1316.14 | 2022-04-10 23:50:55.111 [rank:3] [train], epoch: 22/50, iter: 400/834, loss: 0.32551, top1: 0.60115, throughput: 1315.71 | 2022-04-10 23:50:55.113 [rank:0] [train], epoch: 22/50, iter: 400/834, loss: 0.32593, top1: 0.60182, throughput: 1315.76 | 2022-04-10 23:50:55.113 [rank:7] [train], epoch: 22/50, iter: 400/834, loss: 0.32310, top1: 0.60177, throughput: 1315.80 | 2022-04-10 23:50:55.112 [rank:5] [train], epoch: 22/50, iter: 500/834, loss: 0.32599, top1: 0.59932, throughput: 1316.77 | 2022-04-10 23:51:09.692 [rank:4] [train], epoch: 22/50, iter: 500/834, loss: 0.32589, top1: 0.60073, throughput: 1316.58 | 2022-04-10 23:51:09.692 [rank:2] [train], epoch: 22/50, iter: 500/834, loss: 0.32496, top1: 0.60193, throughput: 1316.40 | 2022-04-10 23:51:09.696 [rank:1] [train], epoch: 22/50, iter: 500/834, 
loss: 0.32676, top1: 0.59708, throughput: 1316.62 | 2022-04-10 23:51:09.693 [rank:7] [train], epoch: 22/50, iter: 500/834, loss: 0.32331, top1: 0.60422, throughput: 1316.81 | 2022-04-10 23:51:09.693 [rank:6] [train], epoch: 22/50, iter: 500/834, loss: 0.32485, top1: 0.60172, throughput: 1316.58 | 2022-04-10 23:51:09.693 [rank:0] [train], epoch: 22/50, iter: 500/834, loss: 0.32539, top1: 0.60161, throughput: 1316.80 | 2022-04-10 23:51:09.694 [rank:3] [train], epoch: 22/50, iter: 500/834, loss: 0.32725, top1: 0.59375, throughput: 1316.45 | 2022-04-10 23:51:09.697 [rank:6] [train], epoch: 22/50, iter: 600/834, loss: 0.32565, top1: 0.60292, throughput: 1317.01 | 2022-04-10 23:51:24.272 [rank:2] [train], epoch: 22/50, iter: 600/834, loss: 0.32420, top1: 0.60318, throughput: 1317.18 | 2022-04-10 23:51:24.273 [rank:3] [train], epoch: 22/50, iter: 600/834, loss: 0.32584, top1: 0.60188, throughput: 1317.12 | 2022-04-10 23:51:24.275 [rank:0] [train], epoch: 22/50, iter: 600/834, loss: 0.32567, top1: 0.60391, throughput: 1316.91 | 2022-04-10 23:51:24.273 [rank:7] [train], epoch: 22/50, iter: 600/834, loss: 0.32252, top1: 0.60786, throughput: 1316.78 | 2022-04-10 23:51:24.274 [rank:5] [train], epoch: 22/50, iter: 600/834, loss: 0.32338, top1: 0.60849, throughput: 1316.51 | 2022-04-10 23:51:24.276 [rank:4] [train], epoch: 22/50, iter: 600/834, loss: 0.32536, top1: 0.60849, throughput: 1316.50 | 2022-04-10 23:51:24.276 [rank:1] [train], epoch: 22/50, iter: 600/834, loss: 0.32480, top1: 0.60078, throughput: 1316.54 | 2022-04-10 23:51:24.276 [rank:2] [train], epoch: 22/50, iter: 700/834, loss: 0.32467, top1: 0.60661, throughput: 1314.30 | 2022-04-10 23:51:38.881 [rank:6] [train], epoch: 22/50, iter: 700/834, loss: 0.32959, top1: 0.59922, throughput: 1314.23 | 2022-04-10 23:51:38.881 [rank:4] [train], epoch: 22/50, iter: 700/834, loss: 0.32746, top1: 0.59891, throughput: 1314.63 | 2022-04-10 23:51:38.881 [rank:7] [train], epoch: 22/50, iter: 700/834, loss: 0.32735, top1: 0.59792, 
throughput: 1314.26 | 2022-04-10 23:51:38.883 [rank:3] [train], epoch: 22/50, iter: 700/834, loss: 0.32837, top1: 0.59911, throughput: 1314.26 | 2022-04-10 23:51:38.884 [rank:1] [train], epoch: 22/50, iter: 700/834, loss: 0.32642, top1: 0.60182, throughput: 1314.32 | 2022-04-10 23:51:38.885 [rank:5] [train], epoch: 22/50, iter: 700/834, loss: 0.32738, top1: 0.59708, throughput: 1314.37 | 2022-04-10 23:51:38.884 [rank:0] [train], epoch: 22/50, iter: 700/834, loss: 0.32371, top1: 0.60729, throughput: 1314.19 | 2022-04-10 23:51:38.883 [rank:5] [train], epoch: 22/50, iter: 800/834, loss: 0.32701, top1: 0.59995, throughput: 1316.32 | 2022-04-10 23:51:53.470 [rank:4] [train], epoch: 22/50, iter: 800/834, loss: 0.32541, top1: 0.59552, throughput: 1316.04 | 2022-04-10 23:51:53.470 [rank:2] [train], epoch: 22/50, iter: 800/834, loss: 0.32557, top1: 0.60078, throughput: 1316.10 | 2022-04-10 23:51:53.470 [rank:6] [train], epoch: 22/50, iter: 800/834, loss: 0.32470, top1: 0.60135, throughput: 1316.00 | 2022-04-10 23:51:53.471 [rank:1] [train], epoch: 22/50, iter: 800/834, loss: 0.32563, top1: 0.60495, throughput: 1316.37 | 2022-04-10 23:51:53.470 [rank:3] [train], epoch: 22/50, iter: 800/834, loss: 0.32856, top1: 0.59844, throughput: 1316.11 | 2022-04-10 23:51:53.472 [rank:0] [train], epoch: 22/50, iter: 800/834, loss: 0.32339, top1: 0.60318, throughput: 1316.10 | 2022-04-10 23:51:53.472 [rank:7] [train], epoch: 22/50, iter: 800/834, loss: 0.32629, top1: 0.60078, throughput: 1316.03 | 2022-04-10 23:51:53.472 [rank:6] [train], epoch: 22/50, iter: 834/834, loss: 0.32968, top1: 0.59069, throughput: 1309.09 | 2022-04-10 23:51:58.458 [rank:5] [train], epoch: 22/50, iter: 834/834, loss: 0.32598, top1: 0.60355, throughput: 1308.56 | 2022-04-10 23:51:58.459 [rank:0] [train], epoch: 22/50, iter: 834/834, loss: 0.32333, top1: 0.60738, throughput: 1308.85 | 2022-04-10 23:51:58.459 [rank:1] [train], epoch: 22/50, iter: 834/834, loss: 0.31985, top1: 0.60815, throughput: 1308.27 | 
2022-04-10 23:51:58.460 [rank:4] [train], epoch: 22/50, iter: 834/834, loss: 0.32711, top1: 0.59988, throughput: 1308.13 | 2022-04-10 23:51:58.461 [rank:3] [train], epoch: 22/50, iter: 834/834, loss: 0.32439, top1: 0.60248, throughput: 1308.70 | 2022-04-10 23:51:58.460 [rank:2] [train], epoch: 22/50, iter: 834/834, loss: 0.32501, top1: 0.60248, throughput: 1308.05 | 2022-04-10 23:51:58.461 [rank:7] [train], epoch: 22/50, iter: 834/834, loss: 0.32569, top1: 0.60141, throughput: 1308.38 | 2022-04-10 23:51:58.461 [rank:0] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.58928, throughput: 584.85 | 2022-04-10 23:52:09.146 [rank:7] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.58272, throughput: 583.68 | 2022-04-10 23:52:09.169 [rank:2] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.56768, throughput: 578.62 | 2022-04-10 23:52:09.262 [rank:6] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.57664, throughput: 577.74 | 2022-04-10 23:52:09.276 [rank:4] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.58320, throughput: 576.10 | 2022-04-10 23:52:09.310 [rank:3] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.56928, throughput: 574.47 | 2022-04-10 23:52:09.340 [rank:5] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.56688, throughput: 570.36 | 2022-04-10 23:52:09.417 [rank:1] [eval], epoch: 22/50, iter: 125/125, loss: 0.00000, top1: 0.58816, throughput: 566.19 | 2022-04-10 23:52:09.499 [rank:6] [train], epoch: 23/50, iter: 100/834, loss: 0.32413, top1: 0.60021, throughput: 1299.56 | 2022-04-10 23:52:24.050 [rank:4] [train], epoch: 23/50, iter: 100/834, loss: 0.32333, top1: 0.60891, throughput: 1302.47 | 2022-04-10 23:52:24.051 [rank:3] [train], epoch: 23/50, iter: 100/834, loss: 0.31843, top1: 0.61365, throughput: 1304.97 | 2022-04-10 23:52:24.053 [rank:1] [train], epoch: 23/50, iter: 100/834, loss: 0.31791, top1: 0.62099, throughput: 1319.30 | 2022-04-10 23:52:24.052 [rank:5] [train], 
epoch: 23/50, iter: 100/834, loss: 0.32399, top1: 0.60542, throughput: 1312.05 | 2022-04-10 23:52:24.051 [rank:2] [train], epoch: 23/50, iter: 100/834, loss: 0.32104, top1: 0.60724, throughput: 1298.10 | 2022-04-10 23:52:24.053 [rank:0] [train], epoch: 23/50, iter: 100/834, loss: 0.32182, top1: 0.61141, throughput: 1287.92 | 2022-04-10 23:52:24.054 [rank:7] [train], epoch: 23/50, iter: 100/834, loss: 0.31990, top1: 0.61307, throughput: 1289.98 | 2022-04-10 23:52:24.053 [rank:6] [train], epoch: 23/50, iter: 200/834, loss: 0.32225, top1: 0.60896, throughput: 1314.32 | 2022-04-10 23:52:38.658 [rank:2] [train], epoch: 23/50, iter: 200/834, loss: 0.32007, top1: 0.61219, throughput: 1314.61 | 2022-04-10 23:52:38.658 [rank:1] [train], epoch: 23/50, iter: 200/834, loss: 0.32321, top1: 0.60500, throughput: 1314.42 | 2022-04-10 23:52:38.659 [rank:5] [train], epoch: 23/50, iter: 200/834, loss: 0.32154, top1: 0.60698, throughput: 1314.40 | 2022-04-10 23:52:38.658 [rank:0] [train], epoch: 23/50, iter: 200/834, loss: 0.32038, top1: 0.61182, throughput: 1314.63 | 2022-04-10 23:52:38.658 [rank:7] [train], epoch: 23/50, iter: 200/834, loss: 0.32007, top1: 0.61042, throughput: 1314.54 | 2022-04-10 23:52:38.659 [rank:4] [train], epoch: 23/50, iter: 200/834, loss: 0.32094, top1: 0.61187, throughput: 1314.26 | 2022-04-10 23:52:38.660 [rank:3] [train], epoch: 23/50, iter: 200/834, loss: 0.31611, top1: 0.61969, throughput: 1314.33 | 2022-04-10 23:52:38.661 [rank:5] [train], epoch: 23/50, iter: 300/834, loss: 0.32193, top1: 0.61031, throughput: 1314.17 | 2022-04-10 23:52:53.268 [rank:6] [train], epoch: 23/50, iter: 300/834, loss: 0.32068, top1: 0.61104, throughput: 1314.10 | 2022-04-10 23:52:53.269 [rank:2] [train], epoch: 23/50, iter: 300/834, loss: 0.32380, top1: 0.60547, throughput: 1314.01 | 2022-04-10 23:52:53.270 [rank:1] [train], epoch: 23/50, iter: 300/834, loss: 0.32406, top1: 0.60896, throughput: 1314.03[rank:3] [train], epoch: 23/50, iter: 300/834, loss: 0.31909, top1: 0.61635, 
throughput: 1313.97 | 2022-04-10 23:52:53.273 | 2022-04-10 23:52:53.271 [rank:0] [train], epoch: 23/50, iter: 300/834, loss: 0.32094, top1: 0.61104, throughput: 1313.88 | 2022-04-10 23:52:53.272 [rank:4] [train], epoch: 23/50, iter: 300/834, loss: 0.32256, top1: 0.60849, throughput: 1314.06 | 2022-04-10 23:52:53.271 [rank:7] [train], epoch: 23/50, iter: 300/834, loss: 0.32281, top1: 0.60536, throughput: 1313.95 | 2022-04-10 23:52:53.272 [rank:6] [train], epoch: 23/50, iter: 400/834, loss: 0.32176, top1: 0.61115, throughput: 1314.57 | 2022-04-10 23:53:07.874 [rank:5] [train], epoch: 23/50, iter: 400/834, loss: 0.32075, top1: 0.61208, throughput: 1314.66 | 2022-04-10 23:53:07.873 [rank:4] [train], epoch: 23/50, iter: 400/834, loss: 0.32052, top1: 0.61156, throughput: 1314.78 | 2022-04-10 23:53:07.874 [rank:0] [train], epoch: 23/50, iter: 400/834, loss: 0.31912, top1: 0.61391, throughput: 1314.86 | 2022-04-10 23:53:07.874 [rank:1] [train], epoch: 23/50, iter: 400/834, loss: 0.32374, top1: 0.60797, throughput: 1314.75 | 2022-04-10 23:53:07.874 [rank:7] [train], epoch: 23/50, iter: 400/834, loss: 0.32357, top1: 0.60750, throughput: 1314.89 | 2022-04-10 23:53:07.874 [rank:3] [train], epoch: 23/50, iter: 400/834, loss: 0.32230, top1: 0.60693, throughput: 1314.69 | 2022-04-10 23:53:07.877 [rank:2] [train], epoch: 23/50, iter: 400/834, loss: 0.32335, top1: 0.60661, throughput: 1314.53 | 2022-04-10 23:53:07.876 [rank:4] [train], epoch: 23/50, iter: 500/834, loss: 0.32203, top1: 0.60630, throughput: 1313.87 | 2022-04-10 23:53:22.487 [rank:5] [train], epoch: 23/50, iter: 500/834, loss: 0.32431, top1: 0.60156, throughput: 1313.36 | 2022-04-10 23:53:22.492 [rank:1] [train], epoch: 23/50, iter: 500/834, loss: 0.32478, top1: 0.60490, throughput: 1313.76 | 2022-04-10 23:53:22.489 [rank:3] [train], epoch: 23/50, iter: 500/834, loss: 0.32071, top1: 0.60865, throughput: 1313.87 | 2022-04-10 23:53:22.491 [rank:2] [train], epoch: 23/50, iter: 500/834, loss: 0.32570, top1: 0.59818, 
throughput: 1313.78 | 2022-04-10 23:53:22.490 [rank:6] [train], epoch: 23/50, iter: 500/834, loss: 0.32130, top1: 0.60891, throughput: 1313.51 | 2022-04-10 23:53:22.492 [rank:0] [train], epoch: 23/50, iter: 500/834, loss: 0.32442, top1: 0.60370, throughput: 1313.61 | 2022-04-10 23:53:22.490 [rank:7] [train], epoch: 23/50, iter: 500/834, loss: 0.32272, top1: 0.60917, throughput: 1313.61 | 2022-04-10 23:53:22.490 [rank:6] [train], epoch: 23/50, iter: 600/834, loss: 0.32360, top1: 0.60495, throughput: 1315.89 | 2022-04-10 23:53:37.083 [rank:5] [train], epoch: 23/50, iter: 600/834, loss: 0.32429, top1: 0.60068, throughput: 1315.91 | 2022-04-10 23:53:37.082 [rank:2] [train], epoch: 23/50, iter: 600/834, loss: 0.32217, top1: 0.60745, throughput: 1315.64 | 2022-04-10 23:53:37.084 [rank:3] [train], epoch: 23/50, iter: 600/834, loss: 0.32251, top1: 0.60698, throughput: 1315.43 | 2022-04-10 23:53:37.087 [rank:4] [train], epoch: 23/50, iter: 600/834, loss: 0.32443, top1: 0.60635, throughput: 1315.40 | 2022-04-10 23:53:37.083 [rank:1] [train], epoch: 23/50, iter: 600/834, loss: 0.32382, top1: 0.60380, throughput: 1315.27 | 2022-04-10 23:53:37.087 [rank:7] [train], epoch: 23/50, iter: 600/834, loss: 0.32368, top1: 0.60812, throughput: 1315.29 | 2022-04-10 23:53:37.087 [rank:0] [train], epoch: 23/50, iter: 600/834, loss: 0.32173, top1: 0.60677, throughput: 1315.30 | 2022-04-10 23:53:37.088 [rank:4] [train], epoch: 23/50, iter: 700/834, loss: 0.32082, top1: 0.60490, throughput: 1314.99 | 2022-04-10 23:53:51.684 [rank:1] [train], epoch: 23/50, iter: 700/834, loss: 0.32227, top1: 0.60646, throughput: 1315.21 | 2022-04-10 23:53:51.685 [rank:2] [train], epoch: 23/50, iter: 700/834, loss: 0.32301, top1: 0.60531, throughput: 1314.83 | 2022-04-10 23:53:51.686 [rank:3] [train], epoch: 23/50, iter: 700/834, loss: 0.32587, top1: 0.59693, throughput: 1314.90 | 2022-04-10 23:53:51.688 [rank:5] [train], epoch: 23/50, iter: 700/834, loss: 0.32330, top1: 0.60714, throughput: 1314.49 | 
2022-04-10 23:53:51.689 [rank:6] [train], epoch: 23/50, iter: 700/834, loss: 0.32174, top1: 0.60828, throughput: 1314.50 | 2022-04-10 23:53:51.689 [rank:7] [train], epoch: 23/50, iter: 700/834, loss: 0.32235, top1: 0.61089, throughput: 1315.04 | 2022-04-10 23:53:51.688 [rank:0] [train], epoch: 23/50, iter: 700/834, loss: 0.32164, top1: 0.61224, throughput: 1315.00 | 2022-04-10 23:53:51.688 [rank:4] [train], epoch: 23/50, iter: 800/834, loss: 0.32248, top1: 0.60667, throughput: 1314.78 | 2022-04-10 23:54:06.288 [rank:2] [train], epoch: 23/50, iter: 800/834, loss: 0.32376, top1: 0.60635, throughput: 1314.64 | 2022-04-10 23:54:06.291 [rank:5] [train], epoch: 23/50, iter: 800/834, loss: 0.32227, top1: 0.61266, throughput: 1314.93 | 2022-04-10 23:54:06.290 [rank:3] [train], epoch: 23/50, iter: 800/834, loss: 0.32431, top1: 0.60536, throughput: 1314.64 | 2022-04-10 23:54:06.293 [rank:6] [train], epoch: 23/50, iter: 800/834, loss: 0.32499, top1: 0.60052, throughput: 1314.93 | 2022-04-10 23:54:06.291 [rank:1] [train], epoch: 23/50, iter: 800/834, loss: 0.32531, top1: 0.60510, throughput: 1314.55 | 2022-04-10 23:54:06.291 [rank:0] [train], epoch: 23/50, iter: 800/834, loss: 0.32494, top1: 0.60214, throughput: 1314.91 | 2022-04-10 23:54:06.290 [rank:7] [train], epoch: 23/50, iter: 800/834, loss: 0.32467, top1: 0.60271, throughput: 1314.72 | 2022-04-10 23:54:06.291 [rank:5] [train], epoch: 23/50, iter: 834/834, loss: 0.32039, top1: 0.60800, throughput: 1311.39 | 2022-04-10 23:54:11.268 [rank:4] [train], epoch: 23/50, iter: 834/834, loss: 0.31313, top1: 0.61688, throughput: 1310.68 | 2022-04-10 23:54:11.268 [rank:7] [train], epoch: 23/50, iter: 834/834, loss: 0.32513, top1: 0.60309, throughput: 1311.36 | 2022-04-10 23:54:11.269 [rank:6] [train], epoch: 23/50, iter: 834/834, loss: 0.32308, top1: 0.61550, throughput: 1310.92 | 2022-04-10 23:54:11.270 [rank:3] [train], epoch: 23/50, iter: 834/834, loss: 0.32505, top1: 0.60218, throughput: 1311.52 | 2022-04-10 23:54:11.271 
[rank:1] [train], epoch: 23/50, iter: 834/834, loss: 0.32007, top1: 0.60938, throughput: 1310.43 | 2022-04-10 23:54:11.272 [rank:2] [train], epoch: 23/50, iter: 834/834, loss: 0.31864, top1: 0.61198, throughput: 1310.25 | 2022-04-10 23:54:11.273 [rank:0] [train], epoch: 23/50, iter: 834/834, loss: 0.32896, top1: 0.59436, throughput: 1309.81 | 2022-04-10 23:54:11.274 [rank:7] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.59136, throughput: 583.88 | 2022-04-10 23:54:21.974 [rank:0] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.59552, throughput: 583.44 | 2022-04-10 23:54:21.986 [rank:4] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.58256, throughput: 581.16 | 2022-04-10 23:54:22.022 [rank:3] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.58576, throughput: 579.92 | 2022-04-10 23:54:22.048 [rank:2] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.57728, throughput: 579.71 | 2022-04-10 23:54:22.055 [rank:6] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.58976, throughput: 576.64 | 2022-04-10 23:54:22.109 [rank:1] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.59312, throughput: 569.60 | 2022-04-10 23:54:22.245 [rank:5] [eval], epoch: 23/50, iter: 125/125, loss: 0.00000, top1: 0.58160, throughput: 564.87 | 2022-04-10 23:54:22.333 [rank:6] [train], epoch: 24/50, iter: 100/834, loss: 0.31534, top1: 0.62349, throughput: 1301.25 | 2022-04-10 23:54:36.864 [rank:5] [train], epoch: 24/50, iter: 100/834, loss: 0.31691, top1: 0.61812, throughput: 1321.24 | 2022-04-10 23:54:36.864 [rank:1] [train], epoch: 24/50, iter: 100/834, loss: 0.31612, top1: 0.61698, throughput: 1313.20 | 2022-04-10 23:54:36.866 [rank:4] [train], epoch: 24/50, iter: 100/834, loss: 0.31704, top1: 0.61953, throughput: 1293.54 | 2022-04-10 23:54:36.866 [rank:7] [train], epoch: 24/50, iter: 100/834, loss: 0.31635, top1: 0.62094, throughput: 1289.28 | 2022-04-10 23:54:36.866 [rank:2] [train], epoch: 24/50, iter: 100/834, 
loss: 0.31763, top1: 0.61682, throughput: 1296.27 | 2022-04-10 23:54:36.866 [rank:0] [train], epoch: 24/50, iter: 100/834, loss: 0.31791, top1: 0.61443, throughput: 1290.28 | 2022-04-10 23:54:36.867 [rank:3] [train], epoch: 24/50, iter: 100/834, loss: 0.31730, top1: 0.61552, throughput: 1295.53 | 2022-04-10 23:54:36.868 [rank:6] [train], epoch: 24/50, iter: 200/834, loss: 0.31911, top1: 0.61161, throughput: 1313.78 | 2022-04-10 23:54:51.478 [rank:2] [train], epoch: 24/50, iter: 200/834, loss: 0.31753, top1: 0.61604, throughput: 1314.10 | 2022-04-10 23:54:51.477 [rank:4] [train], epoch: 24/50, iter: 200/834, loss: 0.31917, top1: 0.61536, throughput: 1313.86 | 2022-04-10 23:54:51.479 [rank:5] [train], epoch: 24/50, iter: 200/834, loss: 0.32043, top1: 0.61328, throughput: 1313.77 | 2022-04-10 23:54:51.479 [rank:1] [train], epoch: 24/50, iter: 200/834, loss: 0.31791, top1: 0.61943, throughput: 1313.88 | 2022-04-10 23:54:51.479 [rank:3] [train], epoch: 24/50, iter: 200/834, loss: 0.31596, top1: 0.62115, throughput: 1313.90 | 2022-04-10 23:54:51.481 [rank:7] [train], epoch: 24/50, iter: 200/834, loss: 0.32033, top1: 0.61292, throughput: 1313.95 | 2022-04-10 23:54:51.478 [rank:0] [train], epoch: 24/50, iter: 200/834, loss: 0.31957, top1: 0.61203, throughput: 1313.88 | 2022-04-10 23:54:51.480 [rank:5] [train], epoch: 24/50, iter: 300/834, loss: 0.31924, top1: 0.61078, throughput: 1313.52[rank:4] [train], epoch: 24/50, iter: 300/834, loss: 0.32077, top1: 0.61208, throughput: 1313.68 | 2022-04-10 23:55:06.096 | 2022-04-10 23:55:06.094 [rank:7] [train], epoch: 24/50, iter: 300/834, loss: 0.32047, top1: 0.61089, throughput: 1313.51 | 2022-04-10 23:55:06.096 [rank:2] [train], epoch: 24/50, iter: 300/834, loss: 0.32044, top1: 0.61182, throughput: 1313.43 | 2022-04-10 23:55:06.095 [rank:3] [train], epoch: 24/50, iter: 300/834, loss: 0.31598, top1: 0.61891, throughput: 1313.60 | 2022-04-10 23:55:06.097 [rank:1] [train], epoch: 24/50, iter: 300/834, loss: 0.31856, top1: 0.61870, 
throughput: 1313.35 | 2022-04-10 23:55:06.098 [rank:0] [train], epoch: 24/50, iter: 300/834, loss: 0.32105, top1: 0.61385, throughput: 1313.52 | 2022-04-10 23:55:06.097 [rank:6] [train], epoch: 24/50, iter: 300/834, loss: 0.31849, top1: 0.61297, throughput: 1313.40 | 2022-04-10 23:55:06.097 [rank:6] [train], epoch: 24/50, iter: 400/834, loss: 0.32040, top1: 0.61401, throughput: 1312.99 | 2022-04-10 23:55:20.720 [rank:2] [train], epoch: 24/50, iter: 400/834, loss: 0.31745, top1: 0.61708, throughput: 1312.71 | 2022-04-10 23:55:20.722 [rank:4] [train], epoch: 24/50, iter: 400/834, loss: 0.32029, top1: 0.61646, throughput: 1312.63 | 2022-04-10 23:55:20.721 [rank:5] [train], epoch: 24/50, iter: 400/834, loss: 0.31897, top1: 0.61271, throughput: 1312.75 | 2022-04-10 23:55:20.722 [rank:1] [train], epoch: 24/50, iter: 400/834, loss: 0.32287, top1: 0.60818, throughput: 1312.90 | 2022-04-10 23:55:20.722 [rank:0] [train], epoch: 24/50, iter: 400/834, loss: 0.31718, top1: 0.61526, throughput: 1312.85 | 2022-04-10 23:55:20.722 [rank:3] [train], epoch: 24/50, iter: 400/834, loss: 0.32089, top1: 0.60891, throughput: 1312.75 | 2022-04-10 23:55:20.723 [rank:7] [train], epoch: 24/50, iter: 400/834, loss: 0.32099, top1: 0.61432, throughput: 1312.69 | 2022-04-10 23:55:20.722 [rank:4] [train], epoch: 24/50, iter: 500/834, loss: 0.32074, top1: 0.61276, throughput: 1312.97 | 2022-04-10 23:55:35.345 [rank:5] [train], epoch: 24/50, iter: 500/834, loss: 0.31886, top1: 0.61635, throughput: 1312.87 | 2022-04-10 23:55:35.346 [rank:1] [train], epoch: 24/50, iter: 500/834, loss: 0.31520, top1: 0.62010, throughput: 1312.85 | 2022-04-10 23:55:35.347 [rank:2] [train], epoch: 24/50, iter: 500/834, loss: 0.32057, top1: 0.61229, throughput: 1312.86 | 2022-04-10 23:55:35.346 [rank:3] [train], epoch: 24/50, iter: 500/834, loss: 0.32082, top1: 0.61156, throughput: 1312.87 | 2022-04-10 23:55:35.348 [rank:6] [train], epoch: 24/50, iter: 500/834, loss: 0.31772, top1: 0.61573, throughput: 1312.68 | 
2022-04-10 23:55:35.346 [rank:0] [train], epoch: 24/50, iter: 500/834, loss: 0.31941, top1: 0.60906, throughput: 1312.78 | 2022-04-10 23:55:35.347 [rank:7] [train], epoch: 24/50, iter: 500/834, loss: 0.31859, top1: 0.61172, throughput: 1312.71 | 2022-04-10 23:55:35.348 [rank:5] [train], epoch: 24/50, iter: 600/834, loss: 0.32084, top1: 0.60505, throughput: 1312.90 | 2022-04-10 23:55:49.970 [rank:4] [train], epoch: 24/50, iter: 600/834, loss: 0.32279, top1: 0.60740, throughput: 1312.82 | 2022-04-10 23:55:49.970 [rank:2] [train], epoch: 24/50, iter: 600/834, loss: 0.32307, top1: 0.60708, throughput: 1312.87 | 2022-04-10 23:55:49.971 [rank:6] [train], epoch: 24/50, iter: 600/834, loss: 0.32032, top1: 0.61005, throughput: 1312.96 | 2022-04-10 23:55:49.970 [rank:3] [train], epoch: 24/50, iter: 600/834, loss: 0.31890, top1: 0.61260, throughput: 1312.88 | 2022-04-10 23:55:49.972 [rank:1] [train], epoch: 24/50, iter: 600/834, loss: 0.31888, top1: 0.61250, throughput: 1312.90 | 2022-04-10 23:55:49.971 [rank:7] [train], epoch: 24/50, iter: 600/834, loss: 0.32103, top1: 0.61255, throughput: 1313.04 | 2022-04-10 23:55:49.971 [rank:0] [train], epoch: 24/50, iter: 600/834, loss: 0.32026, top1: 0.61323, throughput: 1312.79 | 2022-04-10 23:55:49.973 [rank:3] [train], epoch: 24/50, iter: 700/834, loss: 0.31892, top1: 0.61099, throughput: 1312.93 | 2022-04-10 23:56:04.596 [rank:5] [train], epoch: 24/50, iter: 700/834, loss: 0.31968, top1: 0.61214, throughput: 1312.89 | 2022-04-10 23:56:04.595 [rank:6] [train], epoch: 24/50, iter: 700/834, loss: 0.32127, top1: 0.61401, throughput: 1312.71 | 2022-04-10 23:56:04.596 [rank:0] [train], epoch: 24/50, iter: 700/834, loss: 0.31790, top1: 0.61802, throughput: 1313.02 | 2022-04-10 23:56:04.596 [rank:4] [train], epoch: 24/50, iter: 700/834, loss: 0.31926, top1: 0.61359, throughput: 1312.77 | 2022-04-10 23:56:04.595 [rank:2] [train], epoch: 24/50, iter: 700/834, loss: 0.31924, top1: 0.61411, throughput: 1312.84 | 2022-04-10 23:56:04.595 
[rank:7] [train], epoch: 24/50, iter: 700/834, loss: 0.31760, top1: 0.61557, throughput: 1312.83 | 2022-04-10 23:56:04.596 [rank:1] [train], epoch: 24/50, iter: 700/834, loss: 0.31884, top1: 0.61333, throughput: 1312.60 | 2022-04-10 23:56:04.598 [rank:5] [train], epoch: 24/50, iter: 800/834, loss: 0.31727, top1: 0.61943, throughput: 1306.85 | 2022-04-10 23:56:19.287 [rank:4] [train], epoch: 24/50, iter: 800/834, loss: 0.32208, top1: 0.60964, throughput: 1306.92 | 2022-04-10 23:56:19.286 [rank:2] [train], epoch: 24/50, iter: 800/834, loss: 0.32183, top1: 0.61120, throughput: 1306.64 | 2022-04-10 23:56:19.290 [rank:3] [train], epoch: 24/50, iter: 800/834, loss: 0.32062, top1: 0.61151, throughput: 1306.76 | 2022-04-10 23:56:19.289 [rank:0] [train], epoch: 24/50, iter: 800/834, loss: 0.31903, top1: 0.61583, throughput: 1306.75 | 2022-04-10 23:56:19.288 [rank:1] [train], epoch: 24/50, iter: 800/834, loss: 0.32161, top1: 0.61401, throughput: 1306.77 | 2022-04-10 23:56:19.291 [rank:6] [train], epoch: 24/50, iter: 800/834, loss: 0.32218, top1: 0.60818, throughput: 1306.79 | 2022-04-10 23:56:19.289 [rank:7] [train], epoch: 24/50, iter: 800/834, loss: 0.31870, top1: 0.61339, throughput: 1306.78 | 2022-04-10 23:56:19.288 [rank:6] [train], epoch: 24/50, iter: 834/834, loss: 0.31621, top1: 0.61443, throughput: 1280.80 | 2022-04-10 23:56:24.385 [rank:7] [train], epoch: 24/50, iter: 834/834, loss: 0.31872, top1: 0.61259, throughput: 1280.58 | 2022-04-10 23:56:24.386 [rank:3] [train], epoch: 24/50, iter: 834/834, loss: 0.32091, top1: 0.61244, throughput: 1280.18 | 2022-04-10 23:56:24.388 [rank:0] [train], epoch: 24/50, iter: 834/834, loss: 0.32167, top1: 0.60156, throughput: 1279.98 | 2022-04-10 23:56:24.389 [rank:4] [train], epoch: 24/50, iter: 834/834, loss: 0.32269, top1: 0.60907, throughput: 1279.40 | 2022-04-10 23:56:24.389 [rank:5] [train], epoch: 24/50, iter: 834/834, loss: 0.31795, top1: 0.61688, throughput: 1279.45 | 2022-04-10 23:56:24.389 [rank:1] [train], epoch: 24/50, 
iter: 834/834, loss: 0.31720, top1: 0.61489, throughput: 1279.94 | 2022-04-10 23:56:24.391 [rank:2] [train], epoch: 24/50, iter: 834/834, loss: 0.32203, top1: 0.60126, throughput: 1279.79 | 2022-04-10 23:56:24.390 [rank:0] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.60480, throughput: 591.60 | 2022-04-10 23:56:34.953 [rank:7] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.60160, throughput: 590.01 | 2022-04-10 23:56:34.979 [rank:6] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.59808, throughput: 580.76 | 2022-04-10 23:56:35.147 [rank:2] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.59568, throughput: 580.93 | 2022-04-10 23:56:35.149 [rank:3] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.59296, throughput: 579.88 | 2022-04-10 23:56:35.166 [rank:1] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.60336, throughput: 579.27 | 2022-04-10 23:56:35.181 [rank:5] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.59152, throughput: 577.55 | 2022-04-10 23:56:35.210 [rank:4] [eval], epoch: 24/50, iter: 125/125, loss: 0.00000, top1: 0.59808, throughput: 574.33 | 2022-04-10 23:56:35.271 [rank:5] [train], epoch: 25/50, iter: 100/834, loss: 0.31170, top1: 0.63276, throughput: 1309.64 | 2022-04-10 23:56:49.871 [rank:2] [train], epoch: 25/50, iter: 100/834, loss: 0.31653, top1: 0.62307, throughput: 1304.21 | 2022-04-10 23:56:49.871 [rank:4] [train], epoch: 25/50, iter: 100/834, loss: 0.31350, top1: 0.62240, throughput: 1315.06 | 2022-04-10 23:56:49.871 [rank:3] [train], epoch: 25/50, iter: 100/834, loss: 0.31450, top1: 0.62130, throughput: 1305.57 | 2022-04-10 23:56:49.872 [rank:6] [train], epoch: 25/50, iter: 100/834, loss: 0.31310, top1: 0.62714, throughput: 1303.93 | 2022-04-10 23:56:49.872 [rank:1] [train], epoch: 25/50, iter: 100/834, loss: 0.31289, top1: 0.62922, throughput: 1306.84 | 2022-04-10 23:56:49.873 [rank:7] [train], epoch: 25/50, iter: 100/834, loss: 0.31203, top1: 0.63094, 
throughput: 1289.24 | 2022-04-10 23:56:49.872 [rank:0] [train], epoch: 25/50, iter: 100/834, loss: 0.31656, top1: 0.61844, throughput: 1286.89 | 2022-04-10 23:56:49.873 [rank:6] [train], epoch: 25/50, iter: 200/834, loss: 0.31359, top1: 0.62516, throughput: 1318.64 | 2022-04-10 23:57:04.432 [rank:5] [train], epoch: 25/50, iter: 200/834, loss: 0.31504, top1: 0.62052, throughput: 1318.60 | 2022-04-10 23:57:04.432 [rank:4] [train], epoch: 25/50, iter: 200/834, loss: 0.31319, top1: 0.62224, throughput: 1318.67 | 2022-04-10 23:57:04.431 [rank:3] [train], epoch: 25/50, iter: 200/834, loss: 0.31458, top1: 0.62417, throughput: 1318.65 | 2022-04-10 23:57:04.433 [rank:7] [train], epoch: 25/50, iter: 200/834, loss: 0.31428, top1: 0.62005, throughput: 1318.58 | 2022-04-10 23:57:04.433 [rank:1] [train], epoch: 25/50, iter: 200/834, loss: 0.31739, top1: 0.61453, throughput: 1318.62 | 2022-04-10 23:57:04.433 [rank:0] [train], epoch: 25/50, iter: 200/834, loss: 0.31474, top1: 0.61880, throughput: 1318.63 | 2022-04-10 23:57:04.433 [rank:2] [train], epoch: 25/50, iter: 200/834, loss: 0.31188, top1: 0.62693, throughput: 1318.47 | 2022-04-10 23:57:04.433 [rank:4] [train], epoch: 25/50, iter: 300/834, loss: 0.31803, top1: 0.61875, throughput: 1316.46 | 2022-04-10 23:57:19.016 [rank:6] [train], epoch: 25/50, iter: 300/834, loss: 0.31747, top1: 0.61724, throughput: 1316.59 | 2022-04-10 23:57:19.016 [rank:3] [train], epoch: 25/50, iter: 300/834, loss: 0.31355, top1: 0.62609, throughput: 1316.25 | 2022-04-10 23:57:19.020 [rank:7] [train], epoch: 25/50, iter: 300/834, loss: 0.31808, top1: 0.61302, throughput: 1316.50 | 2022-04-10 23:57:19.017 [rank:1] [train], epoch: 25/50, iter: 300/834, loss: 0.31448, top1: 0.62391, throughput: 1316.42 | 2022-04-10 23:57:19.018 [rank:0] [train], epoch: 25/50, iter: 300/834, loss: 0.31707, top1: 0.61609, throughput: 1316.56 | 2022-04-10 23:57:19.017 [rank:5] [train], epoch: 25/50, iter: 300/834, loss: 0.31666, top1: 0.61578, throughput: 1316.27 | 
2022-04-10 23:57:19.018 [rank:2] [train], epoch: 25/50, iter: 300/834, loss: 0.31778, top1: 0.61958, throughput: 1316.38 | 2022-04-10 23:57:19.018 [rank:6] [train], epoch: 25/50, iter: 400/834, loss: 0.31715, top1: 0.61500, throughput: 1308.87 | 2022-04-10 23:57:33.685 [rank:4] [train], epoch: 25/50, iter: 400/834, loss: 0.31708, top1: 0.61526, throughput: 1308.99 | 2022-04-10 23:57:33.684 [rank:2] [train], epoch: 25/50, iter: 400/834, loss: 0.31513, top1: 0.61948, throughput: 1309.12 | 2022-04-10 23:57:33.685 [rank:3] [train], epoch: 25/50, iter: 400/834, loss: 0.31611, top1: 0.61682, throughput: 1309.21 | 2022-04-10 23:57:33.685 [rank:5] [train], epoch: 25/50, iter: 400/834, loss: 0.31990, top1: 0.61120, throughput: 1309.18 | 2022-04-10 23:57:33.684 [rank:7] [train], epoch: 25/50, iter: 400/834, loss: 0.31671, top1: 0.61781, throughput: 1309.04 | 2022-04-10 23:57:33.684 [rank:1] [train], epoch: 25/50, iter: 400/834, loss: 0.31756, top1: 0.61823, throughput: 1308.93 | 2022-04-10 23:57:33.687 [rank:0] [train], epoch: 25/50, iter: 400/834, loss: 0.31489, top1: 0.61891, throughput: 1308.93 | 2022-04-10 23:57:33.685 [rank:6] [train], epoch: 25/50, iter: 500/834, loss: 0.31398, top1: 0.62438, throughput: 1314.41 | 2022-04-10 23:57:48.292 [rank:4] [train], epoch: 25/50, iter: 500/834, loss: 0.31785, top1: 0.61583, throughput: 1314.23 | 2022-04-10 23:57:48.293 [rank:5] [train], epoch: 25/50, iter: 500/834, loss: 0.31690, top1: 0.62026, throughput: 1314.08 | 2022-04-10 23:57:48.295 [rank:2] [train], epoch: 25/50, iter: 500/834, loss: 0.31909, top1: 0.61651, throughput: 1314.15 | 2022-04-10 23:57:48.295 [rank:1] [train], epoch: 25/50, iter: 500/834, loss: 0.31433, top1: 0.62177, throughput: 1314.24 | 2022-04-10 23:57:48.296 [rank:3] [train], epoch: 25/50, iter: 500/834, loss: 0.31654, top1: 0.61917, throughput: 1314.12 | 2022-04-10 23:57:48.295 [rank:0] [train], epoch: 25/50, iter: 500/834, loss: 0.31780, top1: 0.61750, throughput: 1314.30 | 2022-04-10 23:57:48.294 
[rank:7] [train], epoch: 25/50, iter: 500/834, loss: 0.31601, top1: 0.62036, throughput: 1314.07 | 2022-04-10 23:57:48.295 [rank:4] [train], epoch: 25/50, iter: 600/834, loss: 0.31933, top1: 0.61359, throughput: 1313.69 | 2022-04-10 23:58:02.908 [rank:6] [train], epoch: 25/50, iter: 600/834, loss: 0.31890, top1: 0.61036, throughput: 1313.68 | 2022-04-10 23:58:02.907 [rank:1] [train], epoch: 25/50, iter: 600/834, loss: 0.31710, top1: 0.62177, throughput: 1313.94 | 2022-04-10 23:58:02.908 [rank:7] [train], epoch: 25/50, iter: 600/834, loss: 0.31732, top1: 0.61703, throughput: 1313.76 | 2022-04-10 23:58:02.910 [rank:3] [train], epoch: 25/50, iter: 600/834, loss: 0.31608, top1: 0.62182, throughput: 1313.79 | 2022-04-10 23:58:02.910 [rank:5] [train], epoch: 25/50, iter: 600/834, loss: 0.31493, top1: 0.61984, throughput: 1313.75 | 2022-04-10 23:58:02.910 [rank:2] [train], epoch: 25/50, iter: 600/834, loss: 0.32027, top1: 0.61458, throughput: 1313.83 | 2022-04-10 23:58:02.909 [rank:0] [train], epoch: 25/50, iter: 600/834, loss: 0.31681, top1: 0.61656, throughput: 1313.58 | 2022-04-10 23:58:02.910 [rank:2] [train], epoch: 25/50, iter: 700/834, loss: 0.31659, top1: 0.61615, throughput: 1313.86 | 2022-04-10 23:58:17.522 [rank:6] [train], epoch: 25/50, iter: 700/834, loss: 0.31688, top1: 0.61948, throughput: 1313.80 | 2022-04-10 23:58:17.522 [rank:7] [train], epoch: 25/50, iter: 700/834, loss: 0.31786, top1: 0.61870, throughput: 1313.93 | 2022-04-10 23:58:17.522 [rank:0] [train], epoch: 25/50, iter: 700/834, loss: 0.31759, top1: 0.61797, throughput: 1313.98 | 2022-04-10 23:58:17.523 [rank:5] [train], epoch: 25/50, iter: 700/834, loss: 0.31973, top1: 0.61286, throughput: 1313.83 | 2022-04-10 23:58:17.523 [rank:4] [train], epoch: 25/50, iter: 700/834, loss: 0.31701, top1: 0.62333, throughput: 1313.80 | 2022-04-10 23:58:17.522 [rank:1] [train], epoch: 25/50, iter: 700/834, loss: 0.31449, top1: 0.62297, throughput: 1313.52 | 2022-04-10 23:58:17.526 [rank:3] [train], epoch: 25/50, 
iter: 700/834, loss: 0.31716, top1: 0.61672, throughput: 1313.69 | 2022-04-10 23:58:17.525 [rank:5] [train], epoch: 25/50, iter: 800/834, loss: 0.31817, top1: 0.61547, throughput: 1315.63 | 2022-04-10 23:58:32.117 [rank:3] [train], epoch: 25/50, iter: 800/834, loss: 0.31558, top1: 0.62115, throughput: 1315.54 | 2022-04-10 23:58:32.120 [rank:4] [train], epoch: 25/50, iter: 800/834, loss: 0.31662, top1: 0.62120, throughput: 1315.49 | 2022-04-10 23:58:32.118 [rank:2] [train], epoch: 25/50, iter: 800/834, loss: 0.31511, top1: 0.61589, throughput: 1315.44 | 2022-04-10 23:58:32.118 [rank:6] [train], epoch: 25/50, iter: 800/834, loss: 0.31473, top1: 0.62266, throughput: 1315.43 | 2022-04-10 23:58:32.117 [rank:1] [train], epoch: 25/50, iter: 800/834, loss: 0.31662, top1: 0.61797, throughput: 1315.70 | 2022-04-10 23:58:32.118 [rank:0] [train], epoch: 25/50, iter: 800/834, loss: 0.31718, top1: 0.61698, throughput: 1315.35 | 2022-04-10 23:58:32.119 [rank:7] [train], epoch: 25/50, iter: 800/834, loss: 0.31754, top1: 0.61542, throughput: 1315.23 | 2022-04-10 23:58:32.120 [rank:6] [train], epoch: 25/50, iter: 834/834, loss: 0.31999, top1: 0.60999, throughput: 1311.04 | 2022-04-10 23:58:37.097 [rank:5] [train], epoch: 25/50, iter: 834/834, loss: 0.31462, top1: 0.62623, throughput: 1310.66 | 2022-04-10 23:58:37.098 [rank:1] [train], epoch: 25/50, iter: 834/834, loss: 0.31308, top1: 0.63312, throughput: 1311.08 | 2022-04-10 23:58:37.098 [rank:7] [train], epoch: 25/50, iter: 834/834, loss: 0.31577, top1: 0.61259, throughput: 1311.47 | 2022-04-10 23:58:37.098 [rank:3] [train], epoch: 25/50, iter: 834/834, loss: 0.31457, top1: 0.62086, throughput: 1311.08 | 2022-04-10 23:58:37.099 [rank:4] [train], epoch: 25/50, iter: 834/834, loss: 0.31618, top1: 0.61811, throughput: 1310.55 | 2022-04-10 23:58:37.099 [rank:2] [train], epoch: 25/50, iter: 834/834, loss: 0.31700, top1: 0.61949, throughput: 1309.80 | 2022-04-10 23:58:37.102 [rank:0] [train], epoch: 25/50, iter: 834/834, loss: 0.31370, 
top1: 0.62102, throughput: 1310.12 | 2022-04-10 23:58:37.102 [rank:7] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.63408, throughput: 591.95 | 2022-04-10 23:58:47.656 [rank:4] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.62592, throughput: 588.77 | 2022-04-10 23:58:47.714 [rank:0] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.64000, throughput: 588.67 | 2022-04-10 23:58:47.719 [rank:2] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.63184, throughput: 588.63 | 2022-04-10 23:58:47.720 [rank:6] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.62480, throughput: 584.45 | 2022-04-10 23:58:47.791 [rank:3] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.62480, throughput: 583.52 | 2022-04-10 23:58:47.810 [rank:5] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.62128, throughput: 574.59 | 2022-04-10 23:58:47.975 [rank:1] [eval], epoch: 25/50, iter: 125/125, loss: 0.00000, top1: 0.63984, throughput: 573.99 | 2022-04-10 23:58:47.986 [rank:5] [train], epoch: 26/50, iter: 100/834, loss: 0.31214, top1: 0.63141, throughput: 1320.07 | 2022-04-10 23:59:02.520 [rank:4] [train], epoch: 26/50, iter: 100/834, loss: 0.31171, top1: 0.62677, throughput: 1296.76 | 2022-04-10 23:59:02.520 [rank:6] [train], epoch: 26/50, iter: 100/834, loss: 0.31393, top1: 0.62469, throughput: 1303.35 | 2022-04-10 23:59:02.522 [rank:3] [train], epoch: 26/50, iter: 100/834, loss: 0.31059, top1: 0.62979, throughput: 1304.65 | 2022-04-10 23:59:02.526 [rank:1] [train], epoch: 26/50, iter: 100/834, loss: 0.31194, top1: 0.63063, throughput: 1320.66[rank:0] [train], epoch: 26/50, iter: 100/834, loss: 0.30852, top1: 0.63734, throughput: 1296.96 | 2022-04-10 23:59:02.523| 2022-04-10 23:59:02.524 [rank:2] [train], epoch: 26/50, iter: 100/834, loss: 0.31250, top1: 0.62578, throughput: 1296.90 | 2022-04-10 23:59:02.524 [rank:7] [train], epoch: 26/50, iter: 100/834, loss: 0.30987, top1: 0.63307, throughput: 1291.49 | 2022-04-10 
23:59:02.523 [rank:2] [train], epoch: 26/50, iter: 200/834, loss: 0.31320, top1: 0.62547, throughput: 1304.13 | 2022-04-10 23:59:17.247 [rank:5] [train], epoch: 26/50, iter: 200/834, loss: 0.31365, top1: 0.62583, throughput: 1303.68 | 2022-04-10 23:59:17.248 [rank:1] [train], epoch: 26/50, iter: 200/834, loss: 0.31272, top1: 0.62260, throughput: 1304.04 [rank:0] [train], epoch: 26/50, iter: 200/834, loss: 0.30934, top1: 0.63042, throughput: 1303.99| 2022-04-10 23:59:17.248 | 2022-04-10 23:59:17.247 [rank:4] [train], epoch: 26/50, iter: 200/834, loss: 0.31353, top1: 0.62578, throughput: 1303.78 | 2022-04-10 23:59:17.246 [rank:6] [train], epoch: 26/50, iter: 200/834, loss: 0.31197, top1: 0.62552, throughput: 1303.91 | 2022-04-10 23:59:17.247 [rank:7] [train], epoch: 26/50, iter: 200/834, loss: 0.31201, top1: 0.62813, throughput: 1303.99 | 2022-04-10 23:59:17.247 [rank:3] [train], epoch: 26/50, iter: 200/834, loss: 0.31082, top1: 0.62781, throughput: 1304.24 | 2022-04-10 23:59:17.248 [rank:5] [train], epoch: 26/50, iter: 300/834, loss: 0.31321, top1: 0.62505, throughput: 1314.65 | 2022-04-10 23:59:31.852 [rank:2] [train], epoch: 26/50, iter: 300/834, loss: 0.31296, top1: 0.62401, throughput: 1314.54 | 2022-04-10 23:59:31.853 [rank:6] [train], epoch: 26/50, iter: 300/834, loss: 0.31264, top1: 0.62396, throughput: 1314.52 | 2022-04-10 23:59:31.853 [rank:1] [train], epoch: 26/50, iter: 300/834, loss: 0.31374, top1: 0.62594, throughput: 1314.51 | 2022-04-10 23:59:31.854 [rank:4] [train], epoch: 26/50, iter: 300/834, loss: 0.31357, top1: 0.62182, throughput: 1314.39 | 2022-04-10 23:59:31.854 [rank:3] [train], epoch: 26/50, iter: 300/834, loss: 0.31489, top1: 0.62224, throughput: 1314.32 | 2022-04-10 23:59:31.856 [rank:7] [train], epoch: 26/50, iter: 300/834, loss: 0.31338, top1: 0.62583, throughput: 1314.48 | 2022-04-10 23:59:31.854 [rank:0] [train], epoch: 26/50, iter: 300/834, loss: 0.31253, top1: 0.62625, throughput: 1314.41 | 2022-04-10 23:59:31.855 [rank:6] [train], 
epoch: 26/50, iter: 400/834, loss: 0.31312, top1: 0.62208, throughput: 1310.14 | 2022-04-10 23:59:46.508 [rank:5] [train], epoch: 26/50, iter: 400/834, loss: 0.31549, top1: 0.62292, throughput: 1310.08 | 2022-04-10 23:59:46.508 [rank:2] [train], epoch: 26/50, iter: 400/834, loss: 0.31251, top1: 0.62250, throughput: 1309.87 | 2022-04-10 23:59:46.511 [rank:3] [train], epoch: 26/50, iter: 400/834, loss: 0.31415, top1: 0.62125, throughput: 1310.21 | 2022-04-10 23:59:46.510 [rank:0] [train], epoch: 26/50, iter: 400/834, loss: 0.31325, top1: 0.62438, throughput: 1310.20 | 2022-04-10 23:59:46.509 [rank:4] [train], epoch: 26/50, iter: 400/834, loss: 0.31245, top1: 0.62609, throughput: 1310.04 | 2022-04-10 23:59:46.510 [rank:7] [train], epoch: 26/50, iter: 400/834, loss: 0.31582, top1: 0.61708, throughput: 1310.08 | 2022-04-10 23:59:46.509 [rank:1] [train], epoch: 26/50, iter: 400/834, loss: 0.31180, top1: 0.62547, throughput: 1309.91 | 2022-04-10 23:59:46.512 [rank:5] [train], epoch: 26/50, iter: 500/834, loss: 0.31609, top1: 0.61797, throughput: 1313.98 | 2022-04-11 00:00:01.120 [rank:1] [train], epoch: 26/50, iter: 500/834, loss: 0.31464, top1: 0.62120, throughput: 1314.23 | 2022-04-11 00:00:01.121 [rank:4] [train], epoch: 26/50, iter: 500/834, loss: 0.31225, top1: 0.62396, throughput: 1314.12 | 2022-04-11 00:00:01.121 [rank:3] [train], epoch: 26/50, iter: 500/834, loss: 0.31237, top1: 0.62214, throughput: 1313.92 | 2022-04-11 00:00:01.123 [rank:2] [train], epoch: 26/50, iter: 500/834, loss: 0.31347, top1: 0.62474, throughput: 1314.03 | 2022-04-11 00:00:01.122 [rank:6] [train], epoch: 26/50, iter: 500/834, loss: 0.31359, top1: 0.62536, throughput: 1313.95 | 2022-04-11 00:00:01.120 [rank:0] [train], epoch: 26/50, iter: 500/834, loss: 0.30843, top1: 0.63323, throughput: 1313.89 | 2022-04-11 00:00:01.122 [rank:7] [train], epoch: 26/50, iter: 500/834, loss: 0.31603, top1: 0.61604, throughput: 1313.82 | 2022-04-11 00:00:01.123 [rank:2] [train], epoch: 26/50, iter: 600/834, 
loss: 0.31349, top1: 0.62458, throughput: 1315.13 | 2022-04-11 00:00:15.722 [rank:6] [train], epoch: 26/50, iter: 600/834, loss: 0.31618, top1: 0.61995, throughput: 1314.87 | 2022-04-11 00:00:15.722 [rank:5] [train], epoch: 26/50, iter: 600/834, loss: 0.31376, top1: 0.62250, throughput: 1314.64 | 2022-04-11 00:00:15.725 [rank:3] [train], epoch: 26/50, iter: 600/834, loss: 0.31193, top1: 0.62938, throughput: 1315.02 | 2022-04-11 00:00:15.723 [rank:1] [train], epoch: 26/50, iter: 600/834, loss: 0.31685, top1: 0.61839, throughput: 1314.71 | 2022-04-11 00:00:15.725 [rank:0] [train], epoch: 26/50, iter: 600/834, loss: 0.31704, top1: 0.61943, throughput: 1314.88 | 2022-04-11 00:00:15.724 [rank:4] [train], epoch: 26/50, iter: 600/834, loss: 0.31231, top1: 0.62865, throughput: 1314.76 | 2022-04-11 00:00:15.724 [rank:7] [train], epoch: 26/50, iter: 600/834, loss: 0.31477, top1: 0.62125, throughput: 1314.76 | 2022-04-11 00:00:15.726 [rank:4] [train], epoch: 26/50, iter: 700/834, loss: 0.31344, top1: 0.62271, throughput: 1313.34 | 2022-04-11 00:00:30.343 [rank:3] [train], epoch: 26/50, iter: 700/834, loss: 0.31467, top1: 0.62156, throughput: 1313.15 | 2022-04-11 00:00:30.345 [rank:6] [train], epoch: 26/50, iter: 700/834, loss: 0.31499, top1: 0.62083, throughput: 1313.08 | 2022-04-11 00:00:30.344 [rank:0] [train], epoch: 26/50, iter: 700/834, loss: 0.31536, top1: 0.62115, throughput: 1313.26 | 2022-04-11 00:00:30.344 [rank:5] [train], epoch: 26/50, iter: 700/834, loss: 0.31442, top1: 0.62563, throughput: 1313.27 | 2022-04-11 00:00:30.345 [rank:7] [train], epoch: 26/50, iter: 700/834, loss: 0.31328, top1: 0.62286, throughput: 1313.44 | 2022-04-11 00:00:30.344 [rank:2] [train], epoch: 26/50, iter: 700/834, loss: 0.31327, top1: 0.62521, throughput: 1312.92 | 2022-04-11 00:00:30.346 [rank:1] [train], epoch: 26/50, iter: 700/834, loss: 0.31579, top1: 0.62115, throughput: 1313.08 | 2022-04-11 00:00:30.347 [rank:2] [train], epoch: 26/50, iter: 800/834, loss: 0.31389, top1: 0.62276, 
throughput: 1314.70 | 2022-04-11 00:00:44.950 [rank:4] [train], epoch: 26/50, iter: 800/834, loss: 0.31372, top1: 0.62698, throughput: 1314.59 | 2022-04-11 00:00:44.949 [rank:6] [train], epoch: 26/50, iter: 800/834, loss: 0.31387, top1: 0.62458, throughput: 1314.63 | 2022-04-11 00:00:44.949 [rank:5] [train], epoch: 26/50, iter: 800/834, loss: 0.31274, top1: 0.62641, throughput: 1314.66 | 2022-04-11 00:00:44.949 [rank:3] [train], epoch: 26/50, iter: 800/834, loss: 0.31298, top1: 0.62328, throughput: 1314.47 | 2022-04-11 00:00:44.951 [rank:7] [train], epoch: 26/50, iter: 800/834, loss: 0.31459, top1: 0.62589, throughput: 1314.36 | 2022-04-11 00:00:44.952 [rank:0] [train], epoch: 26/50, iter: 800/834, loss: 0.31374, top1: 0.62203, throughput: 1314.52 | 2022-04-11 00:00:44.950 [rank:1] [train], epoch: 26/50, iter: 800/834, loss: 0.31478, top1: 0.61734, throughput: 1314.50 | 2022-04-11 00:00:44.953 [rank:6] [train], epoch: 26/50, iter: 834/834, loss: 0.31418, top1: 0.62163, throughput: 1315.10 | 2022-04-11 00:00:49.913 [rank:4] [train], epoch: 26/50, iter: 834/834, loss: 0.31425, top1: 0.62485, throughput: 1314.77 | 2022-04-11 00:00:49.914 [rank:5] [train], epoch: 26/50, iter: 834/834, loss: 0.31414, top1: 0.62010, throughput: 1314.66 | 2022-04-11 00:00:49.915 [rank:0] [train], epoch: 26/50, iter: 834/834, loss: 0.31488, top1: 0.62040, throughput: 1314.91 | 2022-04-11 00:00:49.915 [rank:2] [train], epoch: 26/50, iter: 834/834, loss: 0.31114, top1: 0.63542, throughput: 1314.61 | 2022-04-11 00:00:49.915 [rank:1] [train], epoch: 26/50, iter: 834/834, loss: 0.31384, top1: 0.61795, throughput: 1315.42 | 2022-04-11 00:00:49.916 [rank:3] [train], epoch: 26/50, iter: 834/834, loss: 0.31327, top1: 0.61857, throughput: 1314.74 | 2022-04-11 00:00:49.916 [rank:7] [train], epoch: 26/50, iter: 834/834, loss: 0.31069, top1: 0.62592, throughput: 1314.98 | 2022-04-11 00:00:49.917 [rank:7] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.62688, throughput: 595.48 | 2022-04-11 
00:01:00.412 [rank:0] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.62608, throughput: 589.69 | 2022-04-11 00:01:00.513 [rank:6] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.62544, throughput: 585.44 | 2022-04-11 00:01:00.589 [rank:2] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.62416, throughput: 584.85 | 2022-04-11 00:01:00.602 [rank:3] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.62224, throughput: 581.98 | 2022-04-11 00:01:00.656 [rank:1] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.63632, throughput: 580.76 | 2022-04-11 00:01:00.678 [rank:4] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.63184, throughput: 576.89 | 2022-04-11 00:01:00.748 [rank:5] [eval], epoch: 26/50, iter: 125/125, loss: 0.00000, top1: 0.61872, throughput: 576.80 | 2022-04-11 00:01:00.750 [rank:5] [train], epoch: 27/50, iter: 100/834, loss: 0.30887, top1: 0.63219, throughput: 1321.27 | 2022-04-11 00:01:15.282 [rank:0] [train], epoch: 27/50, iter: 100/834, loss: 0.31269, top1: 0.62573, throughput: 1299.99 | 2022-04-11 00:01:15.283 [rank:2] [train], epoch: 27/50, iter: 100/834, loss: 0.30646, top1: 0.64104, throughput: 1307.77 | 2022-04-11 00:01:15.283 [rank:4] [train], epoch: 27/50, iter: 100/834, loss: 0.30835, top1: 0.63708, throughput: 1320.80 | 2022-04-11 00:01:15.284 [rank:1] [train], epoch: 27/50, iter: 100/834, loss: 0.30893, top1: 0.63349, throughput: 1314.32 | 2022-04-11 00:01:15.286 [rank:3] [train], epoch: 27/50, iter: 100/834, loss: 0.30597, top1: 0.64089, throughput: 1312.24 | 2022-04-11 00:01:15.287 [rank:6] [train], epoch: 27/50, iter: 100/834, loss: 0.30535, top1: 0.64260, throughput: 1306.68 | 2022-04-11 00:01:15.283 [rank:7] [train], epoch: 27/50, iter: 100/834, loss: 0.30689, top1: 0.63615, throughput: 1290.96 | 2022-04-11 00:01:15.285 [rank:6] [train], epoch: 27/50, iter: 200/834, loss: 0.30848, top1: 0.63448, throughput: 1317.45 | 2022-04-11 00:01:29.856 [rank:4] [train], epoch: 27/50, 
iter: 200/834, loss: 0.30831, top1: 0.63302, throughput: 1317.71 | 2022-04-11 00:01:29.855 [rank:1] [train], epoch: 27/50, iter: 200/834, loss: 0.30675, top1: 0.63568, throughput: 1317.67 | 2022-04-11 00:01:29.857 [rank:0] [train], epoch: 27/50, iter: 200/834, loss: 0.30831, top1: 0.63620, throughput: 1317.49 | 2022-04-11 00:01:29.856 [rank:5] [train], epoch: 27/50, iter: 200/834, loss: 0.30918, top1: 0.63432, throughput: 1317.39 | 2022-04-11 00:01:29.856 [rank:3] [train], epoch: 27/50, iter: 200/834, loss: 0.31130, top1: 0.62297, throughput: 1317.76 | 2022-04-11 00:01:29.857 [rank:7] [train], epoch: 27/50, iter: 200/834, loss: 0.30819, top1: 0.63698, throughput: 1317.69 | 2022-04-11 00:01:29.856 [rank:2] [train], epoch: 27/50, iter: 200/834, loss: 0.31165, top1: 0.63042, throughput: 1317.29 | 2022-04-11 00:01:29.859 [rank:4] [train], epoch: 27/50, iter: 300/834, loss: 0.30772, top1: 0.63807, throughput: 1314.97 | 2022-04-11 00:01:44.456 [rank:5] [train], epoch: 27/50, iter: 300/834, loss: 0.31027, top1: 0.62943, throughput: 1314.99 | 2022-04-11 00:01:44.457 [rank:1] [train], epoch: 27/50, iter: 300/834, loss: 0.30965, top1: 0.63385, throughput: 1314.94 | 2022-04-11 00:01:44.458 [rank:3] [train], epoch: 27/50, iter: 300/834, loss: 0.31115, top1: 0.63000, throughput: 1314.90 | 2022-04-11 00:01:44.459 [rank:0] [train], epoch: 27/50, iter: 300/834, loss: 0.30851, top1: 0.63240, throughput: 1314.89 | 2022-04-11 00:01:44.458 [rank:6] [train], epoch: 27/50, iter: 300/834, loss: 0.30996, top1: 0.62927, throughput: 1314.83 | 2022-04-11 00:01:44.459 [rank:2] [train], epoch: 27/50, iter: 300/834, loss: 0.30806, top1: 0.63703, throughput: 1315.05 | 2022-04-11 00:01:44.459 [rank:7] [train], epoch: 27/50, iter: 300/834, loss: 0.30538, top1: 0.63880, throughput: 1314.71 | 2022-04-11 00:01:44.460 [rank:6] [train], epoch: 27/50, iter: 400/834, loss: 0.31086, top1: 0.62776, throughput: 1318.21 | 2022-04-11 00:01:59.024 [rank:5] [train], epoch: 27/50, iter: 400/834, loss: 0.30857, 
top1: 0.63849, throughput: 1317.93 | 2022-04-11 00:01:59.025 [rank:4] [train], epoch: 27/50, iter: 400/834, loss: 0.31055, top1: 0.62750, throughput: 1317.74 | 2022-04-11 00:01:59.027 [rank:3] [train], epoch: 27/50, iter: 400/834, loss: 0.31129, top1: 0.63016, throughput: 1317.97 | 2022-04-11 00:01:59.027 [rank:7] [train], epoch: 27/50, iter: 400/834, loss: 0.30793, top1: 0.63344, throughput: 1318.11 | 2022-04-11 00:01:59.026 [rank:1] [train], epoch: 27/50, iter: 400/834, loss: 0.30891, top1: 0.63464, throughput: 1317.96 | 2022-04-11 00:01:59.026 [rank:2] [train], epoch: 27/50, iter: 400/834, loss: 0.30877, top1: 0.63526, throughput: 1318.11 | 2022-04-11 00:01:59.025 [rank:0] [train], epoch: 27/50, iter: 400/834, loss: 0.30945, top1: 0.62818, throughput: 1317.91 | 2022-04-11 00:01:59.026 [rank:4] [train], epoch: 27/50, iter: 500/834, loss: 0.31221, top1: 0.62792, throughput: 1314.10 | 2022-04-11 00:02:13.637 [rank:6] [train], epoch: 27/50, iter: 500/834, loss: 0.31130, top1: 0.63036, throughput: 1313.73 | 2022-04-11 00:02:13.639 [rank:2] [train], epoch: 27/50, iter: 500/834, loss: 0.30900, top1: 0.63401, throughput: 1313.82 | 2022-04-11 00:02:13.639 [rank:1] [train], epoch: 27/50, iter: 500/834, loss: 0.30921, top1: 0.63240, throughput: 1313.92 | 2022-04-11 00:02:13.639 [rank:3] [train], epoch: 27/50, iter: 500/834, loss: 0.31243, top1: 0.62880, throughput: 1313.70 | 2022-04-11 00:02:13.642 [rank:0] [train], epoch: 27/50, iter: 500/834, loss: 0.31177, top1: 0.62672, throughput: 1313.90 | 2022-04-11 00:02:13.639 [rank:5] [train], epoch: 27/50, iter: 500/834, loss: 0.31160, top1: 0.62797, throughput: 1313.64 | 2022-04-11 00:02:13.641 [rank:7] [train], epoch: 27/50, iter: 500/834, loss: 0.31157, top1: 0.62792, throughput: 1313.80 | 2022-04-11 00:02:13.640 [rank:4] [train], epoch: 27/50, iter: 600/834, loss: 0.31039, top1: 0.62885, throughput: 1315.57 | 2022-04-11 00:02:28.232 [rank:5] [train], epoch: 27/50, iter: 600/834, loss: 0.31227, top1: 0.62839, throughput: 
1315.94 | 2022-04-11 00:02:28.232 [rank:7] [train], epoch: 27/50, iter: 600/834, loss: 0.31020, top1: 0.62755, throughput: 1315.76 | 2022-04-11 00:02:28.233 [rank:3] [train], epoch: 27/50, iter: 600/834, loss: 0.31250, top1: 0.62505, throughput: 1315.81 | 2022-04-11 00:02:28.234 [rank:2] [train], epoch: 27/50, iter: 600/834, loss: 0.30915, top1: 0.63214, throughput: 1315.63 | 2022-04-11 00:02:28.233 [rank:6] [train], epoch: 27/50, iter: 600/834, loss: 0.30912, top1: 0.63115, throughput: 1315.55 | 2022-04-11 00:02:28.234 [rank:1] [train], epoch: 27/50, iter: 600/834, loss: 0.31406, top1: 0.62141, throughput: 1315.41 | 2022-04-11 00:02:28.235 [rank:0] [train], epoch: 27/50, iter: 600/834, loss: 0.31367, top1: 0.62318, throughput: 1315.49 | 2022-04-11 00:02:28.235 [rank:2] [train], epoch: 27/50, iter: 700/834, loss: 0.30983, top1: 0.63109, throughput: 1313.67 | 2022-04-11 00:02:42.848 [rank:6] [train], epoch: 27/50, iter: 700/834, loss: 0.31167, top1: 0.62240, throughput: 1313.98 | 2022-04-11 00:02:42.846 [rank:4] [train], epoch: 27/50, iter: 700/834, loss: 0.31120, top1: 0.63026, throughput: 1313.80 | 2022-04-11 00:02:42.846 [rank:5] [train], epoch: 27/50, iter: 700/834, loss: 0.30879, top1: 0.62990, throughput: 1313.66 | 2022-04-11 00:02:42.847 [rank:1] [train], epoch: 27/50, iter: 700/834, loss: 0.31095, top1: 0.62198, throughput: 1313.95 | 2022-04-11 00:02:42.848 [rank:3] [train], epoch: 27/50, iter: 700/834, loss: 0.30842, top1: 0.63250, throughput: 1313.61 | 2022-04-11 00:02:42.850 [rank:0] [train], epoch: 27/50, iter: 700/834, loss: 0.31076, top1: 0.63203, throughput: 1313.92 | 2022-04-11 00:02:42.848 [rank:7] [train], epoch: 27/50, iter: 700/834, loss: 0.30937, top1: 0.62766, throughput: 1313.63 | 2022-04-11 00:02:42.849 [rank:5] [train], epoch: 27/50, iter: 800/834, loss: 0.31274, top1: 0.62432, throughput: 1315.80 | 2022-04-11 00:02:57.439 [rank:7] [train], epoch: 27/50, iter: 800/834, loss: 0.30977, top1: 0.63167, throughput: 1315.75 | 2022-04-11 
00:02:57.441 [rank:6] [train], epoch: 27/50, iter: 800/834, loss: 0.30934, top1: 0.63094, throughput: 1315.57 | 2022-04-11 00:02:57.440 [rank:2] [train], epoch: 27/50, iter: 800/834, loss: 0.30909, top1: 0.62979, throughput: 1315.83 | 2022-04-11 00:02:57.440 [rank:4] [train], epoch: 27/50, iter: 800/834, loss: 0.31069, top1: 0.63161, throughput: 1315.49 | 2022-04-11 00:02:57.441 [rank:1] [train], epoch: 27/50, iter: 800/834, loss: 0.31250, top1: 0.62750, throughput: 1315.50 | 2022-04-11 00:02:57.443 [rank:0] [train], epoch: 27/50, iter: 800/834, loss: 0.31249, top1: 0.62885, throughput: 1315.63 | 2022-04-11 00:02:57.441 [rank:3] [train], epoch: 27/50, iter: 800/834, loss: 0.30958, top1: 0.62937, throughput: 1315.57 | 2022-04-11 00:02:57.445 [rank:4] [train], epoch: 27/50, iter: 834/834, loss: 0.30793, top1: 0.62500, throughput: 1312.75 | 2022-04-11 00:03:02.414 [rank:6] [train], epoch: 27/50, iter: 834/834, loss: 0.31001, top1: 0.63067, throughput: 1312.32 | 2022-04-11 00:03:02.415 [rank:5] [train], epoch: 27/50, iter: 834/834, loss: 0.30882, top1: 0.62745, throughput: 1312.06 | 2022-04-11 00:03:02.414 [rank:7] [train], epoch: 27/50, iter: 834/834, loss: 0.31338, top1: 0.62332, throughput: 1312.38 | 2022-04-11 00:03:02.415 [rank:0] [train], epoch: 27/50, iter: 834/834, loss: 0.30776, top1: 0.63434, throughput: 1311.30 | 2022-04-11 00:03:02.420 [rank:2] [train], epoch: 27/50, iter: 834/834, loss: 0.31365, top1: 0.62653, throughput: 1310.95 | 2022-04-11 00:03:02.420 [rank:1] [train], epoch: 27/50, iter: 834/834, loss: 0.30947, top1: 0.63741, throughput: 1311.44 | 2022-04-11 00:03:02.421 [rank:3] [train], epoch: 27/50, iter: 834/834, loss: 0.30828, top1: 0.63159, throughput: 1311.76 | 2022-04-11 00:03:02.421 [rank:7] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.64592, throughput: 591.02 | 2022-04-11 00:03:12.990 [rank:0] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.64592, throughput: 591.19 | 2022-04-11 00:03:12.991 [rank:2] [eval], 
epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.63024, throughput: 587.77 | 2022-04-11 00:03:13.053 [rank:3] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.63440, throughput: 584.35 | 2022-04-11 00:03:13.117 [rank:6] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.64640, throughput: 583.62 | 2022-04-11 00:03:13.124 [rank:4] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.63264, throughput: 581.70 | 2022-04-11 00:03:13.158 [rank:1] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.64416, throughput: 577.23 | 2022-04-11 00:03:13.248 [rank:5] [eval], epoch: 27/50, iter: 125/125, loss: 0.00000, top1: 0.62464, throughput: 571.30 | 2022-04-11 00:03:13.354 [rank:6] [train], epoch: 28/50, iter: 100/834, loss: 0.30391, top1: 0.64094, throughput: 1299.21 | 2022-04-11 00:03:27.902 [rank:5] [train], epoch: 28/50, iter: 100/834, loss: 0.30234, top1: 0.64422, throughput: 1319.80 | 2022-04-11 00:03:27.902 [rank:1] [train], epoch: 28/50, iter: 100/834, loss: 0.30686, top1: 0.63828, throughput: 1310.16 | 2022-04-11 00:03:27.903 [rank:4] [train], epoch: 28/50, iter: 100/834, loss: 0.30324, top1: 0.64552, throughput: 1302.11 | 2022-04-11 00:03:27.904 [rank:2] [train], epoch: 28/50, iter: 100/834, loss: 0.30583, top1: 0.63458, throughput: 1292.76 | 2022-04-11 00:03:27.905 [rank:3] [train], epoch: 28/50, iter: 100/834, loss: 0.30374, top1: 0.64479, throughput: 1298.38 | 2022-04-11 00:03:27.904 [rank:7] [train], epoch: 28/50, iter: 100/834, loss: 0.30791, top1: 0.63448, throughput: 1287.33 | 2022-04-11 00:03:27.905 [rank:0] [train], epoch: 28/50, iter: 100/834, loss: 0.30518, top1: 0.64089, throughput: 1287.38 | 2022-04-11 00:03:27.905 [rank:3] [train], epoch: 28/50, iter: 200/834, loss: 0.30588, top1: 0.64016, throughput: 1318.87 | 2022-04-11 00:03:42.462 [rank:4] [train], epoch: 28/50, iter: 200/834, loss: 0.30351, top1: 0.64714, throughput: 1318.77 | 2022-04-11 00:03:42.463 [rank:2] [train], epoch: 28/50, iter: 200/834, loss: 
0.30426, top1: 0.63917, throughput: 1318.94 | 2022-04-11 00:03:42.462 [rank:6] [train], epoch: 28/50, iter: 200/834, loss: 0.30617, top1: 0.63286, throughput: 1318.61 | 2022-04-11 00:03:42.463 [rank:1] [train], epoch: 28/50, iter: 200/834, loss: 0.30637, top1: 0.63984, throughput: 1318.72 | 2022-04-11 00:03:42.463 [rank:5] [train], epoch: 28/50, iter: 200/834, loss: 0.30397, top1: 0.64234, throughput: 1318.48 | 2022-04-11 00:03:42.464 [rank:0] [train], epoch: 28/50, iter: 200/834, loss: 0.30728, top1: 0.63677, throughput: 1318.93 | 2022-04-11 00:03:42.463 [rank:7] [train], epoch: 28/50, iter: 200/834, loss: 0.30652, top1: 0.63740, throughput: 1318.60 | 2022-04-11 00:03:42.466 [rank:5] [train], epoch: 28/50, iter: 300/834, loss: 0.30453, top1: 0.64161, throughput: 1306.04 | 2022-04-11 00:03:57.165 [rank:4] [train], epoch: 28/50, iter: 300/834, loss: 0.30742, top1: 0.63557, throughput: 1305.88 | 2022-04-11 00:03:57.165 [rank:1] [train], epoch: 28/50, iter: 300/834, loss: 0.30661, top1: 0.63682, throughput: 1305.78 | 2022-04-11 00:03:57.166 [rank:0] [train], epoch: 28/50, iter: 300/834, loss: 0.30593, top1: 0.63922, throughput: 1305.83 | 2022-04-11 00:03:57.166 [rank:2] [train], epoch: 28/50, iter: 300/834, loss: 0.30570, top1: 0.64010, throughput: 1305.76 | 2022-04-11 00:03:57.166 [rank:3] [train], epoch: 28/50, iter: 300/834, loss: 0.30693, top1: 0.63651, throughput: 1305.67 | 2022-04-11 00:03:57.167 [rank:7] [train], epoch: 28/50, iter: 300/834, loss: 0.30520, top1: 0.63917, throughput: 1306.07 | 2022-04-11 00:03:57.166 [rank:6] [train], epoch: 28/50, iter: 300/834, loss: 0.30539, top1: 0.64000, throughput: 1305.71 | 2022-04-11 00:03:57.167 [rank:6] [train], epoch: 28/50, iter: 400/834, loss: 0.30537, top1: 0.64151, throughput: 1316.03 | 2022-04-11 00:04:11.757 [rank:2] [train], epoch: 28/50, iter: 400/834, loss: 0.30505, top1: 0.64073, throughput: 1315.82 | 2022-04-11 00:04:11.758 [rank:4] [train], epoch: 28/50, iter: 400/834, loss: 0.30844, top1: 0.63016, 
throughput: 1315.98 | 2022-04-11 00:04:11.755 [rank:3] [train], epoch: 28/50, iter: 400/834, loss: 0.30442, top1: 0.64271, throughput: 1316.06 | 2022-04-11 00:04:11.756 [rank:7] [train], epoch: 28/50, iter: 400/834, loss: 0.30661, top1: 0.63797, throughput: 1316.01 | 2022-04-11 00:04:11.756 [rank:5] [train], epoch: 28/50, iter: 400/834, loss: 0.30702, top1: 0.63250, throughput: 1315.85 | 2022-04-11 00:04:11.757 [rank:1] [train], epoch: 28/50, iter: 400/834, loss: 0.30597, top1: 0.64005, throughput: 1315.83 | 2022-04-11 00:04:11.758 [rank:0] [train], epoch: 28/50, iter: 400/834, loss: 0.30618, top1: 0.63896, throughput: 1315.74 | 2022-04-11 00:04:11.759 [rank:7] [train], epoch: 28/50, iter: 500/834, loss: 0.30599, top1: 0.63880, throughput: 1314.27 | 2022-04-11 00:04:26.365 [rank:4] [train], epoch: 28/50, iter: 500/834, loss: 0.30651, top1: 0.63641, throughput: 1314.15 | 2022-04-11 00:04:26.365 [rank:2] [train], epoch: 28/50, iter: 500/834, loss: 0.30633, top1: 0.63714, throughput: 1314.30 | 2022-04-11 00:04:26.366 [rank:1] [train], epoch: 28/50, iter: 500/834, loss: 0.30765, top1: 0.63266, throughput: 1314.40 | 2022-04-11 00:04:26.365 [rank:3] [train], epoch: 28/50, iter: 500/834, loss: 0.30838, top1: 0.63563, throughput: 1314.24 | 2022-04-11 00:04:26.366 [rank:6] [train], epoch: 28/50, iter: 500/834, loss: 0.30802, top1: 0.63312, throughput: 1314.21 | 2022-04-11 00:04:26.366 [rank:5] [train], epoch: 28/50, iter: 500/834, loss: 0.30740, top1: 0.63635, throughput: 1314.21 | 2022-04-11 00:04:26.366 [rank:0] [train], epoch: 28/50, iter: 500/834, loss: 0.30478, top1: 0.64042, throughput: 1314.42 | 2022-04-11 00:04:26.366 [rank:5] [train], epoch: 28/50, iter: 600/834, loss: 0.30886, top1: 0.63255, throughput: 1313.64 | 2022-04-11 00:04:40.982 [rank:2] [train], epoch: 28/50, iter: 600/834, loss: 0.30655, top1: 0.63891, throughput: 1313.67 | 2022-04-11 00:04:40.982 [rank:6] [train], epoch: 28/50, iter: 600/834, loss: 0.30968, top1: 0.63427, throughput: 1313.52 | 
2022-04-11 00:04:40.983 [rank:0] [train], epoch: 28/50, iter: 600/834, loss: 0.30790, top1: 0.63687, throughput: 1313.60 | 2022-04-11 00:04:40.982 [rank:4] [train], epoch: 28/50, iter: 600/834, loss: 0.30631, top1: 0.63464, throughput: 1313.47 | 2022-04-11 00:04:40.983 [rank:1] [train], epoch: 28/50, iter: 600/834, loss: 0.30720, top1: 0.62990, throughput: 1313.24 | 2022-04-11 00:04:40.986 [rank:3] [train], epoch: 28/50, iter: 600/834, loss: 0.30622, top1: 0.63526, throughput: 1313.39 | 2022-04-11 00:04:40.984 [rank:7] [train], epoch: 28/50, iter: 600/834, loss: 0.30855, top1: 0.63547, throughput: 1313.46 | 2022-04-11 00:04:40.983 [rank:6] [train], epoch: 28/50, iter: 700/834, loss: 0.30707, top1: 0.63438, throughput: 1314.80 | 2022-04-11 00:04:55.586 [rank:3] [train], epoch: 28/50, iter: 700/834, loss: 0.30759, top1: 0.63536, throughput: 1314.72 | 2022-04-11 00:04:55.588 [rank:1] [train], epoch: 28/50, iter: 700/834, loss: 0.30701, top1: 0.63552, throughput: 1314.99 | 2022-04-11 00:04:55.587 [rank:5] [train], epoch: 28/50, iter: 700/834, loss: 0.30542, top1: 0.63505, throughput: 1314.55 | 2022-04-11 00:04:55.588 [rank:7] [train], epoch: 28/50, iter: 700/834, loss: 0.30945, top1: 0.63313, throughput: 1314.69[rank:4] [train], epoch: 28/50, iter: 700/834, loss: 0.30605, top1: 0.63724, throughput: 1314.74 | 2022-04-11 00:04:55.587 | 2022-04-11 00:04:55.587 [rank:2] [train], epoch: 28/50, iter: 700/834, loss: 0.30548, top1: 0.63755, throughput: 1314.55 | 2022-04-11 00:04:55.587 [rank:0] [train], epoch: 28/50, iter: 700/834, loss: 0.30867, top1: 0.63052, throughput: 1314.47 | 2022-04-11 00:04:55.589 [rank:2] [train], epoch: 28/50, iter: 800/834, loss: 0.30880, top1: 0.63266, throughput: 1311.78 | 2022-04-11 00:05:10.224 [rank:6] [train], epoch: 28/50, iter: 800/834, loss: 0.30929, top1: 0.63208, throughput: 1311.71 | 2022-04-11 00:05:10.224 [rank:5] [train], epoch: 28/50, iter: 800/834, loss: 0.30602, top1: 0.63547, throughput: 1311.83 | 2022-04-11 00:05:10.224 [rank:1] 
[train], epoch: 28/50, iter: 800/834, loss: 0.31026, top1: 0.63104, throughput: 1311.61 | 2022-04-11 00:05:10.225 [rank:4] [train], epoch: 28/50, iter: 800/834, loss: 0.30963, top1: 0.62714, throughput: 1311.60 | 2022-04-11 00:05:10.225 [rank:3] [train], epoch: 28/50, iter: 800/834, loss: 0.30547, top1: 0.63932, throughput: 1311.65 | 2022-04-11 00:05:10.226 [rank:7] [train], epoch: 28/50, iter: 800/834, loss: 0.30781, top1: 0.63755, throughput: 1311.72 | 2022-04-11 00:05:10.224 [rank:0] [train], epoch: 28/50, iter: 800/834, loss: 0.30532, top1: 0.64286, throughput: 1311.59 | 2022-04-11 00:05:10.228 [rank:5] [train], epoch: 28/50, iter: 834/834, loss: 0.30365, top1: 0.64782, throughput: 1315.60 | 2022-04-11 00:05:15.186 [rank:4] [train], epoch: 28/50, iter: 834/834, loss: 0.30384, top1: 0.63725, throughput: 1315.92 | 2022-04-11 00:05:15.186 [rank:2] [train], epoch: 28/50, iter: 834/834, loss: 0.30789, top1: 0.63143, throughput: 1315.33 | 2022-04-11 00:05:15.187 [rank:1] [train], epoch: 28/50, iter: 834/834, loss: 0.30424, top1: 0.64262, throughput: 1315.38 | 2022-04-11 00:05:15.188 [rank:0] [train], epoch: 28/50, iter: 834/834, loss: 0.30272, top1: 0.63879, throughput: 1316.12 | 2022-04-11 00:05:15.188 [rank:7] [train], epoch: 28/50, iter: 834/834, loss: 0.30781, top1: 0.62209, throughput: 1315.22 | 2022-04-11 00:05:15.188 [rank:6] [train], epoch: 28/50, iter: 834/834, loss: 0.30357, top1: 0.64246, throughput: 1314.90 | 2022-04-11 00:05:15.188 [rank:3] [train], epoch: 28/50, iter: 834/834, loss: 0.30754, top1: 0.63833, throughput: 1315.50 | 2022-04-11 00:05:15.189 [rank:7] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.64352, throughput: 574.81 | 2022-04-11 00:05:26.061 [rank:0] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.65456, throughput: 574.41 | 2022-04-11 00:05:26.068 [rank:2] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.64880, throughput: 573.72 | 2022-04-11 00:05:26.081 [rank:1] [eval], epoch: 28/50, iter: 125/125, 
loss: 0.00000, top1: 0.65552, throughput: 570.76 | 2022-04-11 00:05:26.138 [rank:4] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.64224, throughput: 569.48 | 2022-04-11 00:05:26.161 [rank:3] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.64800, throughput: 567.47 | 2022-04-11 00:05:26.202 [rank:6] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.64928, throughput: 567.09 | 2022-04-11 00:05:26.209 [rank:5] [eval], epoch: 28/50, iter: 125/125, loss: 0.00000, top1: 0.64000, throughput: 556.74 | 2022-04-11 00:05:26.412 [rank:1] [train], epoch: 29/50, iter: 100/834, loss: 0.30379, top1: 0.64870, throughput: 1295.29 | 2022-04-11 00:05:40.961 [rank:6] [train], epoch: 29/50, iter: 100/834, loss: 0.29924, top1: 0.65250, throughput: 1301.60 | 2022-04-11 00:05:40.961 [rank:4] [train], epoch: 29/50, iter: 100/834, loss: 0.30036, top1: 0.65182, throughput: 1297.22 | 2022-04-11 00:05:40.962 [rank:5] [train], epoch: 29/50, iter: 100/834, loss: 0.29910, top1: 0.65130, throughput: 1319.55 | 2022-04-11 00:05:40.962 [rank:0] [train], epoch: 29/50, iter: 100/834, loss: 0.30517, top1: 0.63896, throughput: 1289.22 | 2022-04-11 00:05:40.961 [rank:7] [train], epoch: 29/50, iter: 100/834, loss: 0.30206, top1: 0.64682, throughput: 1288.53 | 2022-04-11 00:05:40.961 [rank:2] [train], epoch: 29/50, iter: 100/834, loss: 0.30101, top1: 0.64740, throughput: 1289.95 | 2022-04-11 00:05:40.965 [rank:3] [train], epoch: 29/50, iter: 100/834, loss: 0.29997, top1: 0.64995, throughput: 1300.75 | 2022-04-11 00:05:40.963 [rank:5] [train], epoch: 29/50, iter: 200/834, loss: 0.30332, top1: 0.64818, throughput: 1315.60 | 2022-04-11 00:05:55.556 [rank:0] [train], epoch: 29/50, iter: 200/834, loss: 0.30058, top1: 0.65036, throughput: 1315.40 | 2022-04-11 00:05:55.557 [rank:7] [train], epoch: 29/50, iter: 200/834, loss: 0.30110, top1: 0.64802, throughput: 1315.39 | 2022-04-11 00:05:55.558 [rank:6] [train], epoch: 29/50, iter: 200/834, loss: 0.30194, top1: 0.64130, 
throughput: 1315.18 | 2022-04-11 00:05:55.559 [rank:1] [train], epoch: 29/50, iter: 200/834, loss: 0.30575, top1: 0.64042, throughput: 1315.17 | 2022-04-11 00:05:55.560 [rank:4] [train], epoch: 29/50, iter: 200/834, loss: 0.30197, top1: 0.64234, throughput: 1315.43 | 2022-04-11 00:05:55.558 [rank:2] [train], epoch: 29/50, iter: 200/834, loss: 0.30107, top1: 0.64922, throughput: 1315.56 | 2022-04-11 00:05:55.560 [rank:3] [train], epoch: 29/50, iter: 200/834, loss: 0.30211, top1: 0.64458, throughput: 1315.37 | 2022-04-11 00:05:55.560 [rank:2] [train], epoch: 29/50, iter: 300/834, loss: 0.30445, top1: 0.64354, throughput: 1314.71 | 2022-04-11 00:06:10.164 [rank:4] [train], epoch: 29/50, iter: 300/834, loss: 0.30247, top1: 0.64490, throughput: 1314.74 | 2022-04-11 00:06:10.161 [rank:5] [train], epoch: 29/50, iter: 300/834, loss: 0.30476, top1: 0.63891, throughput: 1314.55 | 2022-04-11 00:06:10.162 [rank:1] [train], epoch: 29/50, iter: 300/834, loss: 0.30404, top1: 0.64599, throughput: 1314.73 | 2022-04-11 00:06:10.164 [rank:0] [train], epoch: 29/50, iter: 300/834, loss: 0.30198, top1: 0.64865, throughput: 1314.55 | 2022-04-11 00:06:10.163 [rank:3] [train], epoch: 29/50, iter: 300/834, loss: 0.30504, top1: 0.64057, throughput: 1314.68 | 2022-04-11 00:06:10.164 [rank:6] [train], epoch: 29/50, iter: 300/834, loss: 0.30244, top1: 0.64479, throughput: 1314.61 | 2022-04-11 00:06:10.164 [rank:7] [train], epoch: 29/50, iter: 300/834, loss: 0.30181, top1: 0.64646, throughput: 1314.41 | 2022-04-11 00:06:10.165 [rank:6] [train], epoch: 29/50, iter: 400/834, loss: 0.30093, top1: 0.64990, throughput: 1314.81 | 2022-04-11 00:06:24.767 [rank:5] [train], epoch: 29/50, iter: 400/834, loss: 0.30488, top1: 0.64354, throughput: 1314.63 | 2022-04-11 00:06:24.767 [rank:4] [train], epoch: 29/50, iter: 400/834, loss: 0.30559, top1: 0.63896, throughput: 1314.53 | 2022-04-11 00:06:24.767 [rank:2] [train], epoch: 29/50, iter: 400/834, loss: 0.30590, top1: 0.63641, throughput: 1314.69 | 
2022-04-11 00:06:24.768 [rank:3] [train], epoch: 29/50, iter: 400/834, loss: 0.30360, top1: 0.64359, throughput: 1314.53 | 2022-04-11 00:06:24.770 [rank:7] [train], epoch: 29/50, iter: 400/834, loss: 0.30316, top1: 0.64396, throughput: 1314.84 | 2022-04-11 00:06:24.768 [rank:1] [train], epoch: 29/50, iter: 400/834, loss: 0.30302, top1: 0.64578, throughput: 1314.46 | 2022-04-11 00:06:24.770 [rank:0] [train], epoch: 29/50, iter: 400/834, loss: 0.30413, top1: 0.64276, throughput: 1314.55 | 2022-04-11 00:06:24.769 [rank:2] [train], epoch: 29/50, iter: 500/834, loss: 0.30501, top1: 0.63990, throughput: 1314.62 | 2022-04-11 00:06:39.373 [rank:6] [train], epoch: 29/50, iter: 500/834, loss: 0.30138, top1: 0.64339, throughput: 1314.61 | 2022-04-11 00:06:39.372 [rank:0] [train], epoch: 29/50, iter: 500/834, loss: 0.30197, top1: 0.64266, throughput: 1314.69 | 2022-04-11 00:06:39.373 [rank:4] [train], epoch: 29/50, iter: 500/834, loss: 0.30181, top1: 0.64385, throughput: 1314.57 | 2022-04-11 00:06:39.373 [rank:5] [train], epoch: 29/50, iter: 500/834, loss: 0.30058, top1: 0.64844, throughput: 1314.43[rank:1] [train], epoch: 29/50, iter: 500/834, loss: 0.30054, top1: 0.64375, throughput: 1314.71 | 2022-04-11 00:06:39.374 | 2022-04-11 00:06:39.374 [rank:3] [train], epoch: 29/50, iter: 500/834, loss: 0.30205, top1: 0.64401, throughput: 1314.59 | 2022-04-11 00:06:39.375 [rank:7] [train], epoch: 29/50, iter: 500/834, loss: 0.30361, top1: 0.64609, throughput: 1314.36 | 2022-04-11 00:06:39.376 [rank:5] [train], epoch: 29/50, iter: 600/834, loss: 0.30403, top1: 0.64615, throughput: 1314.93 | 2022-04-11 00:06:53.976 [rank:6] [train], epoch: 29/50, iter: 600/834, loss: 0.30482, top1: 0.64453, throughput: 1314.81 | 2022-04-11 00:06:53.975 [rank:4] [train], epoch: 29/50, iter: 600/834, loss: 0.30587, top1: 0.63375, throughput: 1314.80 | 2022-04-11 00:06:53.976 [rank:1] [train], epoch: 29/50, iter: 600/834, loss: 0.30367, top1: 0.64062, throughput: 1314.79 | 2022-04-11 00:06:53.977 [rank:0] 
[train], epoch: 29/50, iter: 600/834, loss: 0.30245, top1: 0.64797, throughput: 1314.67 | 2022-04-11 00:06:53.977 [rank:2] [train], epoch: 29/50, iter: 600/834, loss: 0.30282, top1: 0.64625, throughput: 1314.52 | 2022-04-11 00:06:53.979 [rank:3] [train], epoch: 29/50, iter: 600/834, loss: 0.30509, top1: 0.63859, throughput: 1314.60 | 2022-04-11 00:06:53.981 [rank:7] [train], epoch: 29/50, iter: 600/834, loss: 0.30453, top1: 0.64229, throughput: 1314.77 | 2022-04-11 00:06:53.979 [rank:2] [train], epoch: 29/50, iter: 700/834, loss: 0.30329, top1: 0.64760, throughput: 1315.40 | 2022-04-11 00:07:08.575 [rank:3] [train], epoch: 29/50, iter: 700/834, loss: 0.30190, top1: 0.64729, throughput: 1315.41 | 2022-04-11 00:07:08.577 [rank:6] [train], epoch: 29/50, iter: 700/834, loss: 0.30213, top1: 0.64432, throughput: 1315.18 | 2022-04-11 00:07:08.574 [rank:1] [train], epoch: 29/50, iter: 700/834, loss: 0.30419, top1: 0.64557, throughput: 1315.23 | 2022-04-11 00:07:08.576 [rank:4] [train], epoch: 29/50, iter: 700/834, loss: 0.30525, top1: 0.64042, throughput: 1315.07 | 2022-04-11 00:07:08.576 [rank:0] [train], epoch: 29/50, iter: 700/834, loss: 0.30509, top1: 0.63776, throughput: 1315.29 | 2022-04-11 00:07:08.575 [rank:7] [train], epoch: 29/50, iter: 700/834, loss: 0.30273, top1: 0.64740, throughput: 1315.20 | 2022-04-11 00:07:08.578 [rank:5] [train], epoch: 29/50, iter: 700/834, loss: 0.30269, top1: 0.63839, throughput: 1315.00 | 2022-04-11 00:07:08.576 [rank:2] [train], epoch: 29/50, iter: 800/834, loss: 0.30325, top1: 0.64370, throughput: 1314.96 | 2022-04-11 00:07:23.177 [rank:5] [train], epoch: 29/50, iter: 800/834, loss: 0.30408, top1: 0.63849, throughput: 1315.11 | 2022-04-11 00:07:23.176 [rank:6] [train], epoch: 29/50, iter: 800/834, loss: 0.30727, top1: 0.63344, throughput: 1314.82 | 2022-04-11 00:07:23.177 [rank:1] [train], epoch: 29/50, iter: 800/834, loss: 0.30352, top1: 0.64255, throughput: 1314.79 | 2022-04-11 00:07:23.179 [rank:4] [train], epoch: 29/50, iter: 
800/834, loss: 0.30327, top1: 0.63911, throughput: 1314.87 | 2022-04-11 00:07:23.178 [rank:3] [train], epoch: 29/50, iter: 800/834, loss: 0.30520, top1: 0.64000, throughput: 1314.77 | 2022-04-11 00:07:23.180 [rank:7] [train], epoch: 29/50, iter: 800/834, loss: 0.30660, top1: 0.63615, throughput: 1314.92 | 2022-04-11 00:07:23.179 [rank:0] [train], epoch: 29/50, iter: 800/834, loss: 0.30519, top1: 0.63917, throughput: 1314.69 | 2022-04-11 00:07:23.179 [rank:2] [train], epoch: 29/50, iter: 834/834, loss: 0.30483, top1: 0.64124, throughput: 1310.09 | 2022-04-11 00:07:28.159 [rank:6] [train], epoch: 29/50, iter: 834/834, loss: 0.30670, top1: 0.63664, throughput: 1310.29 | 2022-04-11 00:07:28.159 [rank:4] [train], epoch: 29/50, iter: 834/834, loss: 0.29959, top1: 0.65028, throughput: 1310.54 | 2022-04-11 00:07:28.159 [rank:1] [train], epoch: 29/50, iter: 834/834, loss: 0.30347, top1: 0.63940, throughput: 1310.00 | 2022-04-11 00:07:28.162 [rank:5] [train], epoch: 29/50, iter: 834/834, loss: 0.30542, top1: 0.63128, throughput: 1309.14 | 2022-04-11 00:07:28.162 [rank:0] [train], epoch: 29/50, iter: 834/834, loss: 0.30260, top1: 0.64844, throughput: 1309.89 | 2022-04-11 00:07:28.163 [rank:3] [train], epoch: 29/50, iter: 834/834, loss: 0.30452, top1: 0.64277, throughput: 1309.86[rank:7] [train], epoch: 29/50, iter: 834/834, loss: 0.30700, top1: 0.63358, throughput: 1309.65 | 2022-04-11 00:07:28.164| 2022-04-11 00:07:28.164 [rank:0] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.64912, throughput: 581.09 | 2022-04-11 00:07:38.918 [rank:7] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.64480, throughput: 581.01 | 2022-04-11 00:07:38.921 [rank:2] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.64080, throughput: 575.87 | 2022-04-11 00:07:39.013 [rank:6] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.64256, throughput: 572.93 | 2022-04-11 00:07:39.068 [rank:3] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.63760, 
throughput: 573.18 | 2022-04-11 00:07:39.068 [rank:5] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.63472, throughput: 567.70 | 2022-04-11 00:07:39.172 [rank:1] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.64864, throughput: 565.23 | 2022-04-11 00:07:39.219 [rank:4] [eval], epoch: 29/50, iter: 125/125, loss: 0.00000, top1: 0.63952, throughput: 562.37 | 2022-04-11 00:07:39.273 [rank:4] [train], epoch: 30/50, iter: 100/834, loss: 0.30017, top1: 0.64885, throughput: 1311.00 | 2022-04-11 00:07:53.918 [rank:6] [train], epoch: 30/50, iter: 100/834, loss: 0.29851, top1: 0.65062, throughput: 1292.85 | 2022-04-11 00:07:53.919 [rank:1] [train], epoch: 30/50, iter: 100/834, loss: 0.29629, top1: 0.65823, throughput: 1306.04 | 2022-04-11 00:07:53.920 [rank:3] [train], epoch: 30/50, iter: 100/834, loss: 0.29919, top1: 0.64927, throughput: 1292.60 | 2022-04-11 00:07:53.922 [rank:5] [train], epoch: 30/50, iter: 100/834, loss: 0.30010, top1: 0.65245, throughput: 1301.81 | 2022-04-11 00:07:53.921 [rank:2] [train], epoch: 30/50, iter: 100/834, loss: 0.29851, top1: 0.65281, throughput: 1287.83 | 2022-04-11 00:07:53.921 [rank:0] [train], epoch: 30/50, iter: 100/834, loss: 0.29586, top1: 0.65547, throughput: 1279.88 | 2022-04-11 00:07:53.920 [rank:7] [train], epoch: 30/50, iter: 100/834, loss: 0.29802, top1: 0.65719, throughput: 1280.01 | 2022-04-11 00:07:53.921 [rank:4] [train], epoch: 30/50, iter: 200/834, loss: 0.29817, top1: 0.65474, throughput: 1316.04 | 2022-04-11 00:08:08.507 [rank:6] [train], epoch: 30/50, iter: 200/834, loss: 0.29876, top1: 0.65005, throughput: 1316.06 | 2022-04-11 00:08:08.508 [rank:7] [train], epoch: 30/50, iter: 200/834, loss: 0.30070, top1: 0.64708, throughput: 1316.21 | 2022-04-11 00:08:08.508 [rank:2] [train], epoch: 30/50, iter: 200/834, loss: 0.29720, top1: 0.65672, throughput: 1316.18 | 2022-04-11 00:08:08.509 [rank:1] [train], epoch: 30/50, iter: 200/834, loss: 0.29963, top1: 0.65214, throughput: 1316.04 | 2022-04-11 
00:08:08.510 [rank:0] [train], epoch: 30/50, iter: 200/834, loss: 0.29844, top1: 0.65302, throughput: 1316.07 | 2022-04-11 00:08:08.509 [rank:3] [train], epoch: 30/50, iter: 200/834, loss: 0.29761, top1: 0.65495, throughput: 1316.00 | 2022-04-11 00:08:08.511 [rank:5] [train], epoch: 30/50, iter: 200/834, loss: 0.29777, top1: 0.65370, throughput: 1316.11 | 2022-04-11 00:08:08.509 [rank:5] [train], epoch: 30/50, iter: 300/834, loss: 0.29904, top1: 0.64984, throughput: 1315.68 | 2022-04-11 00:08:23.102 [rank:4] [train], epoch: 30/50, iter: 300/834, loss: 0.30152, top1: 0.64745, throughput: 1315.48 | 2022-04-11 00:08:23.103 [rank:1] [train], epoch: 30/50, iter: 300/834, loss: 0.29981, top1: 0.65057, throughput: 1315.63 | 2022-04-11 00:08:23.103 [rank:3] [train], epoch: 30/50, iter: 300/834, loss: 0.29947, top1: 0.65172, throughput: 1315.62 | 2022-04-11 00:08:23.105 [rank:6] [train], epoch: 30/50, iter: 300/834, loss: 0.29926, top1: 0.64828, throughput: 1315.40 | 2022-04-11 00:08:23.104 [rank:0] [train], epoch: 30/50, iter: 300/834, loss: 0.29867, top1: 0.65323, throughput: 1315.46 | 2022-04-11 00:08:23.104 [rank:2] [train], epoch: 30/50, iter: 300/834, loss: 0.30042, top1: 0.64724, throughput: 1315.34 | 2022-04-11 00:08:23.106 [rank:7] [train], epoch: 30/50, iter: 300/834, loss: 0.30209, top1: 0.64599, throughput: 1315.25 | 2022-04-11 00:08:23.106 [rank:6] [train], epoch: 30/50, iter: 400/834, loss: 0.29747, top1: 0.65740, throughput: 1313.92 | 2022-04-11 00:08:37.717 [rank:4] [train], epoch: 30/50, iter: 400/834, loss: 0.30060, top1: 0.64599, throughput: 1313.99 | 2022-04-11 00:08:37.715 [rank:3] [train], epoch: 30/50, iter: 400/834, loss: 0.30079, top1: 0.65391, throughput: 1314.01 | 2022-04-11 00:08:37.717 [rank:1] [train], epoch: 30/50, iter: 400/834, loss: 0.30395, top1: 0.64260, throughput: 1313.72 | 2022-04-11 00:08:37.718 [rank:2] [train], epoch: 30/50, iter: 400/834, loss: 0.29853, top1: 0.65135, throughput: 1314.03 | 2022-04-11 00:08:37.718 [rank:5] [train], 
epoch: 30/50, iter: 400/834, loss: 0.29951, top1: 0.65229, throughput: 1313.67 | 2022-04-11 00:08:37.718 [rank:0] [train], epoch: 30/50, iter: 400/834, loss: 0.30046, top1: 0.64974, throughput: 1313.95 | 2022-04-11 00:08:37.717 [rank:7] [train], epoch: 30/50, iter: 400/834, loss: 0.29906, top1: 0.65188, throughput: 1313.92 | 2022-04-11 00:08:37.719 [rank:2] [train], epoch: 30/50, iter: 500/834, loss: 0.30227, top1: 0.64536, throughput: 1314.79 | 2022-04-11 00:08:52.321 [rank:5] [train], epoch: 30/50, iter: 500/834, loss: 0.30111, top1: 0.64703, throughput: 1314.84 | 2022-04-11 00:08:52.320 [rank:1] [train], epoch: 30/50, iter: 500/834, loss: 0.30031, top1: 0.64354, throughput: 1314.85 | 2022-04-11 00:08:52.321 [rank:4] [train], epoch: 30/50, iter: 500/834, loss: 0.29867, top1: 0.65005, throughput: 1314.51 | 2022-04-11 00:08:52.321 [rank:0] [train], epoch: 30/50, iter: 500/834, loss: 0.30123, top1: 0.65000, throughput: 1314.67 | 2022-04-11 00:08:52.321 [rank:6] [train], epoch: 30/50, iter: 500/834, loss: 0.30114, top1: 0.65026, throughput: 1314.69 | 2022-04-11 00:08:52.321 [rank:7] [train], epoch: 30/50, iter: 500/834, loss: 0.29715, top1: 0.65589, throughput: 1314.87 | 2022-04-11 00:08:52.321 [rank:3] [train], epoch: 30/50, iter: 500/834, loss: 0.29977, top1: 0.64839, throughput: 1314.34 | 2022-04-11 00:08:52.325 [rank:4] [train], epoch: 30/50, iter: 600/834, loss: 0.30065, top1: 0.64927, throughput: 1314.98 | 2022-04-11 00:09:06.922 [rank:6] [train], epoch: 30/50, iter: 600/834, loss: 0.30066, top1: 0.64750, throughput: 1314.89 | 2022-04-11 00:09:06.923 [rank:5] [train], epoch: 30/50, iter: 600/834, loss: 0.30106, top1: 0.65156, throughput: 1314.87 | 2022-04-11 00:09:06.923 [rank:7] [train], epoch: 30/50, iter: 600/834, loss: 0.29759, top1: 0.65693, throughput: 1314.93 | 2022-04-11 00:09:06.923 [rank:2] [train], epoch: 30/50, iter: 600/834, loss: 0.30004, top1: 0.64958, throughput: 1314.86 | 2022-04-11 00:09:06.923 [rank:1] [train], epoch: 30/50, iter: 600/834,
loss: 0.30109, top1: 0.64797, throughput: 1314.84 | 2022-04-11 00:09:06.923 [rank:3] [train], epoch: 30/50, iter: 600/834, loss: 0.29907, top1: 0.65177, throughput: 1315.19 | 2022-04-11 00:09:06.924 [rank:0] [train], epoch: 30/50, iter: 600/834, loss: 0.29937, top1: 0.65292, throughput: 1314.86 | 2022-04-11 00:09:06.924 [rank:5] [train], epoch: 30/50, iter: 700/834, loss: 0.30109, top1: 0.64885, throughput: 1313.83 | 2022-04-11 00:09:21.536 [rank:2] [train], epoch: 30/50, iter: 700/834, loss: 0.30082, top1: 0.64870, throughput: 1313.64 | 2022-04-11 00:09:21.539 [rank:6] [train], epoch: 30/50, iter: 700/834, loss: 0.29964, top1: 0.65005, throughput: 1313.79 | 2022-04-11 00:09:21.537 [rank:4] [train], epoch: 30/50, iter: 700/834, loss: 0.30067, top1: 0.64927, throughput: 1313.60 | 2022-04-11 00:09:21.538 [rank:1] [train], epoch: 30/50, iter: 700/834, loss: 0.29855, top1: 0.65385, throughput: 1313.61 | 2022-04-11 00:09:21.540 [rank:3] [train], epoch: 30/50, iter: 700/834, loss: 0.29858, top1: 0.64859, throughput: 1313.66 | 2022-04-11 00:09:21.539 [rank:0] [train], epoch: 30/50, iter: 700/834, loss: 0.30066, top1: 0.64906, throughput: 1313.67 | 2022-04-11 00:09:21.539 [rank:7] [train], epoch: 30/50, iter: 700/834, loss: 0.29783, top1: 0.65203, throughput: 1313.60 | 2022-04-11 00:09:21.539 [rank:6] [train], epoch: 30/50, iter: 800/834, loss: 0.30092, top1: 0.64932, throughput: 1316.08 | 2022-04-11 00:09:36.126 [rank:1] [train], epoch: 30/50, iter: 800/834, loss: 0.30127, top1: 0.64719, throughput: 1316.32 | 2022-04-11 00:09:36.126 [rank:3] [train], epoch: 30/50, iter: 800/834, loss: 0.29908, top1: 0.65005, throughput: 1316.15 | 2022-04-11 00:09:36.127 [rank:5] [train], epoch: 30/50, iter: 800/834, loss: 0.30231, top1: 0.64828, throughput: 1316.05 | 2022-04-11 00:09:36.125 [rank:4] [train], epoch: 30/50, iter: 800/834, loss: 0.29965, top1: 0.65104, throughput: 1316.22 | 2022-04-11 00:09:36.126 [rank:2] [train], epoch: 30/50, iter: 800/834, loss: 0.30142, top1: 0.64526, 
throughput: 1316.25 | 2022-04-11 00:09:36.126 [rank:7] [train], epoch: 30/50, iter: 800/834, loss: 0.30073, top1: 0.64969, throughput: 1316.03[rank:0] [train], epoch: 30/50, iter: 800/834, loss: 0.29989, top1: 0.64813, throughput: 1316.18 | 2022-04-11 00:09:36.129 | 2022-04-11 00:09:36.127 [rank:6] [train], epoch: 30/50, iter: 834/834, loss: 0.30535, top1: 0.64047, throughput: 1309.11 | 2022-04-11 00:09:41.113 [rank:4] [train], epoch: 30/50, iter: 834/834, loss: 0.30259, top1: 0.63879, throughput: 1308.75 | 2022-04-11 00:09:41.113 [rank:5] [train], epoch: 30/50, iter: 834/834, loss: 0.29802, top1: 0.65456, throughput: 1308.43 | 2022-04-11 00:09:41.115 [rank:0] [train], epoch: 30/50, iter: 834/834, loss: 0.29916, top1: 0.64767, throughput: 1308.78 | 2022-04-11 00:09:41.115 [rank:2] [train], epoch: 30/50, iter: 834/834, loss: 0.29936, top1: 0.65211, throughput: 1308.53 | 2022-04-11 00:09:41.115 [rank:1] [train], epoch: 30/50, iter: 834/834, loss: 0.30330, top1: 0.63925, throughput: 1308.35 | 2022-04-11 00:09:41.115 [rank:7] [train], epoch: 30/50, iter: 834/834, loss: 0.30297, top1: 0.64323, throughput: 1309.23 | 2022-04-11 00:09:41.115 [rank:3] [train], epoch: 30/50, iter: 834/834, loss: 0.29683, top1: 0.65242, throughput: 1308.38 | 2022-04-11 00:09:41.117 [rank:7] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.66912, throughput: 588.45 | 2022-04-11 00:09:51.736 [rank:0] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.66192, throughput: 587.70 | 2022-04-11 00:09:51.749 [rank:2] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.64720, throughput: 584.77 | 2022-04-11 00:09:51.803 [rank:3] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.65072, throughput: 583.83 | 2022-04-11 00:09:51.822 [rank:6] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.66000, throughput: 581.38 | 2022-04-11 00:09:51.863 [rank:4] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.65984, throughput: 580.41 | 2022-04-11 
00:09:51.882 [rank:5] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.64992, throughput: 574.46 | 2022-04-11 00:09:51.994 [rank:1] [eval], epoch: 30/50, iter: 125/125, loss: 0.00000, top1: 0.66368, throughput: 570.42 | 2022-04-11 00:09:52.072 [rank:6] [train], epoch: 31/50, iter: 100/834, loss: 0.29310, top1: 0.66198, throughput: 1299.99 | 2022-04-11 00:10:06.632 [rank:3] [train], epoch: 31/50, iter: 100/834, loss: 0.29527, top1: 0.66151, throughput: 1296.28 | 2022-04-11 00:10:06.634 [rank:4] [train], epoch: 31/50, iter: 100/834, loss: 0.29377, top1: 0.66469, throughput: 1301.48 | 2022-04-11 00:10:06.634 [rank:7] [train], epoch: 31/50, iter: 100/834, loss: 0.29235, top1: 0.66922, throughput: 1288.84 | 2022-04-11 00:10:06.633 [rank:5] [train], epoch: 31/50, iter: 100/834, loss: 0.29344, top1: 0.66328, throughput: 1311.50 | 2022-04-11 00:10:06.634 [rank:1] [train], epoch: 31/50, iter: 100/834, loss: 0.29056, top1: 0.66755, throughput: 1318.52 | 2022-04-11 00:10:06.634 [rank:2] [train], epoch: 31/50, iter: 100/834, loss: 0.29219, top1: 0.66760, throughput: 1294.53 | 2022-04-11 00:10:06.634 [rank:0] [train], epoch: 31/50, iter: 100/834, loss: 0.29291, top1: 0.66047, throughput: 1289.76 | 2022-04-11 00:10:06.636 [rank:5] [train], epoch: 31/50, iter: 200/834, loss: 0.29570, top1: 0.65807, throughput: 1318.80 | 2022-04-11 00:10:21.193 [rank:1] [train], epoch: 31/50, iter: 200/834, loss: 0.29260, top1: 0.66443, throughput: 1318.52 | 2022-04-11 00:10:21.196 [rank:3] [train], epoch: 31/50, iter: 200/834, loss: 0.29298, top1: 0.66443, throughput: 1318.36 | 2022-04-11 00:10:21.197 [rank:6] [train], epoch: 31/50, iter: 200/834, loss: 0.29681, top1: 0.65589, throughput: 1318.34 | 2022-04-11 00:10:21.196 [rank:4] [train], epoch: 31/50, iter: 200/834, loss: 0.29353, top1: 0.65974, throughput: 1318.39 | 2022-04-11 00:10:21.197 [rank:0] [train], epoch: 31/50, iter: 200/834, loss: 0.29563, top1: 0.65964, throughput: 1318.52 | 2022-04-11 00:10:21.198 [rank:2] [train],
epoch: 31/50, iter: 200/834, loss: 0.29304, top1: 0.66568, throughput: 1318.55 | 2022-04-11 00:10:21.196 [rank:7] [train], epoch: 31/50, iter: 200/834, loss: 0.29805, top1: 0.64859, throughput: 1318.26 | 2022-04-11 00:10:21.197 [rank:4] [train], epoch: 31/50, iter: 300/834, loss: 0.29557, top1: 0.65953, throughput: 1317.87 | 2022-04-11 00:10:35.766 [rank:1] [train], epoch: 31/50, iter: 300/834, loss: 0.29588, top1: 0.65938, throughput: 1317.67 | 2022-04-11 00:10:35.767 [rank:2] [train], epoch: 31/50, iter: 300/834, loss: 0.29586, top1: 0.65672, throughput: 1317.67 | 2022-04-11 00:10:35.767 [rank:5] [train], epoch: 31/50, iter: 300/834, loss: 0.29564, top1: 0.66224, throughput: 1317.37 | 2022-04-11 00:10:35.767 [rank:7] [train], epoch: 31/50, iter: 300/834, loss: 0.29580, top1: 0.66068, throughput: 1317.76 | 2022-04-11 00:10:35.768 [rank:6] [train], epoch: 31/50, iter: 300/834, loss: 0.29719, top1: 0.65349, throughput: 1317.57 | 2022-04-11 00:10:35.768 [rank:0] [train], epoch: 31/50, iter: 300/834, loss: 0.29436, top1: 0.65917, throughput: 1317.57 | 2022-04-11 00:10:35.770 [rank:3] [train], epoch: 31/50, iter: 300/834, loss: 0.29564, top1: 0.66115, throughput: 1317.38 | 2022-04-11 00:10:35.772 [rank:2] [train], epoch: 31/50, iter: 400/834, loss: 0.29775, top1: 0.65448, throughput: 1313.78[rank:6] [train], epoch: 31/50, iter: 400/834, loss: 0.29713, top1: 0.65229, throughput: 1314.00 | 2022-04-11 00:10:50.380 | 2022-04-11 00:10:50.381 [rank:5] [train], epoch: 31/50, iter: 400/834, loss: 0.29724, top1: 0.65479, throughput: 1313.94 | 2022-04-11 00:10:50.380 [rank:4] [train], epoch: 31/50, iter: 400/834, loss: 0.29569, top1: 0.65797, throughput: 1313.75 | 2022-04-11 00:10:50.381 [rank:7] [train], epoch: 31/50, iter: 400/834, loss: 0.29680, top1: 0.65365, throughput: 1313.86 | 2022-04-11 00:10:50.381 [rank:3] [train], epoch: 31/50, iter: 400/834, loss: 0.29691, top1: 0.65781, throughput: 1314.02 | 2022-04-11 00:10:50.383 [rank:1] [train], epoch: 31/50, iter: 400/834, 
loss: 0.29768, top1: 0.65557, throughput: 1313.50 | 2022-04-11 00:10:50.384 [rank:0] [train], epoch: 31/50, iter: 400/834, loss: 0.29937, top1: 0.65089, throughput: 1313.96 | 2022-04-11 00:10:50.382 [rank:6] [train], epoch: 31/50, iter: 500/834, loss: 0.29788, top1: 0.65198, throughput: 1316.45 | 2022-04-11 00:11:04.965 [rank:5] [train], epoch: 31/50, iter: 500/834, loss: 0.29902, top1: 0.65063, throughput: 1316.27 | 2022-04-11 00:11:04.967 [rank:7] [train], epoch: 31/50, iter: 500/834, loss: 0.29559, top1: 0.65734, throughput: 1316.40 | 2022-04-11 00:11:04.966 [rank:1] [train], epoch: 31/50, iter: 500/834, loss: 0.29909, top1: 0.65104, throughput: 1316.68 | 2022-04-11 00:11:04.966 [rank:4] [train], epoch: 31/50, iter: 500/834, loss: 0.29702, top1: 0.65552, throughput: 1316.37 | 2022-04-11 00:11:04.966 [rank:2] [train], epoch: 31/50, iter: 500/834, loss: 0.29533, top1: 0.65885, throughput: 1316.42 | 2022-04-11 00:11:04.966 [rank:3] [train], epoch: 31/50, iter: 500/834, loss: 0.29588, top1: 0.65833, throughput: 1316.46 | 2022-04-11 00:11:04.968 [rank:0] [train], epoch: 31/50, iter: 500/834, loss: 0.29694, top1: 0.65234, throughput: 1316.39 | 2022-04-11 00:11:04.967 [rank:6] [train], epoch: 31/50, iter: 600/834, loss: 0.29888, top1: 0.65266, throughput: 1313.81 | 2022-04-11 00:11:19.579 [rank:5] [train], epoch: 31/50, iter: 600/834, loss: 0.29679, top1: 0.65646, throughput: 1313.96 | 2022-04-11 00:11:19.579 [rank:4] [train], epoch: 31/50, iter: 600/834, loss: 0.29741, top1: 0.65505, throughput: 1313.95 | 2022-04-11 00:11:19.579 [rank:1] [train], epoch: 31/50, iter: 600/834, loss: 0.29620, top1: 0.65490, throughput: 1313.83 | 2022-04-11 00:11:19.580 [rank:2] [train], epoch: 31/50, iter: 600/834, loss: 0.29712, top1: 0.65672, throughput: 1313.66 | 2022-04-11 00:11:19.582 [rank:0] [train], epoch: 31/50, iter: 600/834, loss: 0.29734, top1: 0.65453, throughput: 1313.87 | 2022-04-11 00:11:19.581 [rank:3] [train], epoch: 31/50, iter: 600/834, loss: 0.29840, top1: 0.64844,
throughput: 1313.68 | 2022-04-11 00:11:19.583 [rank:7] [train], epoch: 31/50, iter: 600/834, loss: 0.29781, top1: 0.65573, throughput: 1313.75 | 2022-04-11 00:11:19.581 [rank:6] [train], epoch: 31/50, iter: 700/834, loss: 0.29628, top1: 0.65839, throughput: 1313.52 | 2022-04-11 00:11:34.196 [rank:5] [train], epoch: 31/50, iter: 700/834, loss: 0.29641, top1: 0.65391, throughput: 1313.42 | 2022-04-11 00:11:34.197 [rank:1] [train], epoch: 31/50, iter: 700/834, loss: 0.29694, top1: 0.65406, throughput: 1313.51 | 2022-04-11 00:11:34.197 [rank:7] [train], epoch: 31/50, iter: 700/834, loss: 0.29909, top1: 0.65641, throughput: 1313.60 | 2022-04-11 00:11:34.197 [rank:0] [train], epoch: 31/50, iter: 700/834, loss: 0.29573, top1: 0.65745, throughput: 1313.54 | 2022-04-11 00:11:34.198 [rank:4] [train], epoch: 31/50, iter: 700/834, loss: 0.29713, top1: 0.65672, throughput: 1313.38 | 2022-04-11 00:11:34.198 [rank:3] [train], epoch: 31/50, iter: 700/834, loss: 0.29626, top1: 0.65568, throughput: 1313.36 | 2022-04-11 00:11:34.202 [rank:2] [train], epoch: 31/50, iter: 700/834, loss: 0.29566, top1: 0.66349, throughput: 1313.41 | 2022-04-11 00:11:34.200 [rank:4] [train], epoch: 31/50, iter: 800/834, loss: 0.29862, top1: 0.65297, throughput: 1319.95 | 2022-04-11 00:11:48.744 [rank:2] [train], epoch: 31/50, iter: 800/834, loss: 0.29823, top1: 0.64969, throughput: 1319.93 | 2022-04-11 00:11:48.746 [rank:5] [train], epoch: 31/50, iter: 800/834, loss: 0.29675, top1: 0.65708, throughput: 1319.92 | 2022-04-11 00:11:48.744 [rank:6] [train], epoch: 31/50, iter: 800/834, loss: 0.29520, top1: 0.65875, throughput: 1319.73 | 2022-04-11 00:11:48.745 [rank:7] [train], epoch: 31/50, iter: 800/834, loss: 0.29965, top1: 0.65156, throughput: 1319.83 | 2022-04-11 00:11:48.744 [rank:1] [train], epoch: 31/50, iter: 800/834, loss: 0.29569, top1: 0.65781, throughput: 1319.60 | 2022-04-11 00:11:48.747 [rank:0] [train], epoch: 31/50, iter: 800/834, loss: 0.29591, top1: 0.65625, throughput: 1319.76 | 
2022-04-11 00:11:48.746 [rank:3] [train], epoch: 31/50, iter: 800/834, loss: 0.29641, top1: 0.65714, throughput: 1319.92 | 2022-04-11 00:11:48.749 [rank:5] [train], epoch: 31/50, iter: 834/834, loss: 0.29686, top1: 0.65594, throughput: 1313.79 | 2022-04-11 00:11:53.713 [rank:4] [train], epoch: 31/50, iter: 834/834, loss: 0.30147, top1: 0.64185, throughput: 1313.96 | 2022-04-11 00:11:53.712 [rank:0] [train], epoch: 31/50, iter: 834/834, loss: 0.29644, top1: 0.66054, throughput: 1314.31 | 2022-04-11 00:11:53.713 [rank:3] [train], epoch: 31/50, iter: 834/834, loss: 0.29368, top1: 0.66268, throughput: 1314.81 | 2022-04-11 00:11:53.714 [rank:1] [train], epoch: 31/50, iter: 834/834, loss: 0.30391, top1: 0.64767, throughput: 1313.92 | 2022-04-11 00:11:53.716 [rank:6] [train], epoch: 31/50, iter: 834/834, loss: 0.29939, top1: 0.64844, throughput: 1313.17 [rank:2] [train], epoch: 31/50, iter: 834/834, loss: 0.29940, top1: 0.64782, throughput: 1313.80| 2022-04-11 00:11:53.716 | 2022-04-11 00:11:53.715 [rank:7] [train], epoch: 31/50, iter: 834/834, loss: 0.29978, top1: 0.64767, throughput: 1312.72 | 2022-04-11 00:11:53.717 [rank:7] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.67536, throughput: 581.80 | 2022-04-11 00:12:04.460 [rank:0] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.67984, throughput: 581.43 | 2022-04-11 00:12:04.462 [rank:2] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.65904, throughput: 576.81 | 2022-04-11 00:12:04.551 [rank:6] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.67312, throughput: 574.92 | 2022-04-11 00:12:04.587 [rank:4] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.66496, throughput: 574.42 | 2022-04-11 00:12:04.593 [rank:3] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.66768, throughput: 574.25 | 2022-04-11 00:12:04.597 [rank:5] [eval], epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.65344, throughput: 565.84 | 2022-04-11 00:12:04.758 [rank:1] [eval], 
epoch: 31/50, iter: 125/125, loss: 0.00000, top1: 0.67040, throughput: 561.99 | 2022-04-11 00:12:04.837 [rank:2] [train], epoch: 32/50, iter: 100/834, loss: 0.29124, top1: 0.66615, throughput: 1295.43 | 2022-04-11 00:12:19.372 [rank:5] [train], epoch: 32/50, iter: 100/834, loss: 0.29140, top1: 0.66958, throughput: 1313.85 | 2022-04-11 00:12:19.372 [rank:0] [train], epoch: 32/50, iter: 100/834, loss: 0.29268, top1: 0.66422, throughput: 1287.65 | 2022-04-11 00:12:19.373 [rank:4] [train], epoch: 32/50, iter: 100/834, loss: 0.29017, top1: 0.66724, throughput: 1298.94 | 2022-04-11 00:12:19.374 [rank:1] [train], epoch: 32/50, iter: 100/834, loss: 0.29057, top1: 0.66474, throughput: 1320.67 | 2022-04-11 00:12:19.375 [rank:3] [train], epoch: 32/50, iter: 100/834, loss: 0.28831, top1: 0.67323, throughput: 1299.34 | 2022-04-11 00:12:19.374 [rank:6] [train], epoch: 32/50, iter: 100/834, loss: 0.29371, top1: 0.66094, throughput: 1298.43 | 2022-04-11 00:12:19.374 [rank:7] [train], epoch: 32/50, iter: 100/834, loss: 0.29568, top1: 0.66042, throughput: 1287.39 | 2022-04-11 00:12:19.374 [rank:6] [train], epoch: 32/50, iter: 200/834, loss: 0.29056, top1: 0.66688, throughput: 1311.97 | 2022-04-11 00:12:34.008 [rank:7] [train], epoch: 32/50, iter: 200/834, loss: 0.29371, top1: 0.66167, throughput: 1311.80 | 2022-04-11 00:12:34.010 [rank:4] [train], epoch: 32/50, iter: 200/834, loss: 0.29127, top1: 0.66380, throughput: 1311.92 | 2022-04-11 00:12:34.009 [rank:2] [train], epoch: 32/50, iter: 200/834, loss: 0.29300, top1: 0.66120, throughput: 1311.69 | 2022-04-11 00:12:34.010 [rank:5] [train], epoch: 32/50, iter: 200/834, loss: 0.29186, top1: 0.66583, throughput: 1311.67 | 2022-04-11 00:12:34.009 [rank:1] [train], epoch: 32/50, iter: 200/834, loss: 0.29367, top1: 0.66000, throughput: 1311.84 | 2022-04-11 00:12:34.011 [rank:0] [train], epoch: 32/50, iter: 200/834, loss: 0.28889, top1: 0.67167, throughput: 1311.69 | 2022-04-11 00:12:34.010 [rank:3] [train], epoch: 32/50, iter: 200/834, 
loss: 0.29358, top1: 0.66255, throughput: 1311.65 | 2022-04-11 00:12:34.012 [rank:2] [train], epoch: 32/50, iter: 300/834, loss: 0.28938, top1: 0.67104, throughput: 1313.34 | 2022-04-11 00:12:48.629 [rank:6] [train], epoch: 32/50, iter: 300/834, loss: 0.29158, top1: 0.66047, throughput: 1313.35 | 2022-04-11 00:12:48.627 [rank:5] [train], epoch: 32/50, iter: 300/834, loss: 0.29254, top1: 0.66589, throughput: 1313.40 | 2022-04-11 00:12:48.628 [rank:3] [train], epoch: 32/50, iter: 300/834, loss: 0.29285, top1: 0.66266, throughput: 1313.52 | 2022-04-11 00:12:48.629 [rank:0] [train], epoch: 32/50, iter: 300/834, loss: 0.29277, top1: 0.66526, throughput: 1313.43 | 2022-04-11 00:12:48.629 [rank:1] [train], epoch: 32/50, iter: 300/834, loss: 0.29269, top1: 0.66740, throughput: 1313.37 | 2022-04-11 00:12:48.630 [rank:7] [train], epoch: 32/50, iter: 300/834, loss: 0.28980, top1: 0.67026, throughput: 1313.40 | 2022-04-11 00:12:48.629 [rank:4] [train], epoch: 32/50, iter: 300/834, loss: 0.29248, top1: 0.66740, throughput: 1313.14 | 2022-04-11 00:12:48.630 [rank:6] [train], epoch: 32/50, iter: 400/834, loss: 0.29146, top1: 0.66266, throughput: 1312.68 | 2022-04-11 00:13:03.254 [rank:1] [train], epoch: 32/50, iter: 400/834, loss: 0.29571, top1: 0.65667, throughput: 1312.92 | 2022-04-11 00:13:03.253 [rank:4] [train], epoch: 32/50, iter: 400/834, loss: 0.29068, top1: 0.66917, throughput: 1312.93 | 2022-04-11 00:13:03.254 [rank:5] [train], epoch: 32/50, iter: 400/834, loss: 0.29510, top1: 0.66260, throughput: 1312.69 | 2022-04-11 00:13:03.254 [rank:7] [train], epoch: 32/50, iter: 400/834, loss: 0.29247, top1: 0.66307, throughput: 1312.73 | 2022-04-11 00:13:03.255 [rank:3] [train], epoch: 32/50, iter: 400/834, loss: 0.29359, top1: 0.66052, throughput: 1312.65 | 2022-04-11 00:13:03.256 [rank:2] [train], epoch: 32/50, iter: 400/834, loss: 0.29202, top1: 0.66521, throughput: 1312.73 | 2022-04-11 00:13:03.255 [rank:0] [train], epoch: 32/50, iter: 400/834, loss: 0.29304, top1: 0.66333,
throughput: 1312.77 | 2022-04-11 00:13:03.254 [rank:5] [train], epoch: 32/50, iter: 500/834, loss: 0.29322, top1: 0.65828, throughput: 1312.00 | 2022-04-11 00:13:17.888 [rank:2] [train], epoch: 32/50, iter: 500/834, loss: 0.29358, top1: 0.66469, throughput: 1312.08 | 2022-04-11 00:13:17.888 [rank:1] [train], epoch: 32/50, iter: 500/834, loss: 0.29007, top1: 0.66469, throughput: 1311.91 | 2022-04-11 00:13:17.889 [rank:6] [train], epoch: 32/50, iter: 500/834, loss: 0.29329, top1: 0.66021, throughput: 1312.11 | 2022-04-11 00:13:17.887 [rank:4] [train], epoch: 32/50, iter: 500/834, loss: 0.29104, top1: 0.66453, throughput: 1311.99 | 2022-04-11 00:13:17.888 [rank:7] [train], epoch: 32/50, iter: 500/834, loss: 0.29206, top1: 0.66771, throughput: 1312.06 | 2022-04-11 00:13:17.888 [rank:3] [train], epoch: 32/50, iter: 500/834, loss: 0.29246, top1: 0.66229, throughput: 1311.97 | 2022-04-11 00:13:17.891 [rank:0] [train], epoch: 32/50, iter: 500/834, loss: 0.29338, top1: 0.66276, throughput: 1311.89 | 2022-04-11 00:13:17.889 [rank:6] [train], epoch: 32/50, iter: 600/834, loss: 0.29244, top1: 0.66432, throughput: 1313.66 | 2022-04-11 00:13:32.503 [rank:1] [train], epoch: 32/50, iter: 600/834, loss: 0.29325, top1: 0.66146, throughput: 1313.67 | 2022-04-11 00:13:32.504 [rank:3] [train], epoch: 32/50, iter: 600/834, loss: 0.29456, top1: 0.65922, throughput: 1313.84 | 2022-04-11 00:13:32.504 [rank:7] [train], epoch: 32/50, iter: 600/834, loss: 0.29160, top1: 0.66771, throughput: 1313.70 | 2022-04-11 00:13:32.503 [rank:4] [train], epoch: 32/50, iter: 600/834, loss: 0.29569, top1: 0.66052, throughput: 1313.63 | 2022-04-11 00:13:32.504 [rank:5] [train], epoch: 32/50, iter: 600/834, loss: 0.29098, top1: 0.66792, throughput: 1313.74 | 2022-04-11 00:13:32.503 [rank:2] [train], epoch: 32/50, iter: 600/834, loss: 0.29481, top1: 0.65703, throughput: 1313.59 | 2022-04-11 00:13:32.505 [rank:0] [train], epoch: 32/50, iter: 600/834, loss: 0.29399, top1: 0.66557, throughput: 1313.57 | 
2022-04-11 00:13:32.506 [rank:4] [train], epoch: 32/50, iter: 700/834, loss: 0.29358, top1: 0.66365, throughput: 1313.32 | 2022-04-11 00:13:47.124 [rank:2] [train], epoch: 32/50, iter: 700/834, loss: 0.29144, top1: 0.66672, throughput: 1313.19 | 2022-04-11 00:13:47.125 [rank:1] [train], epoch: 32/50, iter: 700/834, loss: 0.29404, top1: 0.65927, throughput: 1313.24 | 2022-04-11 00:13:47.125 [rank:6] [train], epoch: 32/50, iter: 700/834, loss: 0.29286, top1: 0.66260, throughput: 1313.14 | 2022-04-11 00:13:47.124 [rank:7] [train], epoch: 32/50, iter: 700/834, loss: 0.29261, top1: 0.66286, throughput: 1313.12 | 2022-04-11 00:13:47.125 [rank:3] [train], epoch: 32/50, iter: 700/834, loss: 0.29400, top1: 0.65979, throughput: 1313.08 | 2022-04-11 00:13:47.126 [rank:5] [train], epoch: 32/50, iter: 700/834, loss: 0.29559, top1: 0.65729, throughput: 1313.06 | 2022-04-11 00:13:47.126 [rank:0] [train], epoch: 32/50, iter: 700/834, loss: 0.29423, top1: 0.65839, throughput: 1313.12 | 2022-04-11 00:13:47.128 [rank:5] [train], epoch: 32/50, iter: 800/834, loss: 0.29047, top1: 0.66693, throughput: 1313.86 | 2022-04-11 00:14:01.739 [rank:2] [train], epoch: 32/50, iter: 800/834, loss: 0.29320, top1: 0.66349, throughput: 1313.77 | 2022-04-11 00:14:01.740 [rank:4] [train], epoch: 32/50, iter: 800/834, loss: 0.29199, top1: 0.66562, throughput: 1313.63 | 2022-04-11 00:14:01.740 [rank:3] [train], epoch: 32/50, iter: 800/834, loss: 0.29283, top1: 0.66453, throughput: 1313.73 | 2022-04-11 00:14:01.741 [rank:1] [train], epoch: 32/50, iter: 800/834, loss: 0.29070, top1: 0.67188, throughput: 1313.38 | 2022-04-11 00:14:01.743 [rank:0] [train], epoch: 32/50, iter: 800/834, loss: 0.29139, top1: 0.66599, throughput: 1313.67 | 2022-04-11 00:14:01.743 [rank:6] [train], epoch: 32/50, iter: 800/834, loss: 0.29297, top1: 0.66135, throughput: 1313.65 | 2022-04-11 00:14:01.740 [rank:7] [train], epoch: 32/50, iter: 800/834, loss: 0.29372, top1: 0.66156, throughput: 1313.50 | 2022-04-11 00:14:01.743 
[rank:5] [train], epoch: 32/50, iter: 834/834, loss: 0.29305, top1: 0.66008, throughput: 1307.41 | 2022-04-11 00:14:06.732 [rank:1] [train], epoch: 32/50, iter: 834/834, loss: 0.29287, top1: 0.65993, throughput: 1308.66 | 2022-04-11 00:14:06.732 [rank:2] [train], epoch: 32/50, iter: 834/834, loss: 0.29059, top1: 0.66422, throughput: 1307.72 | 2022-04-11 00:14:06.732 [rank:7] [train], epoch: 32/50, iter: 834/834, loss: 0.28942, top1: 0.66774, throughput: 1308.49 | 2022-04-11 00:14:06.731 [rank:0] [train], epoch: 32/50, iter: 834/834, loss: 0.29606, top1: 0.66529, throughput: 1308.51 | 2022-04-11 00:14:06.732 [rank:4] [train], epoch: 32/50, iter: 834/834, loss: 0.29596, top1: 0.65962, throughput: 1307.49 | 2022-04-11 00:14:06.733 [rank:6] [train], epoch: 32/50, iter: 834/834, loss: 0.29598, top1: 0.65748, throughput: 1307.42 | 2022-04-11 00:14:06.733 [rank:3] [train], epoch: 32/50, iter: 834/834, loss: 0.29256, top1: 0.66636, throughput: 1307.52 | 2022-04-11 00:14:06.734 [rank:2] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.66368, throughput: 587.21 | 2022-04-11 00:14:17.375 [rank:7] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.66848, throughput: 587.10 | 2022-04-11 00:14:17.377 [rank:0] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.66976, throughput: 586.79 | 2022-04-11 00:14:17.383 [rank:1] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.67392, throughput: 580.04 | 2022-04-11 00:14:17.507 [rank:6] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.66400, throughput: 579.53 | 2022-04-11 00:14:17.518 [rank:3] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.66128, throughput: 579.11 | 2022-04-11 00:14:17.526 [rank:4] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.66192, throughput: 575.27 | 2022-04-11 00:14:17.597 [rank:5] [eval], epoch: 32/50, iter: 125/125, loss: 0.00000, top1: 0.66560, throughput: 568.90 | 2022-04-11 00:14:17.718 [rank:5] [train], epoch: 33/50, iter: 100/834, 
loss: 0.28489, top1: 0.67943, throughput: 1319.97 | 2022-04-11 00:14:32.264 [rank:2] [train], epoch: 33/50, iter: 100/834, loss: 0.28562, top1: 0.68193, throughput: 1289.51 | 2022-04-11 00:14:32.265 [rank:6] [train], epoch: 33/50, iter: 100/834, loss: 0.28555, top1: 0.68240, throughput: 1301.87 | 2022-04-11 00:14:32.266 [rank:1] [train], epoch: 33/50, iter: 100/834, loss: 0.28471, top1: 0.68052, throughput: 1300.96 | 2022-04-11 00:14:32.265 [rank:7] [train], epoch: 33/50, iter: 100/834, loss: 0.29005, top1: 0.66479, throughput: 1289.62 | 2022-04-11 00:14:32.265 [rank:4] [train], epoch: 33/50, iter: 100/834, loss: 0.28725, top1: 0.67802, throughput: 1308.91 | 2022-04-11 00:14:32.266 [rank:3] [train], epoch: 33/50, iter: 100/834, loss: 0.28507, top1: 0.67646, throughput: 1302.56 | 2022-04-11 00:14:32.267 [rank:0] [train], epoch: 33/50, iter: 100/834, loss: 0.28478, top1: 0.67661, throughput: 1290.09 | 2022-04-11 00:14:32.266 [rank:4] [train], epoch: 33/50, iter: 200/834, loss: 0.28709, top1: 0.67302, throughput: 1315.39 | 2022-04-11 00:14:46.862 [rank:2] [train], epoch: 33/50, iter: 200/834, loss: 0.28881, top1: 0.66938, throughput: 1315.21 | 2022-04-11 00:14:46.863 [rank:5] [train], epoch: 33/50, iter: 200/834, loss: 0.28878, top1: 0.67427, throughput: 1315.25 | 2022-04-11 00:14:46.862 [rank:6] [train], epoch: 33/50, iter: 200/834, loss: 0.29061, top1: 0.67167, throughput: 1315.31 | 2022-04-11 00:14:46.863 [rank:7] [train], epoch: 33/50, iter: 200/834, loss: 0.28803, top1: 0.67375, throughput: 1315.23 | 2022-04-11 00:14:46.863 [rank:1] [train], epoch: 33/50, iter: 200/834, loss: 0.28639, top1: 0.67771, throughput: 1315.19 | 2022-04-11 00:14:46.864 [rank:3] [train], epoch: 33/50, iter: 200/834, loss: 0.28594, top1: 0.67531, throughput: 1315.03 | 2022-04-11 00:14:46.867 [rank:0] [train], epoch: 33/50, iter: 200/834, loss: 0.28722, top1: 0.67500, throughput: 1315.11 | 2022-04-11 00:14:46.866 [rank:6] [train], epoch: 33/50, iter: 300/834, loss: 0.28935, top1: 0.67094, 
throughput: 1316.35 | 2022-04-11 00:15:01.449 [rank:5] [train], epoch: 33/50, iter: 300/834, loss: 0.28896, top1: 0.67000, throughput: 1316.22 | 2022-04-11 00:15:01.449 [rank:4] [train], epoch: 33/50, iter: 300/834, loss: 0.28605, top1: 0.67573, throughput: 1316.24 | 2022-04-11 00:15:01.449 [rank:1] [train], epoch: 33/50, iter: 300/834, loss: 0.28749, top1: 0.67427, throughput: 1316.11 | 2022-04-11 00:15:01.452 [rank:3] [train], epoch: 33/50, iter: 300/834, loss: 0.28639, top1: 0.67583, throughput: 1316.49 | 2022-04-11 00:15:01.451 [rank:0] [train], epoch: 33/50, iter: 300/834, loss: 0.28551, top1: 0.67958, throughput: 1316.42 | 2022-04-11 00:15:01.451 [rank:2] [train], epoch: 33/50, iter: 300/834, loss: 0.28916, top1: 0.67099, throughput: 1316.22 | 2022-04-11 00:15:01.450 [rank:7] [train], epoch: 33/50, iter: 300/834, loss: 0.28935, top1: 0.67031, throughput: 1316.04 | 2022-04-11 00:15:01.453 [rank:6] [train], epoch: 33/50, iter: 400/834, loss: 0.28985, top1: 0.66677, throughput: 1312.14 | 2022-04-11 00:15:16.081 [rank:4] [train], epoch: 33/50, iter: 400/834, loss: 0.28993, top1: 0.66953, throughput: 1312.34 | 2022-04-11 00:15:16.080 [rank:2] [train], epoch: 33/50, iter: 400/834, loss: 0.29070, top1: 0.66599, throughput: 1312.29 | 2022-04-11 00:15:16.081 [rank:0] [train], epoch: 33/50, iter: 400/834, loss: 0.28726, top1: 0.67760, throughput: 1312.21 | 2022-04-11 00:15:16.082 [rank:5] [train], epoch: 33/50, iter: 400/834, loss: 0.28893, top1: 0.67193, throughput: 1312.30 | 2022-04-11 00:15:16.080 [rank:1] [train], epoch: 33/50, iter: 400/834, loss: 0.28478, top1: 0.68292, throughput: 1312.46 | 2022-04-11 00:15:16.081 [rank:3] [train], epoch: 33/50, iter: 400/834, loss: 0.28960, top1: 0.66630, throughput: 1312.25 | 2022-04-11 00:15:16.083 [rank:7] [train], epoch: 33/50, iter: 400/834, loss: 0.29033, top1: 0.67000, throughput: 1312.35 | 2022-04-11 00:15:16.083 [rank:4] [train], epoch: 33/50, iter: 500/834, loss: 0.28986, top1: 0.66646, throughput: 1315.72 | 
2022-04-11 00:15:30.672 [rank:5] [train], epoch: 33/50, iter: 500/834, loss: 0.28891, top1: 0.66635, throughput: 1315.88 | 2022-04-11 00:15:30.671 [rank:0] [train], epoch: 33/50, iter: 500/834, loss: 0.28943, top1: 0.66745, throughput: 1316.07 | 2022-04-11 00:15:30.671 [rank:7] [train], epoch: 33/50, iter: 500/834, loss: 0.28900, top1: 0.66984, throughput: 1316.13 | 2022-04-11 00:15:30.671 [rank:1] [train], epoch: 33/50, iter: 500/834, loss: 0.29160, top1: 0.66687, throughput: 1315.93 | 2022-04-11 00:15:30.672 [rank:6] [train], epoch: 33/50, iter: 500/834, loss: 0.28748, top1: 0.67302, throughput: 1315.90 | 2022-04-11 00:15:30.672 [rank:2] [train], epoch: 33/50, iter: 500/834, loss: 0.29116, top1: 0.66677, throughput: 1315.93 | 2022-04-11 00:15:30.672 [rank:3] [train], epoch: 33/50, iter: 500/834, loss: 0.28880, top1: 0.67068, throughput: 1315.85 | 2022-04-11 00:15:30.674 [rank:6] [train], epoch: 33/50, iter: 600/834, loss: 0.28866, top1: 0.67047, throughput: 1314.92 | 2022-04-11 00:15:45.274 [rank:5] [train], epoch: 33/50, iter: 600/834, loss: 0.28947, top1: 0.66797, throughput: 1314.73 | 2022-04-11 00:15:45.275 [rank:2] [train], epoch: 33/50, iter: 600/834, loss: 0.28749, top1: 0.67349, throughput: 1314.84 | 2022-04-11 00:15:45.274 [rank:4] [train], epoch: 33/50, iter: 600/834, loss: 0.28774, top1: 0.67349, throughput: 1314.94 | 2022-04-11 00:15:45.274 [rank:1] [train], epoch: 33/50, iter: 600/834, loss: 0.28655, top1: 0.67609, throughput: 1314.83 | 2022-04-11 00:15:45.274 [rank:7] [train], epoch: 33/50, iter: 600/834, loss: 0.28811, top1: 0.67260, throughput: 1314.76 | 2022-04-11 00:15:45.274 [rank:3] [train], epoch: 33/50, iter: 600/834, loss: 0.28966, top1: 0.66750, throughput: 1314.74 | 2022-04-11 00:15:45.277 [rank:0] [train], epoch: 33/50, iter: 600/834, loss: 0.28788, top1: 0.67547, throughput: 1314.56 | 2022-04-11 00:15:45.277 [rank:4] [train], epoch: 33/50, iter: 700/834, loss: 0.29042, top1: 0.66458, throughput: 1314.66 | 2022-04-11 00:15:59.878 [rank:0]
[train], epoch: 33/50, iter: 700/834, loss: 0.28773, top1: 0.67234, throughput: 1314.93 | 2022-04-11 00:15:59.879 [rank:1] [train], epoch: 33/50, iter: 700/834, loss: 0.28900, top1: 0.67063, throughput: 1314.62 | 2022-04-11 00:15:59.879 [rank:5] [train], epoch: 33/50, iter: 700/834, loss: 0.29043, top1: 0.66734, throughput: 1314.75 | 2022-04-11 00:15:59.878 [rank:3] [train], epoch: 33/50, iter: 700/834, loss: 0.28960, top1: 0.66740, throughput: 1314.60 | 2022-04-11 00:15:59.883 [rank:6] [train], epoch: 33/50, iter: 700/834, loss: 0.29124, top1: 0.66740, throughput: 1314.43 | 2022-04-11 00:15:59.881 [rank:2] [train], epoch: 33/50, iter: 700/834, loss: 0.29003, top1: 0.67047, throughput: 1314.48 | 2022-04-11 00:15:59.881 [rank:7] [train], epoch: 33/50, iter: 700/834, loss: 0.29088, top1: 0.66786, throughput: 1314.62 | 2022-04-11 00:15:59.879 [rank:2] [train], epoch: 33/50, iter: 800/834, loss: 0.29162, top1: 0.66703, throughput: 1314.33 | 2022-04-11 00:16:14.489 [rank:4] [train], epoch: 33/50, iter: 800/834, loss: 0.28921, top1: 0.66927, throughput: 1314.32 | 2022-04-11 00:16:14.487 [rank:5] [train], epoch: 33/50, iter: 800/834, loss: 0.28797, top1: 0.67297, throughput: 1314.36 | 2022-04-11 00:16:14.486 [rank:3] [train], epoch: 33/50, iter: 800/834, loss: 0.29082, top1: 0.66844, throughput: 1314.62 | 2022-04-11 00:16:14.488 [rank:1] [train], epoch: 33/50, iter: 800/834, loss: 0.29153, top1: 0.66552, throughput: 1314.34 | 2022-04-11 00:16:14.487 [rank:0] [train], epoch: 33/50, iter: 800/834, loss: 0.28897, top1: 0.66724, throughput: 1314.30 | 2022-04-11 00:16:14.487 [rank:6] [train], epoch: 33/50, iter: 800/834, loss: 0.28787, top1: 0.67531, throughput: 1314.17 | 2022-04-11 00:16:14.491 [rank:7] [train], epoch: 33/50, iter: 800/834, loss: 0.28937, top1: 0.67078, throughput: 1313.83 | 2022-04-11 00:16:14.493 [rank:4] [train], epoch: 33/50, iter: 834/834, loss: 0.29177, top1: 0.67371, throughput: 1307.94 | 2022-04-11 00:16:19.478 [rank:1] [train], epoch: 33/50, iter: 
834/834, loss: 0.29241, top1: 0.66651, throughput: 1308.15 | 2022-04-11 00:16:19.478 [rank:7] [train], epoch: 33/50, iter: 834/834, loss: 0.28844, top1: 0.67080, throughput: 1309.68 | 2022-04-11 00:16:19.478 [rank:2] [train], epoch: 33/50, iter: 834/834, loss: 0.29132, top1: 0.66222, throughput: 1308.56 | 2022-04-11 00:16:19.478 [rank:6] [train], epoch: 33/50, iter: 834/834, loss: 0.28895, top1: 0.67218, throughput: 1308.83 | 2022-04-11 00:16:19.478 [rank:5] [train], epoch: 33/50, iter: 834/834, loss: 0.29181, top1: 0.66207, throughput: 1307.50 | 2022-04-11 00:16:19.479 [rank:0] [train], epoch: 33/50, iter: 834/834, loss: 0.28921, top1: 0.66667, throughput: 1307.81 | 2022-04-11 00:16:19.479 [rank:3] [train], epoch: 33/50, iter: 834/834, loss: 0.28744, top1: 0.67096, throughput: 1307.98 | 2022-04-11 00:16:19.479 [rank:0] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.68736, throughput: 576.42 | 2022-04-11 00:16:30.321 [rank:7] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.68352, throughput: 576.27 | 2022-04-11 00:16:30.323 [rank:2] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.68048, throughput: 574.56 | 2022-04-11 00:16:30.355 [rank:4] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.68704, throughput: 573.90 | 2022-04-11 00:16:30.368 [rank:6] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.68128, throughput: 567.57 | 2022-04-11 00:16:30.490 [rank:5] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.67024, throughput: 567.24 | 2022-04-11 00:16:30.497 [rank:3] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.67920, throughput: 566.46 | 2022-04-11 00:16:30.512 [rank:1] [eval], epoch: 33/50, iter: 125/125, loss: 0.00000, top1: 0.68128, throughput: 559.54 | 2022-04-11 00:16:30.648 [rank:4] [train], epoch: 34/50, iter: 100/834, loss: 0.28499, top1: 0.67833, throughput: 1296.78 | 2022-04-11 00:16:45.174 [rank:5] [train], epoch: 34/50, iter: 100/834, loss: 0.28314, top1: 0.68109, 
throughput: 1308.17 | 2022-04-11 00:16:45.174 [rank:6] [train], epoch: 34/50, iter: 100/834, loss: 0.28519, top1: 0.67891, throughput: 1307.59 | 2022-04-11 00:16:45.174 [rank:0] [train], epoch: 34/50, iter: 100/834, loss: 0.28190, top1: 0.68391, throughput: 1292.59 | 2022-04-11 00:16:45.175 [rank:1] [train], epoch: 34/50, iter: 100/834, loss: 0.28480, top1: 0.67969, throughput: 1321.46 | 2022-04-11 00:16:45.177 [rank:3] [train], epoch: 34/50, iter: 100/834, loss: 0.28225, top1: 0.68479, throughput: 1309.18 | 2022-04-11 00:16:45.178 [rank:7] [train], epoch: 34/50, iter: 100/834, loss: 0.28181, top1: 0.68437, throughput: 1292.77 | 2022-04-11 00:16:45.175 [rank:2] [train], epoch: 34/50, iter: 100/834, loss: 0.28452, top1: 0.67990, throughput: 1295.43 | 2022-04-11 00:16:45.177 [rank:6] [train], epoch: 34/50, iter: 200/834, loss: 0.28317, top1: 0.68094, throughput: 1315.73 | 2022-04-11 00:16:59.766 [rank:4] [train], epoch: 34/50, iter: 200/834, loss: 0.28591, top1: 0.67604, throughput: 1315.78 | 2022-04-11 00:16:59.766 [rank:1] [train], epoch: 34/50, iter: 200/834, loss: 0.28559, top1: 0.67630, throughput: 1316.00 | 2022-04-11 00:16:59.767 [rank:5] [train], epoch: 34/50, iter: 200/834, loss: 0.28269, top1: 0.68484, throughput: 1315.73 | 2022-04-11 00:16:59.767 [rank:0] [train], epoch: 34/50, iter: 200/834, loss: 0.28437, top1: 0.68021, throughput: 1315.73 | 2022-04-11 00:16:59.768 [rank:2] [train], epoch: 34/50, iter: 200/834, loss: 0.28587, top1: 0.67807, throughput: 1315.82 | 2022-04-11 00:16:59.768 [rank:7] [train], epoch: 34/50, iter: 200/834, loss: 0.28175, top1: 0.68417, throughput: 1315.74 | 2022-04-11 00:16:59.768 [rank:3] [train], epoch: 34/50, iter: 200/834, loss: 0.28549, top1: 0.67714, throughput: 1315.70 | 2022-04-11 00:16:59.771 [rank:5] [train], epoch: 34/50, iter: 300/834, loss: 0.28386, top1: 0.67927, throughput: 1306.04 | 2022-04-11 00:17:14.468 [rank:6] [train], epoch: 34/50, iter: 300/834, loss: 0.28522, top1: 0.68312, throughput: 1305.99 | 
2022-04-11 00:17:14.468 [rank:4] [train], epoch: 34/50, iter: 300/834, loss: 0.28554, top1: 0.67651, throughput: 1305.90 | 2022-04-11 00:17:14.469 [rank:1] [train], epoch: 34/50, iter: 300/834, loss: 0.28314, top1: 0.68125, throughput: 1305.88 | 2022-04-11 00:17:14.469 [rank:3] [train], epoch: 34/50, iter: 300/834, loss: 0.28491, top1: 0.67937, throughput: 1306.17 | 2022-04-11 00:17:14.470 [rank:7] [train], epoch: 34/50, iter: 300/834, loss: 0.28272, top1: 0.68474, throughput: 1305.99 | 2022-04-11 00:17:14.469 [rank:0] [train], epoch: 34/50, iter: 300/834, loss: 0.28323, top1: 0.68385, throughput: 1305.83 | 2022-04-11 00:17:14.471 [rank:2] [train], epoch: 34/50, iter: 300/834, loss: 0.28567, top1: 0.67703, throughput: 1305.86 | 2022-04-11 00:17:14.471 [rank:6] [train], epoch: 34/50, iter: 400/834, loss: 0.28340, top1: 0.68224, throughput: 1315.69 | 2022-04-11 00:17:29.061 [rank:5] [train], epoch: 34/50, iter: 400/834, loss: 0.28526, top1: 0.67901, throughput: 1315.66 | 2022-04-11 00:17:29.061 [rank:3] [train], epoch: 34/50, iter: 400/834, loss: 0.28327, top1: 0.68318, throughput: 1315.64 | 2022-04-11 00:17:29.064 [rank:2] [train], epoch: 34/50, iter: 400/834, loss: 0.28503, top1: 0.67995, throughput: 1315.67 | 2022-04-11 00:17:29.065 [rank:0] [train], epoch: 34/50, iter: 400/834, loss: 0.28568, top1: 0.67901, throughput: 1315.79 | 2022-04-11 00:17:29.063 [rank:4] [train], epoch: 34/50, iter: 400/834, loss: 0.28703, top1: 0.67292, throughput: 1315.49 | 2022-04-11 00:17:29.064 [rank:1] [train], epoch: 34/50, iter: 400/834, loss: 0.28509, top1: 0.67906, throughput: 1315.55 | 2022-04-11 00:17:29.064 [rank:7] [train], epoch: 34/50, iter: 400/834, loss: 0.28409, top1: 0.68115, throughput: 1315.64 | 2022-04-11 00:17:29.063 [rank:4] [train], epoch: 34/50, iter: 500/834, loss: 0.28604, top1: 0.67880, throughput: 1314.60 | 2022-04-11 00:17:43.669 [rank:2] [train], epoch: 34/50, iter: 500/834, loss: 0.28396, top1: 0.68005, throughput: 1314.58 | 2022-04-11 00:17:43.670 [rank:5]
[train], epoch: 34/50, iter: 500/834, loss: 0.28192, top1: 0.68167, throughput: 1314.31 | 2022-04-11 00:17:43.670 [rank:6] [train], epoch: 34/50, iter: 500/834, loss: 0.28444, top1: 0.67813, throughput: 1314.29 | 2022-04-11 00:17:43.670 [rank:0] [train], epoch: 34/50, iter: 500/834, loss: 0.28907, top1: 0.67255, throughput: 1314.45 | 2022-04-11 00:17:43.670 [rank:7] [train], epoch: 34/50, iter: 500/834, loss: 0.28294, top1: 0.68286, throughput: 1314.38 | 2022-04-11 00:17:43.670 [rank:3] [train], epoch: 34/50, iter: 500/834, loss: 0.28348, top1: 0.67797, throughput: 1314.23 | 2022-04-11 00:17:43.673 [rank:1] [train], epoch: 34/50, iter: 500/834, loss: 0.28540, top1: 0.67641, throughput: 1314.20 | 2022-04-11 00:17:43.674 [rank:4] [train], epoch: 34/50, iter: 600/834, loss: 0.28493, top1: 0.67818, throughput: 1314.33 | 2022-04-11 00:17:58.277 [rank:2] [train], epoch: 34/50, iter: 600/834, loss: 0.28545, top1: 0.67948, throughput: 1314.21 | 2022-04-11 00:17:58.280 [rank:5] [train], epoch: 34/50, iter: 600/834, loss: 0.28509, top1: 0.68229, throughput: 1314.29 | 2022-04-11 00:17:58.278 [rank:6] [train], epoch: 34/50, iter: 600/834, loss: 0.28570, top1: 0.67500, throughput: 1314.17 | 2022-04-11 00:17:58.280 [rank:1] [train], epoch: 34/50, iter: 600/834, loss: 0.28634, top1: 0.67875, throughput: 1314.66 | 2022-04-11 00:17:58.278 [rank:3] [train], epoch: 34/50, iter: 600/834, loss: 0.28246, top1: 0.68344, throughput: 1314.41 | 2022-04-11 00:17:58.280 [rank:7] [train], epoch: 34/50, iter: 600/834, loss: 0.28664, top1: 0.67641, throughput: 1314.31 | 2022-04-11 00:17:58.279 [rank:0] [train], epoch: 34/50, iter: 600/834, loss: 0.28594, top1: 0.67891, throughput: 1314.05 | 2022-04-11 00:17:58.281 [rank:5] [train], epoch: 34/50, iter: 700/834, loss: 0.28479, top1: 0.67542, throughput: 1315.78 | 2022-04-11 00:18:12.870 [rank:6] [train], epoch: 34/50, iter: 700/834, loss: 0.28647, top1: 0.67625, throughput: 1315.95 | 2022-04-11 00:18:12.870 [rank:4] [train], epoch: 34/50, iter: 
700/834, loss: 0.28317, top1: 0.67958, throughput: 1315.62 | 2022-04-11 00:18:12.871 [rank:2] [train], epoch: 34/50, iter: 700/834, loss: 0.28467, top1: 0.67937, throughput: 1315.88 | 2022-04-11 00:18:12.871 [rank:7] [train], epoch: 34/50, iter: 700/834, loss: 0.28281, top1: 0.68224, throughput: 1315.57 | 2022-04-11 00:18:12.873 [rank:3] [train], epoch: 34/50, iter: 700/834, loss: 0.28398, top1: 0.68099, throughput: 1315.37 | 2022-04-11 00:18:12.877 [rank:1] [train], epoch: 34/50, iter: 700/834, loss: 0.28459, top1: 0.67568, throughput: 1315.31 | 2022-04-11 00:18:12.876 [rank:0] [train], epoch: 34/50, iter: 700/834, loss: 0.28474, top1: 0.67750, throughput: 1315.75 | 2022-04-11 00:18:12.874 [rank:4] [train], epoch: 34/50, iter: 800/834, loss: 0.28634, top1: 0.67339, throughput: 1314.14 | 2022-04-11 00:18:27.481 [rank:6] [train], epoch: 34/50, iter: 800/834, loss: 0.28381, top1: 0.68010, throughput: 1314.06 | 2022-04-11 00:18:27.481 [rank:5] [train], epoch: 34/50, iter: 800/834, loss: 0.28489, top1: 0.67516, throughput: 1314.06 | 2022-04-11 00:18:27.481 [rank:2] [train], epoch: 34/50, iter: 800/834, loss: 0.28382, top1: 0.68453, throughput: 1313.83 | 2022-04-11 00:18:27.484 [rank:1] [train], epoch: 34/50, iter: 800/834, loss: 0.28486, top1: 0.67896, throughput: 1314.29 | 2022-04-11 00:18:27.484 [rank:0] [train], epoch: 34/50, iter: 800/834, loss: 0.28417, top1: 0.68344, throughput: 1314.27 | 2022-04-11 00:18:27.483 [rank:3] [train], epoch: 34/50, iter: 800/834, loss: 0.28516, top1: 0.67594, throughput: 1314.43 | 2022-04-11 00:18:27.484 [rank:7] [train], epoch: 34/50, iter: 800/834, loss: 0.28711, top1: 0.67260, throughput: 1314.02 | 2022-04-11 00:18:27.485 [rank:4] [train], epoch: 34/50, iter: 834/834, loss: 0.28256, top1: 0.69133, throughput: 1304.51 | 2022-04-11 00:18:32.486 [rank:1] [train], epoch: 34/50, iter: 834/834, loss: 0.28224, top1: 0.67907, throughput: 1304.78 | 2022-04-11 00:18:32.487 [rank:7] [train], epoch: 34/50, iter: 834/834, loss: 0.28693, top1: 
0.67785, throughput: 1304.88 | 2022-04-11 00:18:32.488 [rank:5] [train], epoch: 34/50, iter: 834/834, loss: 0.28089, top1: 0.69026, throughput: 1303.88 | 2022-04-11 00:18:32.488 [rank:0] [train], epoch: 34/50, iter: 834/834, loss: 0.29073, top1: 0.66697, throughput: 1304.32 | 2022-04-11 00:18:32.488 [rank:6] [train], epoch: 34/50, iter: 834/834, loss: 0.28295, top1: 0.67892, throughput: 1303.69 | 2022-04-11 00:18:32.488 [rank:2] [train], epoch: 34/50, iter: 834/834, loss: 0.28577, top1: 0.67218, throughput: 1304.39 | 2022-04-11 00:18:32.489 [rank:3] [train], epoch: 34/50, iter: 834/834, loss: 0.28910, top1: 0.67096, throughput: 1303.96 | 2022-04-11 00:18:32.490 [rank:7] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.69424, throughput: 585.75 | 2022-04-11 00:18:43.158 [rank:2] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.67616, throughput: 585.65 | 2022-04-11 00:18:43.161 [rank:0] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.69408, throughput: 585.49 | 2022-04-11 00:18:43.163 [rank:4] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.69312, throughput: 580.19 | 2022-04-11 00:18:43.258 [rank:6] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.69200, throughput: 577.58 | 2022-04-11 00:18:43.309 [rank:3] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.68448, throughput: 575.68 | 2022-04-11 00:18:43.347 [rank:5] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.68512, throughput: 571.59 | 2022-04-11 00:18:43.423 [rank:1] [eval], epoch: 34/50, iter: 125/125, loss: 0.00000, top1: 0.69328, throughput: 569.87 | 2022-04-11 00:18:43.455 [rank:4] [train], epoch: 35/50, iter: 100/834, loss: 0.27966, top1: 0.69146, throughput: 1302.03 | 2022-04-11 00:18:58.004 [rank:5] [train], epoch: 35/50, iter: 100/834, loss: 0.28144, top1: 0.68703, throughput: 1316.69 | 2022-04-11 00:18:58.005 [rank:1] [train], epoch: 35/50, iter: 100/834, loss: 0.27974, top1: 0.68240, throughput: 1319.44 | 2022-04-11 
00:18:58.006 [rank:2] [train], epoch: 35/50, iter: 100/834, loss: 0.27954, top1: 0.68823, throughput: 1293.38 | 2022-04-11 00:18:58.006 [rank:6] [train], epoch: 35/50, iter: 100/834, loss: 0.27639, top1: 0.69870, throughput: 1306.42 | 2022-04-11 00:18:58.006 [rank:3] [train], epoch: 35/50, iter: 100/834, loss: 0.28216, top1: 0.68542, throughput: 1309.67 | 2022-04-11 00:18:58.007 [rank:7] [train], epoch: 35/50, iter: 100/834, loss: 0.27779, top1: 0.69505, throughput: 1293.03 | 2022-04-11 00:18:58.006 [rank:0] [train], epoch: 35/50, iter: 100/834, loss: 0.27592, top1: 0.69385, throughput: 1293.28 | 2022-04-11 00:18:58.009 [rank:4] [train], epoch: 35/50, iter: 200/834, loss: 0.28013, top1: 0.68698, throughput: 1317.09 | 2022-04-11 00:19:12.582 [rank:1] [train], epoch: 35/50, iter: 200/834, loss: 0.28132, top1: 0.68422, throughput: 1317.36 | 2022-04-11 00:19:12.581 [rank:6] [train], epoch: 35/50, iter: 200/834, loss: 0.27789, top1: 0.69391, throughput: 1317.24 | 2022-04-11 00:19:12.582 [rank:2] [train], epoch: 35/50, iter: 200/834, loss: 0.28072, top1: 0.68583, throughput: 1317.29 | 2022-04-11 00:19:12.581 [rank:5] [train], epoch: 35/50, iter: 200/834, loss: 0.28042, top1: 0.68823, throughput: 1317.04 | 2022-04-11 00:19:12.583 [rank:7] [train], epoch: 35/50, iter: 200/834, loss: 0.28055, top1: 0.68844, throughput: 1317.36 | 2022-04-11 00:19:12.581 [rank:3] [train], epoch: 35/50, iter: 200/834, loss: 0.28237, top1: 0.68271, throughput: 1317.18 | 2022-04-11 00:19:12.584 [rank:0] [train], epoch: 35/50, iter: 200/834, loss: 0.28152, top1: 0.68828, throughput: 1317.42 | 2022-04-11 00:19:12.583 [rank:6] [train], epoch: 35/50, iter: 300/834, loss: 0.27982, top1: 0.69036, throughput: 1313.92 | 2022-04-11 00:19:27.195 [rank:4] [train], epoch: 35/50, iter: 300/834, loss: 0.27993, top1: 0.68729, throughput: 1313.91 | 2022-04-11 00:19:27.195 [rank:2] [train], epoch: 35/50, iter: 300/834, loss: 0.27930, top1: 0.69193, throughput: 1313.75 | 2022-04-11 00:19:27.196 [rank:0] [train], 
epoch: 35/50, iter: 300/834, loss: 0.28136, top1: 0.68339, throughput: 1313.86 | 2022-04-11 00:19:27.196 [rank:1] [train], epoch: 35/50, iter: 300/834, loss: 0.28103, top1: 0.68505, throughput: 1313.65 | 2022-04-11 00:19:27.197 [rank:3] [train], epoch: 35/50, iter: 300/834, loss: 0.28190, top1: 0.68453, throughput: 1313.73 | 2022-04-11 00:19:27.199 [rank:5] [train], epoch: 35/50, iter: 300/834, loss: 0.28058, top1: 0.68740, throughput: 1313.80 | 2022-04-11 00:19:27.197 [rank:7] [train], epoch: 35/50, iter: 300/834, loss: 0.28295, top1: 0.68349, throughput: 1313.58 | 2022-04-11 00:19:27.198 [rank:5] [train], epoch: 35/50, iter: 400/834, loss: 0.28313, top1: 0.67891, throughput: 1317.30 | 2022-04-11 00:19:41.772 [rank:6] [train], epoch: 35/50, iter: 400/834, loss: 0.28122, top1: 0.68719, throughput: 1316.85 | 2022-04-11 00:19:41.775 [rank:2] [train], epoch: 35/50, iter: 400/834, loss: 0.28356, top1: 0.68057, throughput: 1316.92 | 2022-04-11 00:19:41.775 [rank:3] [train], epoch: 35/50, iter: 400/834, loss: 0.28016, top1: 0.68880, throughput: 1317.33 | 2022-04-11 00:19:41.774 [rank:1] [train], epoch: 35/50, iter: 400/834, loss: 0.28357, top1: 0.68219, throughput: 1317.14 | 2022-04-11 00:19:41.774 [rank:7] [train], epoch: 35/50, iter: 400/834, loss: 0.28123, top1: 0.68917, throughput: 1317.14 | 2022-04-11 00:19:41.775 [rank:4] [train], epoch: 35/50, iter: 400/834, loss: 0.28122, top1: 0.68521, throughput: 1316.78 | 2022-04-11 00:19:41.776 [rank:0] [train], epoch: 35/50, iter: 400/834, loss: 0.28258, top1: 0.67901, throughput: 1316.86 | 2022-04-11 00:19:41.776 [rank:5] [train], epoch: 35/50, iter: 500/834, loss: 0.28069, top1: 0.68406, throughput: 1313.74 | 2022-04-11 00:19:56.387 [rank:6] [train], epoch: 35/50, iter: 500/834, loss: 0.28075, top1: 0.69010, throughput: 1313.93 | 2022-04-11 00:19:56.387 [rank:3] [train], epoch: 35/50, iter: 500/834, loss: 0.28098, top1: 0.68891, throughput: 1313.71 | 2022-04-11 00:19:56.389 [rank:4] [train], epoch: 35/50, iter: 500/834,
loss: 0.28286, top1: 0.68146, throughput: 1314.00 | 2022-04-11 00:19:56.387 [rank:2] [train], epoch: 35/50, iter: 500/834, loss: 0.28136, top1: 0.68172, throughput: 1313.84 | 2022-04-11 00:19:56.389 [rank:1] [train], epoch: 35/50, iter: 500/834, loss: 0.28064, top1: 0.68521, throughput: 1313.56 | 2022-04-11 00:19:56.391 [rank:0] [train], epoch: 35/50, iter: 500/834, loss: 0.28091, top1: 0.68797, throughput: 1313.76 | 2022-04-11 00:19:56.391 [rank:7] [train], epoch: 35/50, iter: 500/834, loss: 0.28142, top1: 0.68427, throughput: 1313.66 | 2022-04-11 00:19:56.390 [rank:2] [train], epoch: 35/50, iter: 600/834, loss: 0.28380, top1: 0.67729, throughput: 1313.74 | 2022-04-11 00:20:11.004 [rank:6] [train], epoch: 35/50, iter: 600/834, loss: 0.27927, top1: 0.69089, throughput: 1313.65 | 2022-04-11 00:20:11.003 [rank:5] [train], epoch: 35/50, iter: 600/834, loss: 0.28008, top1: 0.68531, throughput: 1313.63 | 2022-04-11 00:20:11.003 [rank:4] [train], epoch: 35/50, iter: 600/834, loss: 0.27993, top1: 0.69260, throughput: 1313.68 | 2022-04-11 00:20:11.003 [rank:3] [train], epoch: 35/50, iter: 600/834, loss: 0.27965, top1: 0.69021, throughput: 1313.69 | 2022-04-11 00:20:11.004 [rank:1] [train], epoch: 35/50, iter: 600/834, loss: 0.28083, top1: 0.68609, throughput: 1313.84 | 2022-04-11 00:20:11.004 [rank:0] [train], epoch: 35/50, iter: 600/834, loss: 0.27911, top1: 0.68995, throughput: 1313.67 | 2022-04-11 00:20:11.006 [rank:7] [train], epoch: 35/50, iter: 600/834, loss: 0.27786, top1: 0.69547, throughput: 1313.57 | 2022-04-11 00:20:11.007 [rank:5] [train], epoch: 35/50, iter: 700/834, loss: 0.27990, top1: 0.68906, throughput: 1317.06 | 2022-04-11 00:20:25.581 [rank:2] [train], epoch: 35/50, iter: 700/834, loss: 0.28356, top1: 0.67875, throughput: 1317.08 | 2022-04-11 00:20:25.582 [rank:1] [train], epoch: 35/50, iter: 700/834, loss: 0.28182, top1: 0.68927, throughput: 1317.07 | 2022-04-11 00:20:25.582 [rank:6] [train], epoch: 35/50, iter: 700/834, loss: 0.28021, top1: 0.68865, 
throughput: 1316.95 | 2022-04-11 00:20:25.582 [rank:3] [train], epoch: 35/50, iter: 700/834, loss: 0.28112, top1: 0.68526, throughput: 1316.91 | 2022-04-11 00:20:25.584 [rank:0] [train], epoch: 35/50, iter: 700/834, loss: 0.27953, top1: 0.69307, throughput: 1316.96[rank:4] [train], epoch: 35/50, iter: 700/834, loss: 0.28397, top1: 0.68292, throughput: 1316.58 | 2022-04-11 00:20:25.586| 2022-04-11 00:20:25.585 [rank:7] [train], epoch: 35/50, iter: 700/834, loss: 0.28144, top1: 0.68578, throughput: 1316.80 | 2022-04-11 00:20:25.588 [rank:2] [train], epoch: 35/50, iter: 800/834, loss: 0.27938, top1: 0.68932, throughput: 1316.71 | 2022-04-11 00:20:40.163 [rank:3] [train], epoch: 35/50, iter: 800/834, loss: 0.28289, top1: 0.68526, throughput: 1316.77 | 2022-04-11 00:20:40.165 [rank:6] [train], epoch: 35/50, iter: 800/834, loss: 0.28197, top1: 0.68615, throughput: 1316.80 | 2022-04-11 00:20:40.163 [rank:5] [train], epoch: 35/50, iter: 800/834, loss: 0.28057, top1: 0.68802, throughput: 1316.75 | 2022-04-11 00:20:40.162 [rank:4] [train], epoch: 35/50, iter: 800/834, loss: 0.28091, top1: 0.68797, throughput: 1317.14[rank:1] [train], epoch: 35/50, iter: 800/834, loss: 0.27989, top1: 0.68911, throughput: 1316.70 | 2022-04-11 00:20:40.164| 2022-04-11 00:20:40.163 [rank:0] [train], epoch: 35/50, iter: 800/834, loss: 0.28100, top1: 0.68719, throughput: 1317.04 | 2022-04-11 00:20:40.163 [rank:7] [train], epoch: 35/50, iter: 800/834, loss: 0.28231, top1: 0.68651, throughput: 1317.09 | 2022-04-11 00:20:40.165 [rank:5] [train], epoch: 35/50, iter: 834/834, loss: 0.27879, top1: 0.68689, throughput: 1306.18 | 2022-04-11 00:20:45.160 [rank:4] [train], epoch: 35/50, iter: 834/834, loss: 0.28243, top1: 0.68244, throughput: 1306.53 | 2022-04-11 00:20:45.160 [rank:2] [train], epoch: 35/50, iter: 834/834, loss: 0.27936, top1: 0.68658, throughput: 1306.62 | 2022-04-11 00:20:45.160 [rank:1] [train], epoch: 35/50, iter: 834/834, loss: 0.28097, top1: 0.68536, throughput: 1306.64 | 2022-04-11 
00:20:45.160 [rank:7] [train], epoch: 35/50, iter: 834/834, loss: 0.28078, top1: 0.68091, throughput: 1306.83 | 2022-04-11 00:20:45.161 [rank:0] [train], epoch: 35/50, iter: 834/834, loss: 0.28014, top1: 0.68229, throughput: 1306.22 | 2022-04-11 00:20:45.161 [rank:6] [train], epoch: 35/50, iter: 834/834, loss: 0.28357, top1: 0.68091, throughput: 1305.98 | 2022-04-11 00:20:45.162 [rank:3] [train], epoch: 35/50, iter: 834/834, loss: 0.27959, top1: 0.69087, throughput: 1306.10 | 2022-04-11 00:20:45.163 [rank:0] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.69632, throughput: 592.06 | 2022-04-11 00:20:55.717 [rank:7] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.69520, throughput: 591.56 | 2022-04-11 00:20:55.726 [rank:6] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.68304, throughput: 584.29 | 2022-04-11 00:20:55.858 [rank:2] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.68000, throughput: 584.04 | 2022-04-11 00:20:55.861 [rank:4] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.68160, throughput: 582.00 | 2022-04-11 00:20:55.898 [rank:3] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.68352, throughput: 577.05 | 2022-04-11 00:20:55.994 [rank:1] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.69104, throughput: 576.63 | 2022-04-11 00:20:55.999 [rank:5] [eval], epoch: 35/50, iter: 125/125, loss: 0.00000, top1: 0.67712, throughput: 573.30 | 2022-04-11 00:20:56.062 [rank:6] [train], epoch: 36/50, iter: 100/834, loss: 0.27482, top1: 0.69703, throughput: 1295.76 | 2022-04-11 00:21:10.676 [rank:1] [train], epoch: 36/50, iter: 100/834, loss: 0.27482, top1: 0.69865, throughput: 1307.93 | 2022-04-11 00:21:10.679 [rank:5] [train], epoch: 36/50, iter: 100/834, loss: 0.27289, top1: 0.70385, throughput: 1313.56 | 2022-04-11 00:21:10.678 [rank:4] [train], epoch: 36/50, iter: 100/834, loss: 0.27281, top1: 0.70073, throughput: 1299.09 | 2022-04-11 00:21:10.678 [rank:3] [train], epoch: 36/50, 
iter: 100/834, loss: 0.27539, top1: 0.70109, throughput: 1307.44 | 2022-04-11 00:21:10.679 [rank:0] [train], epoch: 36/50, iter: 100/834, loss: 0.27054, top1: 0.70656, throughput: 1283.34 | 2022-04-11 00:21:10.678 [rank:2] [train], epoch: 36/50, iter: 100/834, loss: 0.27424, top1: 0.69937, throughput: 1295.65 | 2022-04-11 00:21:10.680 [rank:7] [train], epoch: 36/50, iter: 100/834, loss: 0.27424, top1: 0.70234, throughput: 1283.87 | 2022-04-11 00:21:10.681 [rank:4] [train], epoch: 36/50, iter: 200/834, loss: 0.27440, top1: 0.69818, throughput: 1316.61 | 2022-04-11 00:21:25.261 [rank:2] [train], epoch: 36/50, iter: 200/834, loss: 0.27646, top1: 0.69880, throughput: 1316.63 | 2022-04-11 00:21:25.262 [rank:6] [train], epoch: 36/50, iter: 200/834, loss: 0.27487, top1: 0.69896, throughput: 1316.28 | 2022-04-11 00:21:25.263 [rank:1] [train], epoch: 36/50, iter: 200/834, loss: 0.27473, top1: 0.69609, throughput: 1316.37 | 2022-04-11 00:21:25.264 [rank:5] [train], epoch: 36/50, iter: 200/834, loss: 0.27634, top1: 0.69609, throughput: 1316.44 | 2022-04-11 00:21:25.263 [rank:3] [train], epoch: 36/50, iter: 200/834, loss: 0.27421, top1: 0.69839, throughput: 1316.43 | 2022-04-11 00:21:25.264 [rank:0] [train], epoch: 36/50, iter: 200/834, loss: 0.27638, top1: 0.69729, throughput: 1316.46 | 2022-04-11 00:21:25.263 [rank:7] [train], epoch: 36/50, iter: 200/834, loss: 0.27338, top1: 0.69917, throughput: 1316.55 | 2022-04-11 00:21:25.264 [rank:2] [train], epoch: 36/50, iter: 300/834, loss: 0.27642, top1: 0.69583, throughput: 1315.25 | 2022-04-11 00:21:39.860 [rank:5] [train], epoch: 36/50, iter: 300/834, loss: 0.27594, top1: 0.69583, throughput: 1315.43 | 2022-04-11 00:21:39.859 [rank:6] [train], epoch: 36/50, iter: 300/834, loss: 0.27649, top1: 0.69443, throughput: 1315.23 | 2022-04-11 00:21:39.861 [rank:1] [train], epoch: 36/50, iter: 300/834, loss: 0.27474, top1: 0.70094, throughput: 1315.35 | 2022-04-11 00:21:39.861 [rank:3] [train], epoch: 36/50, iter: 300/834, loss: 0.27584,
top1: 0.69693, throughput: 1315.25 | 2022-04-11 00:21:39.862 [rank:7] [train], epoch: 36/50, iter: 300/834, loss: 0.27657, top1: 0.69354, throughput: 1315.32 | 2022-04-11 00:21:39.861 [rank:4] [train], epoch: 36/50, iter: 300/834, loss: 0.27625, top1: 0.69458, throughput: 1315.03 | 2022-04-11 00:21:39.861 [rank:0] [train], epoch: 36/50, iter: 300/834, loss: 0.27731, top1: 0.69281, throughput: 1315.17 | 2022-04-11 00:21:39.862 [rank:6] [train], epoch: 36/50, iter: 400/834, loss: 0.27830, top1: 0.69026, throughput: 1316.87 | 2022-04-11 00:21:54.441 [rank:4] [train], epoch: 36/50, iter: 400/834, loss: 0.27890, top1: 0.68979, throughput: 1316.93 | 2022-04-11 00:21:54.441 [rank:5] [train], epoch: 36/50, iter: 400/834, loss: 0.27741, top1: 0.69359, throughput: 1316.88 | 2022-04-11 00:21:54.439 [rank:7] [train], epoch: 36/50, iter: 400/834, loss: 0.27517, top1: 0.69729, throughput: 1317.01 | 2022-04-11 00:21:54.440 [rank:3] [train], epoch: 36/50, iter: 400/834, loss: 0.27685, top1: 0.69385, throughput: 1316.98 | 2022-04-11 00:21:54.441 [rank:2] [train], epoch: 36/50, iter: 400/834, loss: 0.27842, top1: 0.69214, throughput: 1316.89 | 2022-04-11 00:21:54.440 [rank:1] [train], epoch: 36/50, iter: 400/834, loss: 0.27705, top1: 0.69380, throughput: 1316.73 | 2022-04-11 00:21:54.443 [rank:0] [train], epoch: 36/50, iter: 400/834, loss: 0.27842, top1: 0.69203, throughput: 1316.96 | 2022-04-11 00:21:54.441 [rank:4] [train], epoch: 36/50, iter: 500/834, loss: 0.27613, top1: 0.69833, throughput: 1314.45 | 2022-04-11 00:22:09.047 [rank:3] [train], epoch: 36/50, iter: 500/834, loss: 0.27739, top1: 0.69083, throughput: 1314.26 | 2022-04-11 00:22:09.050 [rank:2] [train], epoch: 36/50, iter: 500/834, loss: 0.27907, top1: 0.68958, throughput: 1314.34 | 2022-04-11 00:22:09.048 [rank:1] [train], epoch: 36/50, iter: 500/834, loss: 0.27602, top1: 0.69615, throughput: 1314.50 | 2022-04-11 00:22:09.049 [rank:0] [train], epoch: 36/50, iter: 500/834, loss: 0.27808, top1: 0.69359, throughput: 
1314.40 | 2022-04-11 00:22:09.048 [rank:5] [train], epoch: 36/50, iter: 500/834, loss: 0.27485, top1: 0.69917, throughput: 1314.24 | 2022-04-11 00:22:09.048 [rank:6] [train], epoch: 36/50, iter: 500/834, loss: 0.27452, top1: 0.70167, throughput: 1314.26 | 2022-04-11 00:22:09.050 [rank:7] [train], epoch: 36/50, iter: 500/834, loss: 0.27719, top1: 0.69682, throughput: 1314.25 | 2022-04-11 00:22:09.049 [rank:5] [train], epoch: 36/50, iter: 600/834, loss: 0.27835, top1: 0.69318, throughput: 1315.95 | 2022-04-11 00:22:23.639 [rank:6] [train], epoch: 36/50, iter: 600/834, loss: 0.27733, top1: 0.69224, throughput: 1315.94 | 2022-04-11 00:22:23.640 [rank:2] [train], epoch: 36/50, iter: 600/834, loss: 0.27920, top1: 0.68901, throughput: 1315.69 | 2022-04-11 00:22:23.641 [rank:1] [train], epoch: 36/50, iter: 600/834, loss: 0.27772, top1: 0.69047, throughput: 1315.83 | 2022-04-11 00:22:23.640 [rank:7] [train], epoch: 36/50, iter: 600/834, loss: 0.27896, top1: 0.68807, throughput: 1315.80 | 2022-04-11 00:22:23.641 [rank:0] [train], epoch: 36/50, iter: 600/834, loss: 0.27937, top1: 0.68932, throughput: 1315.76 | 2022-04-11 00:22:23.641 [rank:4] [train], epoch: 36/50, iter: 600/834, loss: 0.27631, top1: 0.69589, throughput: 1315.71 | 2022-04-11 00:22:23.640 [rank:3] [train], epoch: 36/50, iter: 600/834, loss: 0.27751, top1: 0.69385, throughput: 1315.83 | 2022-04-11 00:22:23.642 [rank:7] [train], epoch: 36/50, iter: 700/834, loss: 0.27707, top1: 0.69740, throughput: 1314.18 | 2022-04-11 00:22:38.251 [rank:4] [train], epoch: 36/50, iter: 700/834, loss: 0.27780, top1: 0.69198, throughput: 1314.15 | 2022-04-11 00:22:38.251 [rank:5] [train], epoch: 36/50, iter: 700/834, loss: 0.27708, top1: 0.69661, throughput: 1313.96 | 2022-04-11 00:22:38.251 [rank:2] [train], epoch: 36/50, iter: 700/834, loss: 0.27703, top1: 0.69349, throughput: 1314.24 | 2022-04-11 00:22:38.250 [rank:0] [train], epoch: 36/50, iter: 700/834, loss: 0.27701, top1: 0.69740, throughput: 1314.20 | 2022-04-11 
00:22:38.250 [rank:6] [train], epoch: 36/50, iter: 700/834, loss: 0.27653, top1: 0.69536, throughput: 1314.11[rank:3] [train], epoch: 36/50, iter: 700/834, loss: 0.27842, top1: 0.68813, throughput: 1314.15 | 2022-04-11 00:22:38.252 | 2022-04-11 00:22:38.251 [rank:1] [train], epoch: 36/50, iter: 700/834, loss: 0.27489, top1: 0.69813, throughput: 1314.00 | 2022-04-11 00:22:38.252 [rank:2] [train], epoch: 36/50, iter: 800/834, loss: 0.27748, top1: 0.69734, throughput: 1312.64 | 2022-04-11 00:22:52.877 [rank:4] [train], epoch: 36/50, iter: 800/834, loss: 0.27527, top1: 0.70104, throughput: 1312.83 | 2022-04-11 00:22:52.875 [rank:5] [train], epoch: 36/50, iter: 800/834, loss: 0.27710, top1: 0.69000, throughput: 1312.68 | 2022-04-11 00:22:52.877 [rank:1] [train], epoch: 36/50, iter: 800/834, loss: 0.27957, top1: 0.68969, throughput: 1312.61 | 2022-04-11 00:22:52.880 [rank:7] [train], epoch: 36/50, iter: 800/834, loss: 0.27987, top1: 0.69047, throughput: 1312.50[rank:6] [train], epoch: 36/50, iter: 800/834, loss: 0.27588, top1: 0.69714, throughput: 1312.63 | 2022-04-11 00:22:52.878| 2022-04-11 00:22:52.879 [rank:3] [train], epoch: 36/50, iter: 800/834, loss: 0.27689, top1: 0.69370, throughput: 1312.59 | 2022-04-11 00:22:52.879 [rank:0] [train], epoch: 36/50, iter: 800/834, loss: 0.27727, top1: 0.69193, throughput: 1312.54 | 2022-04-11 00:22:52.878 [rank:6] [train], epoch: 36/50, iter: 834/834, loss: 0.27462, top1: 0.69393, throughput: 1312.52 | 2022-04-11 00:22:57.851 [rank:2] [train], epoch: 36/50, iter: 834/834, loss: 0.27645, top1: 0.69638, throughput: 1312.36 | 2022-04-11 00:22:57.852 [rank:5] [train], epoch: 36/50, iter: 834/834, loss: 0.27398, top1: 0.69393, throughput: 1311.98 | 2022-04-11 00:22:57.853 [rank:7] [train], epoch: 36/50, iter: 834/834, loss: 0.27960, top1: 0.69010, throughput: 1312.55 | 2022-04-11 00:22:57.853 [rank:4] [train], epoch: 36/50, iter: 834/834, loss: 0.28047, top1: 0.68061, throughput: 1311.21 | 2022-04-11 00:22:57.854 [rank:0] [train], 
epoch: 36/50, iter: 834/834, loss: 0.27704, top1: 0.69041, throughput: 1312.21 | 2022-04-11 00:22:57.853 [rank:1] [train], epoch: 36/50, iter: 834/834, loss: 0.27450, top1: 0.70358, throughput: 1311.97 | 2022-04-11 00:22:57.855 [rank:3] [train], epoch: 36/50, iter: 834/834, loss: 0.27420, top1: 0.69868, throughput: 1311.68 | 2022-04-11 00:22:57.856 [rank:7] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.69792, throughput: 581.54 | 2022-04-11 00:23:08.600 [rank:2] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.68208, throughput: 581.19 | 2022-04-11 00:23:08.605 [rank:0] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.69552, throughput: 581.21 | 2022-04-11 00:23:08.607 [rank:3] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.68640, throughput: 578.04 | 2022-04-11 00:23:08.668 [rank:1] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.68704, throughput: 573.90 | 2022-04-11 00:23:08.746 [rank:6] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.69008, throughput: 571.76 | 2022-04-11 00:23:08.783 [rank:4] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.69072, throughput: 570.76 | 2022-04-11 00:23:08.804 [rank:5] [eval], epoch: 36/50, iter: 125/125, loss: 0.00000, top1: 0.68320, throughput: 562.60 | 2022-04-11 00:23:08.962 [rank:6] [train], epoch: 37/50, iter: 100/834, loss: 0.27157, top1: 0.70302, throughput: 1304.60 | 2022-04-11 00:23:23.500 [rank:4] [train], epoch: 37/50, iter: 100/834, loss: 0.27213, top1: 0.70479, throughput: 1306.50 | 2022-04-11 00:23:23.500 [rank:5] [train], epoch: 37/50, iter: 100/834, loss: 0.26922, top1: 0.71359, throughput: 1320.60 | 2022-04-11 00:23:23.501 [rank:1] [train], epoch: 37/50, iter: 100/834, loss: 0.27095, top1: 0.70937, throughput: 1301.22 | 2022-04-11 00:23:23.501 [rank:0] [train], epoch: 37/50, iter: 100/834, loss: 0.27258, top1: 0.70307, throughput: 1289.11 | 2022-04-11 00:23:23.501 [rank:3] [train], epoch: 37/50, iter: 100/834, loss: 0.26898, 
top1: 0.71281, throughput: 1294.33 | 2022-04-11 00:23:23.502 [rank:7] [train], epoch: 37/50, iter: 100/834, loss: 0.27462, top1: 0.69495, throughput: 1288.42 | 2022-04-11 00:23:23.502 [rank:2] [train], epoch: 37/50, iter: 100/834, loss: 0.27000, top1: 0.71292, throughput: 1288.80 | 2022-04-11 00:23:23.503 [rank:6] [train], epoch: 37/50, iter: 200/834, loss: 0.27156, top1: 0.70198, throughput: 1312.61 | 2022-04-11 00:23:38.127 [rank:5] [train], epoch: 37/50, iter: 200/834, loss: 0.27152, top1: 0.70484, throughput: 1312.72 | 2022-04-11 00:23:38.127 [rank:4] [train], epoch: 37/50, iter: 200/834, loss: 0.27313, top1: 0.70333, throughput: 1312.57 | 2022-04-11 00:23:38.128 [rank:2] [train], epoch: 37/50, iter: 200/834, loss: 0.26984, top1: 0.70927, throughput: 1312.82 | 2022-04-11 00:23:38.128 [rank:3] [train], epoch: 37/50, iter: 200/834, loss: 0.27145, top1: 0.70516, throughput: 1312.64 | 2022-04-11 00:23:38.129 [rank:7] [train], epoch: 37/50, iter: 200/834, loss: 0.27352, top1: 0.70458, throughput: 1312.67 | 2022-04-11 00:23:38.129 [rank:1] [train], epoch: 37/50, iter: 200/834, loss: 0.27229, top1: 0.70370, throughput: 1312.32 | 2022-04-11 00:23:38.132 [rank:0] [train], epoch: 37/50, iter: 200/834, loss: 0.27427, top1: 0.69859, throughput: 1312.25 | 2022-04-11 00:23:38.132 [rank:5] [train], epoch: 37/50, iter: 300/834, loss: 0.27244, top1: 0.70234, throughput: 1316.61 | 2022-04-11 00:23:52.710 [rank:4] [train], epoch: 37/50, iter: 300/834, loss: 0.27258, top1: 0.70651, throughput: 1316.66 | 2022-04-11 00:23:52.710 [rank:6] [train], epoch: 37/50, iter: 300/834, loss: 0.26878, top1: 0.71141, throughput: 1316.58 | 2022-04-11 00:23:52.710 [rank:7] [train], epoch: 37/50, iter: 300/834, loss: 0.27169, top1: 0.70349, throughput: 1316.58 | 2022-04-11 00:23:52.712 [rank:0] [train], epoch: 37/50, iter: 300/834, loss: 0.27472, top1: 0.69672, throughput: 1316.86 | 2022-04-11 00:23:52.712 [rank:1] [train], epoch: 37/50, iter: 300/834, loss: 0.27235, top1: 0.70625, throughput: 
1316.56 | 2022-04-11 00:23:52.715 [rank:3] [train], epoch: 37/50, iter: 300/834, loss: 0.27293, top1: 0.70630, throughput: 1316.38 | 2022-04-11 00:23:52.715 [rank:2] [train], epoch: 37/50, iter: 300/834, loss: 0.27393, top1: 0.69870, throughput: 1316.19 | 2022-04-11 00:23:52.716 [rank:2] [train], epoch: 37/50, iter: 400/834, loss: 0.27318, top1: 0.70333, throughput: 1317.08 | 2022-04-11 00:24:07.293 [rank:6] [train], epoch: 37/50, iter: 400/834, loss: 0.27249, top1: 0.70500, throughput: 1316.51 | 2022-04-11 00:24:07.294 [rank:4] [train], epoch: 37/50, iter: 400/834, loss: 0.26994, top1: 0.71083, throughput: 1316.59 | 2022-04-11 00:24:07.293 [rank:5] [train], epoch: 37/50, iter: 400/834, loss: 0.27119, top1: 0.70474, throughput: 1316.64 | 2022-04-11 00:24:07.293 [rank:1] [train], epoch: 37/50, iter: 400/834, loss: 0.27233, top1: 0.70604, throughput: 1316.96 | 2022-04-11 00:24:07.294 [rank:3] [train], epoch: 37/50, iter: 400/834, loss: 0.27362, top1: 0.70604, throughput: 1316.73 | 2022-04-11 00:24:07.296 [rank:7] [train], epoch: 37/50, iter: 400/834, loss: 0.27304, top1: 0.70005, throughput: 1316.63 | 2022-04-11 00:24:07.295 [rank:0] [train], epoch: 37/50, iter: 400/834, loss: 0.27299, top1: 0.70047, throughput: 1316.52 | 2022-04-11 00:24:07.296 [rank:4] [train], epoch: 37/50, iter: 500/834, loss: 0.27071, top1: 0.70729, throughput: 1314.30 | 2022-04-11 00:24:21.902 [rank:5] [train], epoch: 37/50, iter: 500/834, loss: 0.27449, top1: 0.70083, throughput: 1314.29 | 2022-04-11 00:24:21.901 [rank:6] [train], epoch: 37/50, iter: 500/834, loss: 0.27560, top1: 0.69797, throughput: 1314.41 | 2022-04-11 00:24:21.902 [rank:1] [train], epoch: 37/50, iter: 500/834, loss: 0.27581, top1: 0.69521, throughput: 1314.30 | 2022-04-11 00:24:21.903 [rank:3] [train], epoch: 37/50, iter: 500/834, loss: 0.27486, top1: 0.69927, throughput: 1314.37 | 2022-04-11 00:24:21.904 [rank:7] [train], epoch: 37/50, iter: 500/834, loss: 0.27054, top1: 0.70380, throughput: 1314.10 | 2022-04-11 
00:24:21.905 [rank:2] [train], epoch: 37/50, iter: 500/834, loss: 0.27279, top1: 0.70250, throughput: 1314.22 | 2022-04-11 00:24:21.903 [rank:0] [train], epoch: 37/50, iter: 500/834, loss: 0.27114, top1: 0.70766, throughput: 1314.22 | 2022-04-11 00:24:21.905 [rank:5] [train], epoch: 37/50, iter: 600/834, loss: 0.27256, top1: 0.70594, throughput: 1314.96 | 2022-04-11 00:24:36.502 [rank:3] [train], epoch: 37/50, iter: 600/834, loss: 0.27264, top1: 0.70401, throughput: 1315.12 | 2022-04-11 00:24:36.504 [rank:4] [train], epoch: 37/50, iter: 600/834, loss: 0.27267, top1: 0.70672, throughput: 1315.05[rank:6] [train], epoch: 37/50, iter: 600/834, loss: 0.27284, top1: 0.70500, throughput: 1314.99 | 2022-04-11 00:24:36.503 | 2022-04-11 00:24:36.502 [rank:1] [train], epoch: 37/50, iter: 600/834, loss: 0.27297, top1: 0.70120, throughput: 1314.99 | 2022-04-11 00:24:36.504 [rank:7] [train], epoch: 37/50, iter: 600/834, loss: 0.27226, top1: 0.70375, throughput: 1315.31 | 2022-04-11 00:24:36.503 [rank:2] [train], epoch: 37/50, iter: 600/834, loss: 0.27349, top1: 0.70089, throughput: 1314.67 | 2022-04-11 00:24:36.507 [rank:0] [train], epoch: 37/50, iter: 600/834, loss: 0.27105, top1: 0.70406, throughput: 1314.79 | 2022-04-11 00:24:36.509 [rank:5] [train], epoch: 37/50, iter: 700/834, loss: 0.27299, top1: 0.70021, throughput: 1315.57 | 2022-04-11 00:24:51.097 [rank:3] [train], epoch: 37/50, iter: 700/834, loss: 0.27307, top1: 0.70271, throughput: 1315.37 | 2022-04-11 00:24:51.100 [rank:4] [train], epoch: 37/50, iter: 700/834, loss: 0.27255, top1: 0.70271, throughput: 1315.45 | 2022-04-11 00:24:51.098 [rank:7] [train], epoch: 37/50, iter: 700/834, loss: 0.27240, top1: 0.70464, throughput: 1315.45 | 2022-04-11 00:24:51.099 [rank:6] [train], epoch: 37/50, iter: 700/834, loss: 0.27273, top1: 0.69953, throughput: 1315.33 | 2022-04-11 00:24:51.100 [rank:1] [train], epoch: 37/50, iter: 700/834, loss: 0.27009, top1: 0.70802, throughput: 1315.44 | 2022-04-11 00:24:51.100 [rank:2] [train], 
epoch: 37/50, iter: 700/834, loss: 0.27109, top1: 0.70417, throughput: 1315.74 | 2022-04-11 00:24:51.100 [rank:0] [train], epoch: 37/50, iter: 700/834, loss: 0.27397, top1: 0.70224, throughput: 1315.81 | 2022-04-11 00:24:51.100 [rank:4] [train], epoch: 37/50, iter: 800/834, loss: 0.27409, top1: 0.70031, throughput: 1313.90 | 2022-04-11 00:25:05.711 [rank:5] [train], epoch: 37/50, iter: 800/834, loss: 0.27345, top1: 0.69786, throughput: 1313.81 | 2022-04-11 00:25:05.711 [rank:6] [train], epoch: 37/50, iter: 800/834, loss: 0.27175, top1: 0.70021, throughput: 1314.01 | 2022-04-11 00:25:05.711 [rank:2] [train], epoch: 37/50, iter: 800/834, loss: 0.27272, top1: 0.70380, throughput: 1313.74 | 2022-04-11 00:25:05.715 [rank:1] [train], epoch: 37/50, iter: 800/834, loss: 0.27505, top1: 0.69776, throughput: 1313.70 | 2022-04-11 00:25:05.715 [rank:0] [train], epoch: 37/50, iter: 800/834, loss: 0.27360, top1: 0.70052, throughput: 1313.87 | 2022-04-11 00:25:05.714 [rank:3] [train], epoch: 37/50, iter: 800/834, loss: 0.27279, top1: 0.69839, throughput: 1313.83 | 2022-04-11 00:25:05.714 [rank:7] [train], epoch: 37/50, iter: 800/834, loss: 0.27323, top1: 0.70286, throughput: 1313.84 | 2022-04-11 00:25:05.712 [rank:5] [train], epoch: 37/50, iter: 834/834, loss: 0.27481, top1: 0.70098, throughput: 1312.89 | 2022-04-11 00:25:10.683 [rank:4] [train], epoch: 37/50, iter: 834/834, loss: 0.27902, top1: 0.69684, throughput: 1312.56 | 2022-04-11 00:25:10.684 [rank:6] [train], epoch: 37/50, iter: 834/834, loss: 0.27538, top1: 0.70190, throughput: 1312.87 | 2022-04-11 00:25:10.684 [rank:2] [train], epoch: 37/50, iter: 834/834, loss: 0.27432, top1: 0.69945, throughput: 1313.67 | 2022-04-11 00:25:10.684 [rank:3] [train], epoch: 37/50, iter: 834/834, loss: 0.26834, top1: 0.70864, throughput: 1313.10 | 2022-04-11 00:25:10.685 [rank:0] [train], epoch: 37/50, iter: 834/834, loss: 0.27134, top1: 0.70221, throughput: 1312.94 | 2022-04-11 00:25:10.686 [rank:1] [train], epoch: 37/50, iter: 834/834, 
loss: 0.27087, top1: 0.70358, throughput: 1313.06 | 2022-04-11 00:25:10.686 [rank:7] [train], epoch: 37/50, iter: 834/834, loss: 0.26983, top1: 0.70895, throughput: 1312.35 | 2022-04-11 00:25:10.686 [rank:2] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.70912, throughput: 573.68 | 2022-04-11 00:25:21.578 [rank:0] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.71792, throughput: 573.51 | 2022-04-11 00:25:21.583 [rank:7] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.71584, throughput: 573.19 | 2022-04-11 00:25:21.590 [rank:4] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.69904, throughput: 570.49 | 2022-04-11 00:25:21.640 [rank:3] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.70960, throughput: 569.96 | 2022-04-11 00:25:21.651 [rank:6] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.71296, throughput: 566.73 | 2022-04-11 00:25:21.712 [rank:1] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.71664, throughput: 564.58 | 2022-04-11 00:25:21.756 [rank:5] [eval], epoch: 37/50, iter: 125/125, loss: 0.00000, top1: 0.69584, throughput: 556.11 | 2022-04-11 00:25:21.922 [rank:4] [train], epoch: 38/50, iter: 100/834, loss: 0.26584, top1: 0.71417, throughput: 1296.33 | 2022-04-11 00:25:36.451 [rank:5] [train], epoch: 38/50, iter: 100/834, loss: 0.26720, top1: 0.71521, throughput: 1321.52 | 2022-04-11 00:25:36.451 [rank:3] [train], epoch: 38/50, iter: 100/834, loss: 0.26798, top1: 0.71427, throughput: 1297.13 [rank:6] [train], epoch: 38/50, iter: 100/834, loss: 0.26889, top1: 0.71224, throughput: 1302.54| 2022-04-11 00:25:36.453 [rank:0] [train], epoch: 38/50, iter: 100/834, loss: 0.26713, top1: 0.71667, throughput: 1291.32 | 2022-04-11 00:25:36.452| 2022-04-11 00:25:36.452 [rank:1] [train], epoch: 38/50, iter: 100/834, loss: 0.26795, top1: 0.71130, throughput: 1306.10 | 2022-04-11 00:25:36.457 [rank:2] [train], epoch: 38/50, iter: 100/834, loss: 0.26980, top1: 0.70667, throughput: 1290.47 
| 2022-04-11 00:25:36.457 [rank:7] [train], epoch: 38/50, iter: 100/834, loss: 0.26528, top1: 0.71854, throughput: 1291.76 | 2022-04-11 00:25:36.454 [rank:6] [train], epoch: 38/50, iter: 200/834, loss: 0.26518, top1: 0.71401, throughput: 1317.27 | 2022-04-11 00:25:51.028 [rank:5] [train], epoch: 38/50, iter: 200/834, loss: 0.26924, top1: 0.70776, throughput: 1317.06 | 2022-04-11 00:25:51.029 [rank:4] [train], epoch: 38/50, iter: 200/834, loss: 0.26865, top1: 0.71198, throughput: 1317.03 | 2022-04-11 00:25:51.029 [rank:0] [train], epoch: 38/50, iter: 200/834, loss: 0.27160, top1: 0.70708, throughput: 1317.09 | 2022-04-11 00:25:51.030 [rank:1] [train], epoch: 38/50, iter: 200/834, loss: 0.26732, top1: 0.71443, throughput: 1317.33 | 2022-04-11 00:25:51.032 [rank:2] [train], epoch: 38/50, iter: 200/834, loss: 0.26770, top1: 0.71536, throughput: 1317.27 | 2022-04-11 00:25:51.032 [rank:7] [train], epoch: 38/50, iter: 200/834, loss: 0.26550, top1: 0.71771, throughput: 1317.14 | 2022-04-11 00:25:51.031 [rank:3] [train], epoch: 38/50, iter: 200/834, loss: 0.26846, top1: 0.71042, throughput: 1316.78 | 2022-04-11 00:25:51.034 [rank:6] [train], epoch: 38/50, iter: 300/834, loss: 0.26766, top1: 0.71156, throughput: 1314.47 | 2022-04-11 00:26:05.635 [rank:5] [train], epoch: 38/50, iter: 300/834, loss: 0.26669, top1: 0.71776, throughput: 1314.51 | 2022-04-11 00:26:05.635 [rank:2] [train], epoch: 38/50, iter: 300/834, loss: 0.26728, top1: 0.71620, throughput: 1314.74 | 2022-04-11 00:26:05.636 [rank:4] [train], epoch: 38/50, iter: 300/834, loss: 0.26702, top1: 0.71297, throughput: 1314.39 | 2022-04-11 00:26:05.636 [rank:0] [train], epoch: 38/50, iter: 300/834, loss: 0.26898, top1: 0.70875, throughput: 1314.56 | 2022-04-11 00:26:05.635 [rank:3] [train], epoch: 38/50, iter: 300/834, loss: 0.26974, top1: 0.70943, throughput: 1314.80 | 2022-04-11 00:26:05.637 [rank:7] [train], epoch: 38/50, iter: 300/834, loss: 0.26999, top1: 0.71052, throughput: 1314.58 | 2022-04-11 00:26:05.636 
[rank:1] [train], epoch: 38/50, iter: 300/834, loss: 0.27018, top1: 0.70312, throughput: 1314.39 | 2022-04-11 00:26:05.639 [rank:2] [train], epoch: 38/50, iter: 400/834, loss: 0.26719, top1: 0.71318, throughput: 1313.95 | 2022-04-11 00:26:20.248 [rank:5] [train], epoch: 38/50, iter: 400/834, loss: 0.26772, top1: 0.71609, throughput: 1313.89 | 2022-04-11 00:26:20.248 [rank:6] [train], epoch: 38/50, iter: 400/834, loss: 0.26629, top1: 0.71630, throughput: 1313.57 | 2022-04-11 00:26:20.251 [rank:1] [train], epoch: 38/50, iter: 400/834, loss: 0.26792, top1: 0.71099, throughput: 1314.21 | 2022-04-11 00:26:20.249 [rank:4] [train], epoch: 38/50, iter: 400/834, loss: 0.26801, top1: 0.71198, throughput: 1313.96 | 2022-04-11 00:26:20.249 [rank:3] [train], epoch: 38/50, iter: 400/834, loss: 0.27148, top1: 0.70542, throughput: 1313.79 | 2022-04-11 00:26:20.251 [rank:0] [train], epoch: 38/50, iter: 400/834, loss: 0.26983, top1: 0.70547, throughput: 1313.81 | 2022-04-11 00:26:20.249 [rank:7] [train], epoch: 38/50, iter: 400/834, loss: 0.26579, top1: 0.71797, throughput: 1313.62 | 2022-04-11 00:26:20.252 [rank:5] [train], epoch: 38/50, iter: 500/834, loss: 0.26688, top1: 0.71234, throughput: 1312.22 | 2022-04-11 00:26:34.880 [rank:2] [train], epoch: 38/50, iter: 500/834, loss: 0.26870, top1: 0.70823, throughput: 1312.19 | 2022-04-11 00:26:34.880 [rank:6] [train], epoch: 38/50, iter: 500/834, loss: 0.26630, top1: 0.71802, throughput: 1312.45 | 2022-04-11 00:26:34.880 [rank:4] [train], epoch: 38/50, iter: 500/834, loss: 0.26625, top1: 0.71365, throughput: 1312.07 | 2022-04-11 00:26:34.882 [rank:1] [train], epoch: 38/50, iter: 500/834, loss: 0.26749, top1: 0.71208, throughput: 1312.07 | 2022-04-11 00:26:34.882 [rank:7] [train], epoch: 38/50, iter: 500/834, loss: 0.27027, top1: 0.70755, throughput: 1312.44 | 2022-04-11 00:26:34.882 [rank:0] [train], epoch: 38/50, iter: 500/834, loss: 0.26526, top1: 0.71922, throughput: 1312.13 | 2022-04-11 00:26:34.882 [rank:3] [train], epoch: 38/50, 
iter: 500/834, loss: 0.26884, top1: 0.71177, throughput: 1312.23 | 2022-04-11 00:26:34.883 [rank:5] [train], epoch: 38/50, iter: 600/834, loss: 0.26725, top1: 0.71271, throughput: 1314.61 | 2022-04-11 00:26:49.485 [rank:4] [train], epoch: 38/50, iter: 600/834, loss: 0.26836, top1: 0.70974, throughput: 1314.83[rank:6] [train], epoch: 38/50, iter: 600/834, loss: 0.26790, top1: 0.71104, throughput: 1314.51 | 2022-04-11 00:26:49.485 | 2022-04-11 00:26:49.487 [rank:7] [train], epoch: 38/50, iter: 600/834, loss: 0.26939, top1: 0.71026, throughput: 1314.68 | 2022-04-11 00:26:49.486 [rank:2] [train], epoch: 38/50, iter: 600/834, loss: 0.26839, top1: 0.71167, throughput: 1314.48 | 2022-04-11 00:26:49.487 [rank:1] [train], epoch: 38/50, iter: 600/834, loss: 0.27056, top1: 0.70922, throughput: 1314.62 | 2022-04-11 00:26:49.487 [rank:3] [train], epoch: 38/50, iter: 600/834, loss: 0.26607, top1: 0.71823, throughput: 1314.66 | 2022-04-11 00:26:49.488 [rank:0] [train], epoch: 38/50, iter: 600/834, loss: 0.26744, top1: 0.71167, throughput: 1314.48 | 2022-04-11 00:26:49.488 [rank:4] [train], epoch: 38/50, iter: 700/834, loss: 0.26926, top1: 0.70755, throughput: 1313.75 | 2022-04-11 00:27:04.099 [rank:5] [train], epoch: 38/50, iter: 700/834, loss: 0.27139, top1: 0.70427, throughput: 1313.78 | 2022-04-11 00:27:04.099 [rank:3] [train], epoch: 38/50, iter: 700/834, loss: 0.26973, top1: 0.70839, throughput: 1313.81 | 2022-04-11 00:27:04.101 [rank:7] [train], epoch: 38/50, iter: 700/834, loss: 0.27151, top1: 0.70448, throughput: 1313.80 | 2022-04-11 00:27:04.100 [rank:6] [train], epoch: 38/50, iter: 700/834, loss: 0.27122, top1: 0.70307, throughput: 1313.97 | 2022-04-11 00:27:04.099 [rank:0] [train], epoch: 38/50, iter: 700/834, loss: 0.27005, top1: 0.70859, throughput: 1314.00 | 2022-04-11 00:27:04.100 [rank:2] [train], epoch: 38/50, iter: 700/834, loss: 0.26936, top1: 0.70792, throughput: 1313.46 | 2022-04-11 00:27:04.105 [rank:1] [train], epoch: 38/50, iter: 700/834, loss: 0.26548, 
top1: 0.71771, throughput: 1313.35 | 2022-04-11 00:27:04.106 [rank:4] [train], epoch: 38/50, iter: 800/834, loss: 0.26876, top1: 0.70839, throughput: 1312.40 | 2022-04-11 00:27:18.729 [rank:1] [train], epoch: 38/50, iter: 800/834, loss: 0.26888, top1: 0.70870, throughput: 1312.88 | 2022-04-11 00:27:18.730 [rank:3] [train], epoch: 38/50, iter: 800/834, loss: 0.26967, top1: 0.70891, throughput: 1312.41 | 2022-04-11 00:27:18.731 [rank:6] [train], epoch: 38/50, iter: 800/834, loss: 0.26600, top1: 0.71682, throughput: 1312.16 | 2022-04-11 00:27:18.731 [rank:2] [train], epoch: 38/50, iter: 800/834, loss: 0.26900, top1: 0.70609, throughput: 1312.46 | 2022-04-11 00:27:18.734 [rank:5] [train], epoch: 38/50, iter: 800/834, loss: 0.26674, top1: 0.71771, throughput: 1312.18 | 2022-04-11 00:27:18.731 [rank:7] [train], epoch: 38/50, iter: 800/834, loss: 0.26590, top1: 0.71922, throughput: 1312.21 | 2022-04-11 00:27:18.732 [rank:0] [train], epoch: 38/50, iter: 800/834, loss: 0.26557, top1: 0.71750, throughput: 1312.19 | 2022-04-11 00:27:18.732 [rank:1] [train], epoch: 38/50, iter: 834/834, loss: 0.27056, top1: 0.71186, throughput: 1312.92 | 2022-04-11 00:27:23.703 [rank:5] [train], epoch: 38/50, iter: 834/834, loss: 0.26306, top1: 0.72197, throughput: 1313.07 | 2022-04-11 00:27:23.703 [rank:0] [train], epoch: 38/50, iter: 834/834, loss: 0.26260, top1: 0.72396, throughput: 1313.39 | 2022-04-11 00:27:23.703[rank:6] [train], epoch: 38/50, iter: 834/834, loss: 0.26938, top1: 0.70726, throughput: 1313.08 | 2022-04-11 00:27:23.703 [rank:4] [train], epoch: 38/50, iter: 834/834, loss: 0.26808, top1: 0.70619, throughput: 1312.51 | 2022-04-11 00:27:23.703 [rank:7] [train], epoch: 38/50, iter: 834/834, loss: 0.26987, top1: 0.70787, throughput: 1313.02 | 2022-04-11 00:27:23.704 [rank:2] [train], epoch: 38/50, iter: 834/834, loss: 0.26748, top1: 0.71308, throughput: 1313.58 | 2022-04-11 00:27:23.703 [rank:3] [train], epoch: 38/50, iter: 834/834, loss: 0.26612, top1: 0.71599, throughput: 
1312.10 | 2022-04-11 00:27:23.706 [rank:7] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.72192, throughput: 582.79 | 2022-04-11 00:27:34.428 [rank:0] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.72048, throughput: 582.65 | 2022-04-11 00:27:34.430 [rank:2] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.70944, throughput: 581.99 | 2022-04-11 00:27:34.442 [rank:4] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.71008, throughput: 581.04 | 2022-04-11 00:27:34.459 [rank:6] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.71552, throughput: 578.22 | 2022-04-11 00:27:34.512 [rank:3] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.71712, throughput: 575.03 | 2022-04-11 00:27:34.575 [rank:1] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.72256, throughput: 567.41 | 2022-04-11 00:27:34.718 [rank:5] [eval], epoch: 38/50, iter: 125/125, loss: 0.00000, top1: 0.69472, throughput: 566.48 | 2022-04-11 00:27:34.736 [rank:6] [train], epoch: 39/50, iter: 100/834, loss: 0.26346, top1: 0.72656, throughput: 1300.66 | 2022-04-11 00:27:49.274 [rank:5] [train], epoch: 39/50, iter: 100/834, loss: 0.26565, top1: 0.71594, throughput: 1320.65 | 2022-04-11 00:27:49.274 [rank:2] [train], epoch: 39/50, iter: 100/834, loss: 0.25963, top1: 0.73021, throughput: 1294.45 | 2022-04-11 00:27:49.275 [rank:1] [train], epoch: 39/50, iter: 100/834, loss: 0.26444, top1: 0.71917, throughput: 1318.91 | 2022-04-11 00:27:49.275 [rank:0] [train], epoch: 39/50, iter: 100/834, loss: 0.26457, top1: 0.72057, throughput: 1293.34 | 2022-04-11 00:27:49.275 [rank:3] [train], epoch: 39/50, iter: 100/834, loss: 0.26085, top1: 0.73057, throughput: 1305.93 | 2022-04-11 00:27:49.278 [rank:4] [train], epoch: 39/50, iter: 100/834, loss: 0.26300, top1: 0.72000, throughput: 1295.88 | 2022-04-11 00:27:49.276 [rank:7] [train], epoch: 39/50, iter: 100/834, loss: 0.26112, top1: 0.72786, throughput: 1293.00 | 2022-04-11 00:27:49.277 [rank:6] 
[train], epoch: 39/50, iter: 200/834, loss: 0.26314, top1: 0.72297, throughput: 1317.40 | 2022-04-11 00:28:03.848 [rank:4] [train], epoch: 39/50, iter: 200/834, loss: 0.26294, top1: 0.72151, throughput: 1317.58 | 2022-04-11 00:28:03.848 [rank:7] [train], epoch: 39/50, iter: 200/834, loss: 0.26334, top1: 0.72276, throughput: 1317.66[rank:1] [train], epoch: 39/50, iter: 200/834, loss: 0.26139, top1: 0.72260, throughput: 1317.48 | 2022-04-11 00:28:03.848| 2022-04-11 00:28:03.848 [rank:2] [train], epoch: 39/50, iter: 200/834, loss: 0.26208, top1: 0.72266, throughput: 1317.39 | 2022-04-11 00:28:03.849 [rank:3] [train], epoch: 39/50, iter: 200/834, loss: 0.26298, top1: 0.72172, throughput: 1317.54 | 2022-04-11 00:28:03.850 [rank:5] [train], epoch: 39/50, iter: 200/834, loss: 0.26189, top1: 0.72635, throughput: 1317.27 | 2022-04-11 00:28:03.850 [rank:0] [train], epoch: 39/50, iter: 200/834, loss: 0.26289, top1: 0.71932, throughput: 1317.38 | 2022-04-11 00:28:03.849 [rank:2] [train], epoch: 39/50, iter: 300/834, loss: 0.26426, top1: 0.72187, throughput: 1316.23 | 2022-04-11 00:28:18.436 [rank:5] [train], epoch: 39/50, iter: 300/834, loss: 0.26332, top1: 0.72344, throughput: 1316.30 | 2022-04-11 00:28:18.436 [rank:4] [train], epoch: 39/50, iter: 300/834, loss: 0.26142, top1: 0.72635, throughput: 1316.13 | 2022-04-11 00:28:18.436 [rank:1] [train], epoch: 39/50, iter: 300/834, loss: 0.26546, top1: 0.71865, throughput: 1316.03 | 2022-04-11 00:28:18.438 [rank:6] [train], epoch: 39/50, iter: 300/834, loss: 0.26165, top1: 0.72484, throughput: 1315.95 | 2022-04-11 00:28:18.438 [rank:7] [train], epoch: 39/50, iter: 300/834, loss: 0.26412, top1: 0.71995, throughput: 1316.00 | 2022-04-11 00:28:18.438 [rank:3] [train], epoch: 39/50, iter: 300/834, loss: 0.26624, top1: 0.71589, throughput: 1316.01 | 2022-04-11 00:28:18.440 [rank:0] [train], epoch: 39/50, iter: 300/834, loss: 0.26448, top1: 0.72297, throughput: 1316.02 | 2022-04-11 00:28:18.439 [rank:6] [train], epoch: 39/50, iter: 
400/834, loss: 0.26362, top1: 0.72047, throughput: 1316.96 | 2022-04-11 00:28:33.017 [rank:2] [train], epoch: 39/50, iter: 400/834, loss: 0.26331, top1: 0.72198, throughput: 1316.61 | 2022-04-11 00:28:33.019 [rank:5] [train], epoch: 39/50, iter: 400/834, loss: 0.26315, top1: 0.72182, throughput: 1316.77 | 2022-04-11 00:28:33.017 [rank:4] [train], epoch: 39/50, iter: 400/834, loss: 0.26492, top1: 0.71771, throughput: 1316.71 | 2022-04-11 00:28:33.018 [rank:0] [train], epoch: 39/50, iter: 400/834, loss: 0.26275, top1: 0.72703, throughput: 1316.85 | 2022-04-11 00:28:33.019 [rank:3] [train], epoch: 39/50, iter: 400/834, loss: 0.26382, top1: 0.71958, throughput: 1316.63 | 2022-04-11 00:28:33.023 [rank:7] [train], epoch: 39/50, iter: 400/834, loss: 0.26189, top1: 0.72885, throughput: 1316.63 | 2022-04-11 00:28:33.021 [rank:1] [train], epoch: 39/50, iter: 400/834, loss: 0.26438, top1: 0.72068, throughput: 1316.55 | 2022-04-11 00:28:33.021 [rank:6] [train], epoch: 39/50, iter: 500/834, loss: 0.26356, top1: 0.72219, throughput: 1313.78 | 2022-04-11 00:28:47.631 [rank:4] [train], epoch: 39/50, iter: 500/834, loss: 0.26411, top1: 0.71859, throughput: 1313.96 | 2022-04-11 00:28:47.630 [rank:5] [train], epoch: 39/50, iter: 500/834, loss: 0.26250, top1: 0.71750, throughput: 1313.82 | 2022-04-11 00:28:47.631 [rank:1] [train], epoch: 39/50, iter: 500/834, loss: 0.26546, top1: 0.71490, throughput: 1314.07 | 2022-04-11 00:28:47.632 [rank:3] [train], epoch: 39/50, iter: 500/834, loss: 0.26492, top1: 0.71849, throughput: 1314.17 | 2022-04-11 00:28:47.633 [rank:2] [train], epoch: 39/50, iter: 500/834, loss: 0.26263, top1: 0.72484, throughput: 1313.95 | 2022-04-11 00:28:47.632 [rank:0] [train], epoch: 39/50, iter: 500/834, loss: 0.26429, top1: 0.71833, throughput: 1313.84 | 2022-04-11 00:28:47.632 [rank:7] [train], epoch: 39/50, iter: 500/834, loss: 0.26416, top1: 0.71812, throughput: 1313.94 | 2022-04-11 00:28:47.633 [rank:5] [train], epoch: 39/50, iter: 600/834, loss: 0.26544, top1: 
0.71849, throughput: 1304.13 | 2022-04-11 00:29:02.353 [rank:2] [train], epoch: 39/50, iter: 600/834, loss: 0.26540, top1: 0.71797, throughput: 1303.93 | 2022-04-11 00:29:02.356 [rank:7] [train], epoch: 39/50, iter: 600/834, loss: 0.26524, top1: 0.71516, throughput: 1304.15 | 2022-04-11 00:29:02.356 [rank:4] [train], epoch: 39/50, iter: 600/834, loss: 0.26292, top1: 0.72302, throughput: 1303.95 | 2022-04-11 00:29:02.355 [rank:3] [train], epoch: 39/50, iter: 600/834, loss: 0.26497, top1: 0.72068, throughput: 1304.10 | 2022-04-11 00:29:02.355 [rank:6] [train], epoch: 39/50, iter: 600/834, loss: 0.26382, top1: 0.72188, throughput: 1304.05 | 2022-04-11 00:29:02.355 [rank:0] [train], epoch: 39/50, iter: 600/834, loss: 0.26619, top1: 0.72094, throughput: 1304.10 | 2022-04-11 00:29:02.355 [rank:1] [train], epoch: 39/50, iter: 600/834, loss: 0.26458, top1: 0.71953, throughput: 1304.06 | 2022-04-11 00:29:02.356 [rank:4] [train], epoch: 39/50, iter: 700/834, loss: 0.26276, top1: 0.72385, throughput: 1316.19 | 2022-04-11 00:29:16.942 [rank:6] [train], epoch: 39/50, iter: 700/834, loss: 0.26285, top1: 0.72042, throughput: 1316.22 | 2022-04-11 00:29:16.942 [rank:3] [train], epoch: 39/50, iter: 700/834, loss: 0.26339, top1: 0.72057, throughput: 1315.98 | 2022-04-11 00:29:16.945 [rank:1] [train], epoch: 39/50, iter: 700/834, loss: 0.26157, top1: 0.72589, throughput: 1316.09 | 2022-04-11 00:29:16.944 [rank:2] [train], epoch: 39/50, iter: 700/834, loss: 0.26278, top1: 0.72182, throughput: 1316.26 | 2022-04-11 00:29:16.943 [rank:5] [train], epoch: 39/50, iter: 700/834, loss: 0.26319, top1: 0.72208, throughput: 1315.98 | 2022-04-11 00:29:16.943 [rank:0] [train], epoch: 39/50, iter: 700/834, loss: 0.26611, top1: 0.71599, throughput: 1315.93 | 2022-04-11 00:29:16.946 [rank:7] [train], epoch: 39/50, iter: 700/834, loss: 0.26302, top1: 0.72427, throughput: 1316.06 | 2022-04-11 00:29:16.945 [rank:2] [train], epoch: 39/50, iter: 800/834, loss: 0.26481, top1: 0.71802, throughput: 1317.06 | 
2022-04-11 00:29:31.521 [rank:5] [train], epoch: 39/50, iter: 800/834, loss: 0.26315, top1: 0.72208, throughput: 1317.09 | 2022-04-11 00:29:31.521 [rank:7] [train], epoch: 39/50, iter: 800/834, loss: 0.26297, top1: 0.72229, throughput: 1317.06 | 2022-04-11 00:29:31.522 [rank:1] [train], epoch: 39/50, iter: 800/834, loss: 0.26275, top1: 0.72182, throughput: 1317.05 | 2022-04-11 00:29:31.522 [rank:4] [train], epoch: 39/50, iter: 800/834, loss: 0.26167, top1: 0.72542, throughput: 1316.76 | 2022-04-11 00:29:31.523 [rank:0] [train], epoch: 39/50, iter: 800/834, loss: 0.26305, top1: 0.72083, throughput: 1317.08 | 2022-04-11 00:29:31.523 [rank:6] [train], epoch: 39/50, iter: 800/834, loss: 0.26214, top1: 0.72573, throughput: 1316.80 | 2022-04-11 00:29:31.523 [rank:3] [train], epoch: 39/50, iter: 800/834, loss: 0.26275, top1: 0.72099, throughput: 1316.82 | 2022-04-11 00:29:31.526 [rank:4] [train], epoch: 39/50, iter: 834/834, loss: 0.26148, top1: 0.72151, throughput: 1312.50 | 2022-04-11 00:29:36.497 [rank:6] [train], epoch: 39/50, iter: 834/834, loss: 0.26366, top1: 0.71952, throughput: 1312.20 | 2022-04-11 00:29:36.498 [rank:7] [train], epoch: 39/50, iter: 834/834, loss: 0.26039, top1: 0.72549, throughput: 1312.04 | 2022-04-11 00:29:36.498 [rank:0] [train], epoch: 39/50, iter: 834/834, loss: 0.26412, top1: 0.72181, throughput: 1312.32 | 2022-04-11 00:29:36.498 [rank:5] [train], epoch: 39/50, iter: 834/834, loss: 0.26357, top1: 0.72595, throughput: 1311.61 | 2022-04-11 00:29:36.498 [rank:1] [train], epoch: 39/50, iter: 834/834, loss: 0.26291, top1: 0.71722, throughput: 1311.54 | 2022-04-11 00:29:36.500 [rank:2] [train], epoch: 39/50, iter: 834/834, loss: 0.26177, top1: 0.72963, throughput: 1311.23 | 2022-04-11 00:29:36.500 [rank:3] [train], epoch: 39/50, iter: 834/834, loss: 0.26427, top1: 0.72258, throughput: 1312.07 | 2022-04-11 00:29:36.501 [rank:2] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.71952, throughput: 573.77 | 2022-04-11 00:29:47.393 [rank:7] 
[eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.73392, throughput: 573.66 | 2022-04-11 00:29:47.393 [rank:0] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.73504, throughput: 573.58 | 2022-04-11 00:29:47.394 [rank:4] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.72800, throughput: 572.85 | 2022-04-11 00:29:47.407 [rank:6] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.72640, throughput: 570.45 | 2022-04-11 00:29:47.454 [rank:3] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.72608, throughput: 564.85 | 2022-04-11 00:29:47.566 [rank:5] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.71840, throughput: 560.30 | 2022-04-11 00:29:47.653 [rank:1] [eval], epoch: 39/50, iter: 125/125, loss: 0.00000, top1: 0.73488, throughput: 555.74 | 2022-04-11 00:29:47.746 [rank:6] [train], epoch: 40/50, iter: 100/834, loss: 0.25808, top1: 0.73188, throughput: 1295.66 | 2022-04-11 00:30:02.273 [rank:2] [train], epoch: 40/50, iter: 100/834, loss: 0.26033, top1: 0.72443, throughput: 1290.06 | 2022-04-11 00:30:02.276 [rank:1] [train], epoch: 40/50, iter: 100/834, loss: 0.25720, top1: 0.73641, throughput: 1321.50[rank:4] [train], epoch: 40/50, iter: 100/834, loss: 0.25860, top1: 0.73036, throughput: 1291.52 | 2022-04-11 00:30:02.274 | 2022-04-11 00:30:02.275 [rank:5] [train], epoch: 40/50, iter: 100/834, loss: 0.25844, top1: 0.73396, throughput: 1313.13 | 2022-04-11 00:30:02.274 [rank:3] [train], epoch: 40/50, iter: 100/834, loss: 0.25647, top1: 0.73510, throughput: 1305.26 | 2022-04-11 00:30:02.276 [rank:0] [train], epoch: 40/50, iter: 100/834, loss: 0.25724, top1: 0.73438, throughput: 1290.24 | 2022-04-11 00:30:02.275 [rank:7] [train], epoch: 40/50, iter: 100/834, loss: 0.25770, top1: 0.73333, throughput: 1290.12 | 2022-04-11 00:30:02.275 [rank:6] [train], epoch: 40/50, iter: 200/834, loss: 0.25939, top1: 0.73099, throughput: 1314.45 | 2022-04-11 00:30:16.879 [rank:4] [train], epoch: 40/50, iter: 200/834, loss: 
0.25909, top1: 0.72990, throughput: 1314.45 | 2022-04-11 00:30:16.881 [rank:1] [train], epoch: 40/50, iter: 200/834, loss: 0.25855, top1: 0.73406, throughput: 1314.40 | 2022-04-11 00:30:16.882 [rank:3] [train], epoch: 40/50, iter: 200/834, loss: 0.25884, top1: 0.72990, throughput: 1314.37 | 2022-04-11 00:30:16.883 [rank:5] [train], epoch: 40/50, iter: 200/834, loss: 0.26017, top1: 0.72490, throughput: 1314.28 | 2022-04-11 00:30:16.883 [rank:7] [train], epoch: 40/50, iter: 200/834, loss: 0.26148, top1: 0.72620, throughput: 1314.34 | 2022-04-11 00:30:16.883 [rank:2] [train], epoch: 40/50, iter: 200/834, loss: 0.25901, top1: 0.73177, throughput: 1314.33 | 2022-04-11 00:30:16.884 [rank:0] [train], epoch: 40/50, iter: 200/834, loss: 0.25985, top1: 0.72979, throughput: 1314.34 | 2022-04-11 00:30:16.883 [rank:6] [train], epoch: 40/50, iter: 300/834, loss: 0.25903, top1: 0.72781, throughput: 1312.61 | 2022-04-11 00:30:31.507 [rank:4] [train], epoch: 40/50, iter: 300/834, loss: 0.26002, top1: 0.73125, throughput: 1312.53 | 2022-04-11 00:30:31.509 [rank:2] [train], epoch: 40/50, iter: 300/834, loss: 0.25910, top1: 0.73167, throughput: 1312.91 | 2022-04-11 00:30:31.508 [rank:5] [train], epoch: 40/50, iter: 300/834, loss: 0.25979, top1: 0.73229, throughput: 1312.62 | 2022-04-11 00:30:31.510 [rank:3] [train], epoch: 40/50, iter: 300/834, loss: 0.25649, top1: 0.73536, throughput: 1312.37 | 2022-04-11 00:30:31.513 [rank:1] [train], epoch: 40/50, iter: 300/834, loss: 0.26009, top1: 0.73151, throughput: 1312.62 | 2022-04-11 00:30:31.510 [rank:7] [train], epoch: 40/50, iter: 300/834, loss: 0.26074, top1: 0.72547, throughput: 1312.54 | 2022-04-11 00:30:31.511 [rank:0] [train], epoch: 40/50, iter: 300/834, loss: 0.25717, top1: 0.73318, throughput: 1312.42 | 2022-04-11 00:30:31.513 [rank:2] [train], epoch: 40/50, iter: 400/834, loss: 0.25925, top1: 0.72943, throughput: 1314.42 | 2022-04-11 00:30:46.115 [rank:6] [train], epoch: 40/50, iter: 400/834, loss: 0.25839, top1: 0.73297, 
throughput: 1314.26 | 2022-04-11 00:30:46.116 [rank:1] [train], epoch: 40/50, iter: 400/834, loss: 0.25987, top1: 0.72786, throughput: 1314.50 | 2022-04-11 00:30:46.116 [rank:4] [train], epoch: 40/50, iter: 400/834, loss: 0.25934, top1: 0.73323, throughput: 1314.55 | 2022-04-11 00:30:46.115 [rank:5] [train], epoch: 40/50, iter: 400/834, loss: 0.25972, top1: 0.73042, throughput: 1314.45 | 2022-04-11 00:30:46.117 [rank:3] [train], epoch: 40/50, iter: 400/834, loss: 0.25645, top1: 0.73937, throughput: 1314.71 | 2022-04-11 00:30:46.117 [rank:7] [train], epoch: 40/50, iter: 400/834, loss: 0.25921, top1: 0.73146, throughput: 1314.56 | 2022-04-11 00:30:46.117 [rank:0] [train], epoch: 40/50, iter: 400/834, loss: 0.25952, top1: 0.72922, throughput: 1314.54 | 2022-04-11 00:30:46.119 [rank:5] [train], epoch: 40/50, iter: 500/834, loss: 0.25676, top1: 0.73193, throughput: 1316.85 | 2022-04-11 00:31:00.697 [rank:4] [train], epoch: 40/50, iter: 500/834, loss: 0.25926, top1: 0.73047, throughput: 1316.74 | 2022-04-11 00:31:00.696 [rank:6] [train], epoch: 40/50, iter: 500/834, loss: 0.25867, top1: 0.73130, throughput: 1316.83 | 2022-04-11 00:31:00.696 [rank:7] [train], epoch: 40/50, iter: 500/834, loss: 0.25876, top1: 0.73208, throughput: 1316.85 | 2022-04-11 00:31:00.697 [rank:2] [train], epoch: 40/50, iter: 500/834, loss: 0.25969, top1: 0.72693, throughput: 1316.66 | 2022-04-11 00:31:00.697 [rank:3] [train], epoch: 40/50, iter: 500/834, loss: 0.25799, top1: 0.73391, throughput: 1316.78 | 2022-04-11 00:31:00.698 [rank:0] [train], epoch: 40/50, iter: 500/834, loss: 0.26058, top1: 0.72833, throughput: 1316.99 | 2022-04-11 00:31:00.697 [rank:1] [train], epoch: 40/50, iter: 500/834, loss: 0.26083, top1: 0.72740, throughput: 1316.55 | 2022-04-11 00:31:00.699 [rank:2] [train], epoch: 40/50, iter: 600/834, loss: 0.26000, top1: 0.72995, throughput: 1315.80 | 2022-04-11 00:31:15.289 [rank:5] [train], epoch: 40/50, iter: 600/834, loss: 0.25925, top1: 0.72708, throughput: 1315.97 | 
2022-04-11 00:31:15.287 [rank:4] [train], epoch: 40/50, iter: 600/834, loss: 0.25915, top1: 0.72979, throughput: 1315.81 | 2022-04-11 00:31:15.288 [rank:1] [train], epoch: 40/50, iter: 600/834, loss: 0.25687, top1: 0.73484, throughput: 1316.05 | 2022-04-11 00:31:15.289 [rank:7] [train], epoch: 40/50, iter: 600/834, loss: 0.25966, top1: 0.72943, throughput: 1315.85 | 2022-04-11 00:31:15.289 [rank:3] [train], epoch: 40/50, iter: 600/834, loss: 0.25799, top1: 0.73615, throughput: 1315.91 | 2022-04-11 00:31:15.289 [rank:0] [train], epoch: 40/50, iter: 600/834, loss: 0.25974, top1: 0.72979, throughput: 1315.89 | 2022-04-11 00:31:15.288 [rank:6] [train], epoch: 40/50, iter: 600/834, loss: 0.25812, top1: 0.73146, throughput: 1315.68 | 2022-04-11 00:31:15.289 [rank:5] [train], epoch: 40/50, iter: 700/834, loss: 0.25868, top1: 0.73094, throughput: 1318.00 | 2022-04-11 00:31:29.855 [rank:2] [train], epoch: 40/50, iter: 700/834, loss: 0.26097, top1: 0.72823, throughput: 1318.11 | 2022-04-11 00:31:29.856 [rank:7] [train], epoch: 40/50, iter: 700/834, loss: 0.25782, top1: 0.72995, throughput: 1318.03 | 2022-04-11 00:31:29.856 [rank:4] [train], epoch: 40/50, iter: 700/834, loss: 0.25755, top1: 0.73339, throughput: 1317.88 | 2022-04-11 00:31:29.857 [rank:6] [train], epoch: 40/50, iter: 700/834, loss: 0.25668, top1: 0.73948, throughput: 1318.04 | 2022-04-11 00:31:29.856 [rank:3] [train], epoch: 40/50, iter: 700/834, loss: 0.25929, top1: 0.72911, throughput: 1317.75 | 2022-04-11 00:31:29.859 [rank:1] [train], epoch: 40/50, iter: 700/834, loss: 0.25948, top1: 0.72781, throughput: 1317.47 | 2022-04-11 00:31:29.862 [rank:0] [train], epoch: 40/50, iter: 700/834, loss: 0.25890, top1: 0.73115, throughput: 1317.47 | 2022-04-11 00:31:29.862 [rank:3] [train], epoch: 40/50, iter: 800/834, loss: 0.25590, top1: 0.73724, throughput: 1315.80 | 2022-04-11 00:31:44.451 [rank:2] [train], epoch: 40/50, iter: 800/834, loss: 0.25822, top1: 0.72870, throughput: 1315.49 | 2022-04-11 00:31:44.451 
[rank:6] [train], epoch: 40/50, iter: 800/834, loss: 0.26240, top1: 0.72401, throughput: 1315.66 | 2022-04-11 00:31:44.450 [rank:4] [train], epoch: 40/50, iter: 800/834, loss: 0.26006, top1: 0.73125, throughput: 1315.66 | 2022-04-11 00:31:44.450 [rank:5] [train], epoch: 40/50, iter: 800/834, loss: 0.26075, top1: 0.72698, throughput: 1315.33 | 2022-04-11 00:31:44.452 [rank:1] [train], epoch: 40/50, iter: 800/834, loss: 0.25849, top1: 0.73438, throughput: 1315.99 | 2022-04-11 00:31:44.452 [rank:0] [train], epoch: 40/50, iter: 800/834, loss: 0.25910, top1: 0.73198, throughput: 1315.97 | 2022-04-11 00:31:44.452 [rank:7] [train], epoch: 40/50, iter: 800/834, loss: 0.25763, top1: 0.72911, throughput: 1315.32 | 2022-04-11 00:31:44.453 [rank:4] [train], epoch: 40/50, iter: 834/834, loss: 0.26197, top1: 0.72426, throughput: 1309.74 | 2022-04-11 00:31:49.434 [rank:5] [train], epoch: 40/50, iter: 834/834, loss: 0.26064, top1: 0.72641, throughput: 1310.11 | 2022-04-11 00:31:49.435 [rank:2] [train], epoch: 40/50, iter: 834/834, loss: 0.26191, top1: 0.72672, throughput: 1309.57 | 2022-04-11 00:31:49.436 [rank:6] [train], epoch: 40/50, iter: 834/834, loss: 0.25898, top1: 0.72580, throughput: 1309.00 | 2022-04-11 00:31:49.437 [rank:0] [train], epoch: 40/50, iter: 834/834, loss: 0.25920, top1: 0.72840, throughput: 1309.19 | 2022-04-11 00:31:49.438 [rank:7] [train], epoch: 40/50, iter: 834/834, loss: 0.25688, top1: 0.73192, throughput: 1309.61 | 2022-04-11 00:31:49.438 [rank:1] [train], epoch: 40/50, iter: 834/834, loss: 0.25931, top1: 0.73284, throughput: 1309.06 | 2022-04-11 00:31:49.439 [rank:3] [train], epoch: 40/50, iter: 834/834, loss: 0.25499, top1: 0.73866, throughput: 1308.89 | 2022-04-11 00:31:49.439 [rank:0] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.72976, throughput: 584.98 | 2022-04-11 00:32:00.122 [rank:7] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.73888, throughput: 584.71 | 2022-04-11 00:32:00.127 [rank:2] [eval], epoch: 40/50, iter: 
125/125, loss: 0.00000, top1: 0.72544, throughput: 582.88 | 2022-04-11 00:32:00.158 [rank:4] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.72656, throughput: 581.71 | 2022-04-11 00:32:00.178 [rank:6] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.73904, throughput: 580.17 | 2022-04-11 00:32:00.210 [rank:5] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.72144, throughput: 575.01 | 2022-04-11 00:32:00.304 [rank:3] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.72672, throughput: 574.32 | 2022-04-11 00:32:00.321 [rank:1] [eval], epoch: 40/50, iter: 125/125, loss: 0.00000, top1: 0.73936, throughput: 569.61 | 2022-04-11 00:32:00.411 [rank:4] [train], epoch: 41/50, iter: 100/834, loss: 0.25370, top1: 0.74052, throughput: 1301.42 | 2022-04-11 00:32:14.932 [rank:5] [train], epoch: 41/50, iter: 100/834, loss: 0.25482, top1: 0.74313, throughput: 1312.57 | 2022-04-11 00:32:14.932 [rank:6] [train], epoch: 41/50, iter: 100/834, loss: 0.25600, top1: 0.74036, throughput: 1303.99 | 2022-04-11 00:32:14.934 [rank:2] [train], epoch: 41/50, iter: 100/834, loss: 0.25150, top1: 0.74557, throughput: 1299.57 | 2022-04-11 00:32:14.932 [rank:3] [train], epoch: 41/50, iter: 100/834, loss: 0.25403, top1: 0.74328, throughput: 1313.91 | 2022-04-11 00:32:14.934 [rank:7] [train], epoch: 41/50, iter: 100/834, loss: 0.25504, top1: 0.73714, throughput: 1296.74 | 2022-04-11 00:32:14.933 [rank:1] [train], epoch: 41/50, iter: 100/834, loss: 0.25566, top1: 0.73781, throughput: 1321.66 | 2022-04-11 00:32:14.938 [rank:0] [train], epoch: 41/50, iter: 100/834, loss: 0.25516, top1: 0.73760, throughput: 1295.98 | 2022-04-11 00:32:14.937 [rank:4] [train], epoch: 41/50, iter: 200/834, loss: 0.25545, top1: 0.74057, throughput: 1314.79 | 2022-04-11 00:32:29.535 [rank:2] [train], epoch: 41/50, iter: 200/834, loss: 0.25678, top1: 0.73562, throughput: 1314.73 | 2022-04-11 00:32:29.536 [rank:3] [train], epoch: 41/50, iter: 200/834, loss: 0.25457, top1: 0.74016, 
throughput: 1314.74 | 2022-04-11 00:32:29.538 [rank:0] [train], epoch: 41/50, iter: 200/834, loss: 0.25599, top1: 0.73417, throughput: 1314.97 | 2022-04-11 00:32:29.538 [rank:1] [train], epoch: 41/50, iter: 200/834, loss: 0.25511, top1: 0.73828, throughput: 1315.18 | 2022-04-11 00:32:29.537 [rank:5] [train], epoch: 41/50, iter: 200/834, loss: 0.25592, top1: 0.73635, throughput: 1314.75 | 2022-04-11 00:32:29.535 [rank:7] [train], epoch: 41/50, iter: 200/834, loss: 0.25469, top1: 0.73844, throughput: 1314.61 | 2022-04-11 00:32:29.538 [rank:6] [train], epoch: 41/50, iter: 200/834, loss: 0.25130, top1: 0.74990, throughput: 1314.65 | 2022-04-11 00:32:29.538 [rank:6] [train], epoch: 41/50, iter: 300/834, loss: 0.25587, top1: 0.73719, throughput: 1308.71 | 2022-04-11 00:32:44.209 [rank:2] [train], epoch: 41/50, iter: 300/834, loss: 0.25426, top1: 0.74214, throughput: 1308.49 | 2022-04-11 00:32:44.210 [rank:4] [train], epoch: 41/50, iter: 300/834, loss: 0.25475, top1: 0.74109, throughput: 1308.33 | 2022-04-11 00:32:44.210 [rank:1] [train], epoch: 41/50, iter: 300/834, loss: 0.25444, top1: 0.73766, throughput: 1308.48 | 2022-04-11 00:32:44.210 [rank:3] [train], epoch: 41/50, iter: 300/834, loss: 0.25380, top1: 0.74089, throughput: 1308.53 | 2022-04-11 00:32:44.211 [rank:7] [train], epoch: 41/50, iter: 300/834, loss: 0.25506, top1: 0.73755, throughput: 1308.69 | 2022-04-11 00:32:44.210 [rank:5] [train], epoch: 41/50, iter: 300/834, loss: 0.25762, top1: 0.73490, throughput: 1308.36 | 2022-04-11 00:32:44.210 [rank:0] [train], epoch: 41/50, iter: 300/834, loss: 0.25397, top1: 0.73833, throughput: 1308.43 | 2022-04-11 00:32:44.212 [rank:6] [train], epoch: 41/50, iter: 400/834, loss: 0.25522, top1: 0.74094, throughput: 1317.33 | 2022-04-11 00:32:58.784 [rank:5] [train], epoch: 41/50, iter: 400/834, loss: 0.25622, top1: 0.73583, throughput: 1317.36 | 2022-04-11 00:32:58.785 [rank:7] [train], epoch: 41/50, iter: 400/834, loss: 0.25190, top1: 0.74375, throughput: 1317.31 | 
2022-04-11 00:32:58.785 [rank:2] [train], epoch: 41/50, iter: 400/834, loss: 0.25447, top1: 0.74031, throughput: 1317.28 | 2022-04-11 00:32:58.785 [rank:3] [train], epoch: 41/50, iter: 400/834, loss: 0.25518, top1: 0.73760, throughput: 1317.27 | 2022-04-11 00:32:58.786 [rank:1] [train], epoch: 41/50, iter: 400/834, loss: 0.25643, top1: 0.73724, throughput: 1317.25 | 2022-04-11 00:32:58.786 [rank:4] [train], epoch: 41/50, iter: 400/834, loss: 0.25365, top1: 0.74057, throughput: 1317.22 | 2022-04-11 00:32:58.786 [rank:0] [train], epoch: 41/50, iter: 400/834, loss: 0.25513, top1: 0.74120, throughput: 1317.42 | 2022-04-11 00:32:58.786 [rank:5] [train], epoch: 41/50, iter: 500/834, loss: 0.25366, top1: 0.74172, throughput: 1314.76 | 2022-04-11 00:33:13.388 [rank:3] [train], epoch: 41/50, iter: 500/834, loss: 0.25351, top1: 0.74187, throughput: 1314.80 | 2022-04-11 00:33:13.389 [rank:4] [train], epoch: 41/50, iter: 500/834, loss: 0.25460, top1: 0.74115, throughput: 1315.06 | 2022-04-11 00:33:13.386 [rank:6] [train], epoch: 41/50, iter: 500/834, loss: 0.25465, top1: 0.73844, throughput: 1314.77 | 2022-04-11 00:33:13.387 [rank:1] [train], epoch: 41/50, iter: 500/834, loss: 0.25488, top1: 0.74115, throughput: 1314.95 | 2022-04-11 00:33:13.388 [rank:0] [train], epoch: 41/50, iter: 500/834, loss: 0.25435, top1: 0.73974, throughput: 1315.03 | 2022-04-11 00:33:13.387 [rank:2] [train], epoch: 41/50, iter: 500/834, loss: 0.25482, top1: 0.73932, throughput: 1314.78 | 2022-04-11 00:33:13.388 [rank:7] [train], epoch: 41/50, iter: 500/834, loss: 0.25470, top1: 0.73901, throughput: 1314.61 | 2022-04-11 00:33:13.390 [rank:6] [train], epoch: 41/50, iter: 600/834, loss: 0.25496, top1: 0.73682, throughput: 1315.26 | 2022-04-11 00:33:27.985 [rank:2] [train], epoch: 41/50, iter: 600/834, loss: 0.25387, top1: 0.74089, throughput: 1315.29 | 2022-04-11 00:33:27.986 [rank:7] [train], epoch: 41/50, iter: 600/834, loss: 0.25544, top1: 0.73802, throughput: 1315.42[rank:5] [train], epoch: 41/50, 
iter: 600/834, loss: 0.25340, top1: 0.73984, throughput: 1315.35 | 2022-04-11 00:33:27.986 | 2022-04-11 00:33:27.985 [rank:1] [train], epoch: 41/50, iter: 600/834, loss: 0.25463, top1: 0.73766, throughput: 1315.18 | 2022-04-11 00:33:27.986 [rank:4] [train], epoch: 41/50, iter: 600/834, loss: 0.25522, top1: 0.73672, throughput: 1315.12 | 2022-04-11 00:33:27.986 [rank:0] [train], epoch: 41/50, iter: 600/834, loss: 0.25728, top1: 0.73552, throughput: 1315.13 | 2022-04-11 00:33:27.986 [rank:3] [train], epoch: 41/50, iter: 600/834, loss: 0.25617, top1: 0.73719, throughput: 1314.99 | 2022-04-11 00:33:27.990 [rank:4] [train], epoch: 41/50, iter: 700/834, loss: 0.25297, top1: 0.74276, throughput: 1311.68 | 2022-04-11 00:33:42.623 [rank:5] [train], epoch: 41/50, iter: 700/834, loss: 0.25191, top1: 0.74448, throughput: 1311.65 | 2022-04-11 00:33:42.623 [rank:1] [train], epoch: 41/50, iter: 700/834, loss: 0.25599, top1: 0.73724, throughput: 1311.79 | 2022-04-11 00:33:42.623 [rank:3] [train], epoch: 41/50, iter: 700/834, loss: 0.25407, top1: 0.73917, throughput: 1312.08 | 2022-04-11 00:33:42.623 [rank:0] [train], epoch: 41/50, iter: 700/834, loss: 0.25432, top1: 0.73792, throughput: 1311.71 | 2022-04-11 00:33:42.623 [rank:6] [train], epoch: 41/50, iter: 700/834, loss: 0.25472, top1: 0.74156, throughput: 1311.66 | 2022-04-11 00:33:42.623 [rank:2] [train], epoch: 41/50, iter: 700/834, loss: 0.25385, top1: 0.74219, throughput: 1311.58 | 2022-04-11 00:33:42.625 [rank:7] [train], epoch: 41/50, iter: 700/834, loss: 0.25498, top1: 0.73734, throughput: 1311.50 | 2022-04-11 00:33:42.626 [rank:2] [train], epoch: 41/50, iter: 800/834, loss: 0.25587, top1: 0.73635, throughput: 1315.12 | 2022-04-11 00:33:57.224 [rank:5] [train], epoch: 41/50, iter: 800/834, loss: 0.25559, top1: 0.73870, throughput: 1315.09 | 2022-04-11 00:33:57.223 [rank:3] [train], epoch: 41/50, iter: 800/834, loss: 0.25516, top1: 0.73609, throughput: 1314.98 | 2022-04-11 00:33:57.224 [rank:1] [train], epoch: 41/50, iter: 
800/834, loss: 0.25677, top1: 0.73234, throughput: 1314.87 | 2022-04-11 00:33:57.225 [rank:0] [train], epoch: 41/50, iter: 800/834, loss: 0.25460, top1: 0.73896, throughput: 1314.84 | 2022-04-11 00:33:57.226 [rank:6] [train], epoch: 41/50, iter: 800/834, loss: 0.25403, top1: 0.74182, throughput: 1314.79 | 2022-04-11 00:33:57.226 [rank:4] [train], epoch: 41/50, iter: 800/834, loss: 0.25421, top1: 0.74161, throughput: 1314.79 | 2022-04-11 00:33:57.226 [rank:7] [train], epoch: 41/50, iter: 800/834, loss: 0.25401, top1: 0.73792, throughput: 1315.09 | 2022-04-11 00:33:57.225 [rank:4] [train], epoch: 41/50, iter: 834/834, loss: 0.25417, top1: 0.73667, throughput: 1315.33 | 2022-04-11 00:34:02.189 [rank:5] [train], epoch: 41/50, iter: 834/834, loss: 0.25839, top1: 0.73560, throughput: 1314.45 | 2022-04-11 00:34:02.189 [rank:6] [train], epoch: 41/50, iter: 834/834, loss: 0.25678, top1: 0.73330, throughput: 1315.39 | 2022-04-11 00:34:02.189 [rank:0] [train], epoch: 41/50, iter: 834/834, loss: 0.25753, top1: 0.73637, throughput: 1315.06 | 2022-04-11 00:34:02.190 [rank:7] [train], epoch: 41/50, iter: 834/834, loss: 0.25520, top1: 0.73974, throughput: 1314.76 | 2022-04-11 00:34:02.190 [rank:2] [train], epoch: 41/50, iter: 834/834, loss: 0.25253, top1: 0.74449, throughput: 1314.51 | 2022-04-11 00:34:02.190 [rank:1] [train], epoch: 41/50, iter: 834/834, loss: 0.25230, top1: 0.74494, throughput: 1314.45 | 2022-04-11 00:34:02.191 [rank:3] [train], epoch: 41/50, iter: 834/834, loss: 0.25377, top1: 0.73897, throughput: 1313.66 | 2022-04-11 00:34:02.194 [rank:0] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.73952, throughput: 579.43 | 2022-04-11 00:34:12.976 [rank:7] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.73872, throughput: 578.88 | 2022-04-11 00:34:12.987 [rank:2] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.72592, throughput: 578.79 | 2022-04-11 00:34:12.988 [rank:4] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.73392, 
throughput: 575.91 | 2022-04-11 00:34:13.042 [rank:3] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.73360, throughput: 575.95 | 2022-04-11 00:34:13.045 [rank:6] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.73984, throughput: 573.43 | 2022-04-11 00:34:13.088 [rank:5] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.72992, throughput: 565.70 | 2022-04-11 00:34:13.238 [rank:1] [eval], epoch: 41/50, iter: 125/125, loss: 0.00000, top1: 0.74272, throughput: 563.88 | 2022-04-11 00:34:13.275 [rank:2] [train], epoch: 42/50, iter: 100/834, loss: 0.24922, top1: 0.75240, throughput: 1292.84 | 2022-04-11 00:34:27.839 [rank:6] [train], epoch: 42/50, iter: 100/834, loss: 0.25239, top1: 0.74052, throughput: 1301.49 | 2022-04-11 00:34:27.841 [rank:7] [train], epoch: 42/50, iter: 100/834, loss: 0.25029, top1: 0.75167, throughput: 1292.69 | 2022-04-11 00:34:27.840 [rank:3] [train], epoch: 42/50, iter: 100/834, loss: 0.25067, top1: 0.74786, throughput: 1297.67 | 2022-04-11 00:34:27.841 [rank:1] [train], epoch: 42/50, iter: 100/834, loss: 0.25097, top1: 0.75089, throughput: 1318.08 | 2022-04-11 00:34:27.842 [rank:0] [train], epoch: 42/50, iter: 100/834, loss: 0.25229, top1: 0.74906, throughput: 1291.74 | 2022-04-11 00:34:27.840 [rank:5] [train], epoch: 42/50, iter: 100/834, loss: 0.24971, top1: 0.75000, throughput: 1314.59 | 2022-04-11 00:34:27.843 [rank:4] [train], epoch: 42/50, iter: 100/834, loss: 0.25103, top1: 0.75047, throughput: 1297.18 | 2022-04-11 00:34:27.843 [rank:6] [train], epoch: 42/50, iter: 200/834, loss: 0.24945, top1: 0.74552, throughput: 1317.08 | 2022-04-11 00:34:42.418 [rank:4] [train], epoch: 42/50, iter: 200/834, loss: 0.25006, top1: 0.75000, throughput: 1317.37 | 2022-04-11 00:34:42.418 [rank:1] [train], epoch: 42/50, iter: 200/834, loss: 0.25228, top1: 0.74552, throughput: 1317.15 | 2022-04-11 00:34:42.419 [rank:2] [train], epoch: 42/50, iter: 200/834, loss: 0.24985, top1: 0.75208, throughput: 1316.85 | 2022-04-11 
00:34:42.420 [rank:3] [train], epoch: 42/50, iter: 200/834, loss: 0.25174, top1: 0.74776, throughput: 1316.93 | 2022-04-11 00:34:42.420 [rank:0] [train], epoch: 42/50, iter: 200/834, loss: 0.24937, top1: 0.75068, throughput: 1316.71 | 2022-04-11 00:34:42.422 [rank:5] [train], epoch: 42/50, iter: 200/834, loss: 0.25023, top1: 0.74370, throughput: 1316.93 | 2022-04-11 00:34:42.422 [rank:7] [train], epoch: 42/50, iter: 200/834, loss: 0.25037, top1: 0.75000, throughput: 1316.51 | 2022-04-11 00:34:42.424 [rank:6] [train], epoch: 42/50, iter: 300/834, loss: 0.24899, top1: 0.75005, throughput: 1314.68 | 2022-04-11 00:34:57.023 [rank:5] [train], epoch: 42/50, iter: 300/834, loss: 0.25083, top1: 0.74719, throughput: 1314.86 | 2022-04-11 00:34:57.025 [rank:2] [train], epoch: 42/50, iter: 300/834, loss: 0.24926, top1: 0.75172, throughput: 1314.67 | 2022-04-11 00:34:57.024 [rank:7] [train], epoch: 42/50, iter: 300/834, loss: 0.24878, top1: 0.75177, throughput: 1314.96 | 2022-04-11 00:34:57.025 [rank:0] [train], epoch: 42/50, iter: 300/834, loss: 0.25027, top1: 0.75156, throughput: 1314.84 | 2022-04-11 00:34:57.024 [rank:4] [train], epoch: 42/50, iter: 300/834, loss: 0.24933, top1: 0.74906, throughput: 1314.42 | 2022-04-11 00:34:57.025 [rank:3] [train], epoch: 42/50, iter: 300/834, loss: 0.24950, top1: 0.75266, throughput: 1314.45 | 2022-04-11 00:34:57.027 [rank:1] [train], epoch: 42/50, iter: 300/834, loss: 0.25058, top1: 0.74995, throughput: 1314.25 | 2022-04-11 00:34:57.028 [rank:7] [train], epoch: 42/50, iter: 400/834, loss: 0.25083, top1: 0.74635, throughput: 1314.69 | 2022-04-11 00:35:11.629 [rank:6] [train], epoch: 42/50, iter: 400/834, loss: 0.24985, top1: 0.75062, throughput: 1314.56 | 2022-04-11 00:35:11.628 [rank:1] [train], epoch: 42/50, iter: 400/834, loss: 0.24927, top1: 0.74839, throughput: 1314.93 | 2022-04-11 00:35:11.630 [rank:2] [train], epoch: 42/50, iter: 400/834, loss: 0.24818, top1: 0.75563, throughput: 1314.44 | 2022-04-11 00:35:11.631 [rank:4] [train], 
epoch: 42/50, iter: 400/834, loss: 0.25093, top1: 0.74557, throughput: 1314.75 | 2022-04-11 00:35:11.628 [rank:5] [train], epoch: 42/50, iter: 400/834, loss: 0.25245, top1: 0.74411, throughput: 1314.56 | 2022-04-11 00:35:11.630 [rank:3] [train], epoch: 42/50, iter: 400/834, loss: 0.24944, top1: 0.75109, throughput: 1314.79 | 2022-04-11 00:35:11.630 [rank:0] [train], epoch: 42/50, iter: 400/834, loss: 0.24959, top1: 0.75208, throughput: 1314.55 | 2022-04-11 00:35:11.630 [rank:5] [train], epoch: 42/50, iter: 500/834, loss: 0.25167, top1: 0.74521, throughput: 1315.53 | 2022-04-11 00:35:26.225 [rank:4] [train], epoch: 42/50, iter: 500/834, loss: 0.24898, top1: 0.75083, throughput: 1315.34 | 2022-04-11 00:35:26.225 [rank:6] [train], epoch: 42/50, iter: 500/834, loss: 0.25015, top1: 0.74922, throughput: 1315.15 | 2022-04-11 00:35:26.228 [rank:1] [train], epoch: 42/50, iter: 500/834, loss: 0.25139, top1: 0.74630, throughput: 1315.28 | 2022-04-11 00:35:26.227 [rank:2] [train], epoch: 42/50, iter: 500/834, loss: 0.24924, top1: 0.74823, throughput: 1315.45 | 2022-04-11 00:35:26.227 [rank:3] [train], epoch: 42/50, iter: 500/834, loss: 0.24818, top1: 0.75234, throughput: 1315.25 | 2022-04-11 00:35:26.228 [rank:7] [train], epoch: 42/50, iter: 500/834, loss: 0.25084, top1: 0.74828, throughput: 1315.26 | 2022-04-11 00:35:26.227 [rank:0] [train], epoch: 42/50, iter: 500/834, loss: 0.24915, top1: 0.75094, throughput: 1315.18 | 2022-04-11 00:35:26.229 [rank:2] [train], epoch: 42/50, iter: 600/834, loss: 0.25172, top1: 0.74667, throughput: 1317.63 | 2022-04-11 00:35:40.799 [rank:4] [train], epoch: 42/50, iter: 600/834, loss: 0.24983, top1: 0.74766, throughput: 1317.55 | 2022-04-11 00:35:40.798 [rank:5] [train], epoch: 42/50, iter: 600/834, loss: 0.25155, top1: 0.74932, throughput: 1317.34 | 2022-04-11 00:35:40.800 [rank:7] [train], epoch: 42/50, iter: 600/834, loss: 0.24939, top1: 0.74859, throughput: 1317.54 | 2022-04-11 00:35:40.800 [rank:3] [train], epoch: 42/50, iter: 600/834, 
loss: 0.25075, top1: 0.74964, throughput: 1317.52 | 2022-04-11 00:35:40.801 [rank:0] [train], epoch: 42/50, iter: 600/834, loss: 0.25057, top1: 0.74833, throughput: 1317.69 | 2022-04-11 00:35:40.800 [rank:1] [train], epoch: 42/50, iter: 600/834, loss: 0.25282, top1: 0.74354, throughput: 1317.35 | 2022-04-11 00:35:40.802 [rank:6] [train], epoch: 42/50, iter: 600/834, loss: 0.25047, top1: 0.75036, throughput: 1317.50 | 2022-04-11 00:35:40.801 [rank:4] [train], epoch: 42/50, iter: 700/834, loss: 0.25043, top1: 0.74828, throughput: 1315.66 | 2022-04-11 00:35:55.391 [rank:6] [train], epoch: 42/50, iter: 700/834, loss: 0.25337, top1: 0.73870, throughput: 1316.06 | 2022-04-11 00:35:55.390 [rank:7] [train], epoch: 42/50, iter: 700/834, loss: 0.25049, top1: 0.74885, throughput: 1315.89 | 2022-04-11 00:35:55.391 [rank:3] [train], epoch: 42/50, iter: 700/834, loss: 0.24975, top1: 0.74786, throughput: 1315.81 | 2022-04-11 00:35:55.393 [rank:2] [train], epoch: 42/50, iter: 700/834, loss: 0.25202, top1: 0.74578, throughput: 1315.76 | 2022-04-11 00:35:55.391 [rank:1] [train], epoch: 42/50, iter: 700/834, loss: 0.25300, top1: 0.74385, throughput: 1315.99 | 2022-04-11 00:35:55.392 [rank:5] [train], epoch: 42/50, iter: 700/834, loss: 0.24732, top1: 0.75417, throughput: 1315.83 | 2022-04-11 00:35:55.392 [rank:0] [train], epoch: 42/50, iter: 700/834, loss: 0.25090, top1: 0.74786, throughput: 1315.94 | 2022-04-11 00:35:55.390 [rank:6] [train], epoch: 42/50, iter: 800/834, loss: 0.24957, top1: 0.74953, throughput: 1315.04 | 2022-04-11 00:36:09.990 [rank:5] [train], epoch: 42/50, iter: 800/834, loss: 0.24979, top1: 0.74708, throughput: 1315.21 | 2022-04-11 00:36:09.990 [rank:1] [train], epoch: 42/50, iter: 800/834, loss: 0.25045, top1: 0.74797, throughput: 1315.16 | 2022-04-11 00:36:09.991 [rank:2] [train], epoch: 42/50, iter: 800/834, loss: 0.24988, top1: 0.74813, throughput: 1314.91 | 2022-04-11 00:36:09.993 [rank:3] [train], epoch: 42/50, iter: 800/834, loss: 0.25113, top1: 0.74479, 
throughput: 1315.10 | 2022-04-11 00:36:09.993 [rank:4] [train], epoch: 42/50, iter: 800/834, loss: 0.25043, top1: 0.74828, throughput: 1315.06 | 2022-04-11 00:36:09.991 [rank:7] [train], epoch: 42/50, iter: 800/834, loss: 0.24996, top1: 0.74818, throughput: 1314.95 | 2022-04-11 00:36:09.992 [rank:0] [train], epoch: 42/50, iter: 800/834, loss: 0.25162, top1: 0.74250, throughput: 1314.81 | 2022-04-11 00:36:09.993 [rank:5] [train], epoch: 42/50, iter: 834/834, loss: 0.24910, top1: 0.74648, throughput: 1314.11 | 2022-04-11 00:36:14.958 [rank:7] [train], epoch: 42/50, iter: 834/834, loss: 0.24961, top1: 0.74510, throughput: 1314.92 | 2022-04-11 00:36:14.957 [rank:1] [train], epoch: 42/50, iter: 834/834, loss: 0.24924, top1: 0.74939, throughput: 1314.39 | 2022-04-11 00:36:14.957 [rank:6] [train], epoch: 42/50, iter: 834/834, loss: 0.24511, top1: 0.76180, throughput: 1314.03 | 2022-04-11 00:36:14.958 [rank:3] [train], epoch: 42/50, iter: 834/834, loss: 0.24737, top1: 0.75674, throughput: 1314.72 | 2022-04-11 00:36:14.958 [rank:4] [train], epoch: 42/50, iter: 834/834, loss: 0.25109, top1: 0.74280, throughput: 1314.44 | 2022-04-11 00:36:14.958 [rank:2] [train], epoch: 42/50, iter: 834/834, loss: 0.25032, top1: 0.75153, throughput: 1314.56 | 2022-04-11 00:36:14.958 [rank:0] [train], epoch: 42/50, iter: 834/834, loss: 0.25529, top1: 0.73483, throughput: 1314.53 | 2022-04-11 00:36:14.959 [rank:7] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.74960, throughput: 581.66 | 2022-04-11 00:36:25.702 [rank:0] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.74032, throughput: 581.70 | 2022-04-11 00:36:25.704 [rank:6] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.74848, throughput: 571.75 | 2022-04-11 00:36:25.889 [rank:1] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.75040, throughput: 570.37 | 2022-04-11 00:36:25.915 [rank:2] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.73344, throughput: 569.70 | 2022-04-11 
00:36:25.929 [rank:3] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.73904, throughput: 567.20 | 2022-04-11 00:36:25.977 [rank:4] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.74320, throughput: 564.05 | 2022-04-11 00:36:26.038 [rank:5] [eval], epoch: 42/50, iter: 125/125, loss: 0.00000, top1: 0.73168, throughput: 562.59 | 2022-04-11 00:36:26.067 [rank:0] [train], epoch: 43/50, iter: 100/834, loss: 0.24787, top1: 0.75792, throughput: 1287.71 | 2022-04-11 00:36:40.614 [rank:2] [train], epoch: 43/50, iter: 100/834, loss: 0.24574, top1: 0.75604, throughput: 1307.61 | 2022-04-11 00:36:40.612 [rank:5] [train], epoch: 43/50, iter: 100/834, loss: 0.24474, top1: 0.76161, throughput: 1319.93 | 2022-04-11 00:36:40.613 [rank:4] [train], epoch: 43/50, iter: 100/834, loss: 0.24643, top1: 0.75578, throughput: 1317.28 | 2022-04-11 00:36:40.614 [rank:6] [train], epoch: 43/50, iter: 100/834, loss: 0.24478, top1: 0.76437, throughput: 1303.92 | 2022-04-11 00:36:40.614 [rank:7] [train], epoch: 43/50, iter: 100/834, loss: 0.24675, top1: 0.75776, throughput: 1287.56 | 2022-04-11 00:36:40.614 [rank:1] [train], epoch: 43/50, iter: 100/834, loss: 0.24571, top1: 0.75990, throughput: 1306.07 | 2022-04-11 00:36:40.616 [rank:3] [train], epoch: 43/50, iter: 100/834, loss: 0.24754, top1: 0.75500, throughput: 1311.52 | 2022-04-11 00:36:40.616 [rank:0] [train], epoch: 43/50, iter: 200/834, loss: 0.24742, top1: 0.75542, throughput: 1316.34 | 2022-04-11 00:36:55.200 [rank:5] [train], epoch: 43/50, iter: 200/834, loss: 0.24445, top1: 0.76260, throughput: 1316.45 | 2022-04-11 00:36:55.198 [rank:4] [train], epoch: 43/50, iter: 200/834, loss: 0.24717, top1: 0.75417, throughput: 1316.39 | 2022-04-11 00:36:55.199 [rank:7] [train], epoch: 43/50, iter: 200/834, loss: 0.24526, top1: 0.75974, throughput: 1316.39 | 2022-04-11 00:36:55.199 [rank:1] [train], epoch: 43/50, iter: 200/834, loss: 0.24695, top1: 0.75896, throughput: 1316.58 | 2022-04-11 00:36:55.199 [rank:2] [train], epoch: 
43/50, iter: 200/834, loss: 0.24734, top1: 0.75708, throughput: 1316.15 | 2022-04-11 00:36:55.200 [rank:6] [train], epoch: 43/50, iter: 200/834, loss: 0.24625, top1: 0.75885, throughput: 1316.33[rank:3] [train], epoch: 43/50, iter: 200/834, loss: 0.24611, top1: 0.75839, throughput: 1316.35 | 2022-04-11 00:36:55.200| 2022-04-11 00:36:55.202 [rank:6] [train], epoch: 43/50, iter: 300/834, loss: 0.24694, top1: 0.75630, throughput: 1317.53 | 2022-04-11 00:37:09.773 [rank:5] [train], epoch: 43/50, iter: 300/834, loss: 0.24487, top1: 0.76073, throughput: 1317.36 | 2022-04-11 00:37:09.772 [rank:2] [train], epoch: 43/50, iter: 300/834, loss: 0.24413, top1: 0.76099, throughput: 1317.54 | 2022-04-11 00:37:09.773 [rank:1] [train], epoch: 43/50, iter: 300/834, loss: 0.24578, top1: 0.76099, throughput: 1317.26 | 2022-04-11 00:37:09.775 [rank:4] [train], epoch: 43/50, iter: 300/834, loss: 0.24829, top1: 0.75526, throughput: 1317.39 | 2022-04-11 00:37:09.774 [rank:0] [train], epoch: 43/50, iter: 300/834, loss: 0.24642, top1: 0.76057, throughput: 1317.27 | 2022-04-11 00:37:09.775 [rank:3] [train], epoch: 43/50, iter: 300/834, loss: 0.24396, top1: 0.76089, throughput: 1317.29 | 2022-04-11 00:37:09.778 [rank:7] [train], epoch: 43/50, iter: 300/834, loss: 0.24551, top1: 0.75771, throughput: 1316.93 | 2022-04-11 00:37:09.778 [rank:6] [train], epoch: 43/50, iter: 400/834, loss: 0.24629, top1: 0.75687, throughput: 1316.25 | 2022-04-11 00:37:24.360 [rank:5] [train], epoch: 43/50, iter: 400/834, loss: 0.24431, top1: 0.75979, throughput: 1316.21 | 2022-04-11 00:37:24.360 [rank:2] [train], epoch: 43/50, iter: 400/834, loss: 0.24835, top1: 0.75234, throughput: 1316.22 | 2022-04-11 00:37:24.360 [rank:4] [train], epoch: 43/50, iter: 400/834, loss: 0.24409, top1: 0.76193, throughput: 1316.31 | 2022-04-11 00:37:24.360 [rank:1] [train], epoch: 43/50, iter: 400/834, loss: 0.24754, top1: 0.75792, throughput: 1316.38 | 2022-04-11 00:37:24.360 [rank:0] [train], epoch: 43/50, iter: 400/834, loss: 
0.24841, top1: 0.75302, throughput: 1316.34 | 2022-04-11 00:37:24.361 [rank:3] [train], epoch: 43/50, iter: 400/834, loss: 0.24539, top1: 0.75792, throughput: 1316.32[rank:7] [train], epoch: 43/50, iter: 400/834, loss: 0.24650, top1: 0.75990, throughput: 1316.51 | 2022-04-11 00:37:24.364 | 2022-04-11 00:37:24.362 [rank:2] [train], epoch: 43/50, iter: 500/834, loss: 0.24687, top1: 0.75635, throughput: 1314.42 | 2022-04-11 00:37:38.967 [rank:4] [train], epoch: 43/50, iter: 500/834, loss: 0.24462, top1: 0.75818, throughput: 1314.36 | 2022-04-11 00:37:38.968 [rank:5] [train], epoch: 43/50, iter: 500/834, loss: 0.24755, top1: 0.75484, throughput: 1314.47 | 2022-04-11 00:37:38.966 [rank:6] [train], epoch: 43/50, iter: 500/834, loss: 0.24538, top1: 0.75828, throughput: 1314.37 | 2022-04-11 00:37:38.967 [rank:3] [train], epoch: 43/50, iter: 500/834, loss: 0.24479, top1: 0.76146, throughput: 1314.50 | 2022-04-11 00:37:38.970 [rank:1] [train], epoch: 43/50, iter: 500/834, loss: 0.24780, top1: 0.75599, throughput: 1314.30 | 2022-04-11 00:37:38.969 [rank:0] [train], epoch: 43/50, iter: 500/834, loss: 0.24477, top1: 0.75990, throughput: 1314.38 | 2022-04-11 00:37:38.969 [rank:7] [train], epoch: 43/50, iter: 500/834, loss: 0.24736, top1: 0.75766, throughput: 1314.35 | 2022-04-11 00:37:38.970 [rank:5] [train], epoch: 43/50, iter: 600/834, loss: 0.24638, top1: 0.75604, throughput: 1316.47 | 2022-04-11 00:37:53.551 [rank:6] [train], epoch: 43/50, iter: 600/834, loss: 0.24720, top1: 0.75063, throughput: 1316.30 | 2022-04-11 00:37:53.554 [rank:4] [train], epoch: 43/50, iter: 600/834, loss: 0.24431, top1: 0.75948, throughput: 1316.52 | 2022-04-11 00:37:53.551 [rank:7] [train], epoch: 43/50, iter: 600/834, loss: 0.24746, top1: 0.75432, throughput: 1316.74 | 2022-04-11 00:37:53.552 [rank:0] [train], epoch: 43/50, iter: 600/834, loss: 0.24403, top1: 0.75979, throughput: 1316.61 | 2022-04-11 00:37:53.552 [rank:1] [train], epoch: 43/50, iter: 600/834, loss: 0.24635, top1: 0.75693, 
throughput: 1316.44 | 2022-04-11 00:37:53.554 [rank:2] [train], epoch: 43/50, iter: 600/834, loss: 0.24658, top1: 0.75766, throughput: 1316.22 | 2022-04-11 00:37:53.555 [rank:3] [train], epoch: 43/50, iter: 600/834, loss: 0.24553, top1: 0.75911, throughput: 1316.34 | 2022-04-11 00:37:53.556 [rank:2] [train], epoch: 43/50, iter: 700/834, loss: 0.24529, top1: 0.76021, throughput: 1314.72 | 2022-04-11 00:38:08.158 [rank:6] [train], epoch: 43/50, iter: 700/834, loss: 0.24709, top1: 0.75885, throughput: 1314.64 | 2022-04-11 00:38:08.158 [rank:4] [train], epoch: 43/50, iter: 700/834, loss: 0.24539, top1: 0.75531, throughput: 1314.50 | 2022-04-11 00:38:08.158 [rank:5] [train], epoch: 43/50, iter: 700/834, loss: 0.24401, top1: 0.76161, throughput: 1314.38 | 2022-04-11 00:38:08.158 [rank:1] [train], epoch: 43/50, iter: 700/834, loss: 0.24701, top1: 0.75375, throughput: 1314.61 | 2022-04-11 00:38:08.159 [rank:3] [train], epoch: 43/50, iter: 700/834, loss: 0.24747, top1: 0.75484, throughput: 1314.77 | 2022-04-11 00:38:08.159 [rank:7] [train], epoch: 43/50, iter: 700/834, loss: 0.24736, top1: 0.75734, throughput: 1314.29 | 2022-04-11 00:38:08.160 [rank:0] [train], epoch: 43/50, iter: 700/834, loss: 0.24644, top1: 0.75604, throughput: 1314.39 | 2022-04-11 00:38:08.159 [rank:6] [train], epoch: 43/50, iter: 800/834, loss: 0.24696, top1: 0.75469, throughput: 1315.48 | 2022-04-11 00:38:22.754 [rank:1] [train], epoch: 43/50, iter: 800/834, loss: 0.24648, top1: 0.75781, throughput: 1315.47 | 2022-04-11 00:38:22.754 [rank:7] [train], epoch: 43/50, iter: 800/834, loss: 0.24567, top1: 0.75839, throughput: 1315.69[rank:5] [train], epoch: 43/50, iter: 800/834, loss: 0.24530, top1: 0.76068, throughput: 1315.58 | 2022-04-11 00:38:22.754| 2022-04-11 00:38:22.753 [rank:2] [train], epoch: 43/50, iter: 800/834, loss: 0.24705, top1: 0.75755, throughput: 1315.53 | 2022-04-11 00:38:22.753 [rank:0] [train], epoch: 43/50, iter: 800/834, loss: 0.24371, top1: 0.76432, throughput: 1315.47 | 2022-04-11 
00:38:22.755 [rank:4] [train], epoch: 43/50, iter: 800/834, loss: 0.24649, top1: 0.75708, throughput: 1315.37 | 2022-04-11 00:38:22.754 [rank:3] [train], epoch: 43/50, iter: 800/834, loss: 0.24776, top1: 0.75385, throughput: 1315.24 | 2022-04-11 00:38:22.757 [rank:4] [train], epoch: 43/50, iter: 834/834, loss: 0.24962, top1: 0.75551, throughput: 1306.69 | 2022-04-11 00:38:27.750 [rank:5] [train], epoch: 43/50, iter: 834/834, loss: 0.24621, top1: 0.76057, throughput: 1306.31 | 2022-04-11 00:38:27.750 [rank:2] [train], epoch: 43/50, iter: 834/834, loss: 0.24435, top1: 0.75490, throughput: 1306.17 | 2022-04-11 00:38:27.751 [rank:6] [train], epoch: 43/50, iter: 834/834, loss: 0.24801, top1: 0.75230, throughput: 1306.21 | 2022-04-11 00:38:27.752 [rank:7] [train], epoch: 43/50, iter: 834/834, loss: 0.24774, top1: 0.75061, throughput: 1306.02 | 2022-04-11 00:38:27.752 [rank:0] [train], epoch: 43/50, iter: 834/834, loss: 0.24275, top1: 0.77022, throughput: 1306.42 | 2022-04-11 00:38:27.752 [rank:1] [train], epoch: 43/50, iter: 834/834, loss: 0.24796, top1: 0.74709, throughput: 1306.25 | 2022-04-11 00:38:27.752 [rank:3] [train], epoch: 43/50, iter: 834/834, loss: 0.24900, top1: 0.74985, throughput: 1306.89 | 2022-04-11 00:38:27.752 [rank:0] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.74976, throughput: 582.98 | 2022-04-11 00:38:38.473 [rank:7] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.74960, throughput: 582.85 | 2022-04-11 00:38:38.475 [rank:4] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.74368, throughput: 580.25 | 2022-04-11 00:38:38.522 [rank:2] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.73424, throughput: 578.83 | 2022-04-11 00:38:38.549 [rank:3] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.74784, throughput: 572.88 | 2022-04-11 00:38:38.662 [rank:6] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.75712, throughput: 572.14 | 2022-04-11 00:38:38.675 [rank:5] [eval], epoch: 43/50, 
iter: 125/125, loss: 0.00000, top1: 0.73744, throughput: 568.25 | 2022-04-11 00:38:38.749 [rank:1] [eval], epoch: 43/50, iter: 125/125, loss: 0.00000, top1: 0.75248, throughput: 567.63 | 2022-04-11 00:38:38.762 [rank:6] [train], epoch: 44/50, iter: 100/834, loss: 0.24381, top1: 0.76245, throughput: 1312.33 | 2022-04-11 00:38:53.306 [rank:5] [train], epoch: 44/50, iter: 100/834, loss: 0.24122, top1: 0.76823, throughput: 1318.81 | 2022-04-11 00:38:53.307 [rank:3] [train], epoch: 44/50, iter: 100/834, loss: 0.24204, top1: 0.76042, throughput: 1310.97 | 2022-04-11 00:38:53.308 [rank:0] [train], epoch: 44/50, iter: 100/834, loss: 0.24168, top1: 0.76995, throughput: 1294.20 | 2022-04-11 00:38:53.308 [rank:4] [train], epoch: 44/50, iter: 100/834, loss: 0.24110, top1: 0.76849, throughput: 1298.44 | 2022-04-11 00:38:53.309 [rank:1] [train], epoch: 44/50, iter: 100/834, loss: 0.24251, top1: 0.76734, throughput: 1319.85 | 2022-04-11 00:38:53.310 [rank:2] [train], epoch: 44/50, iter: 100/834, loss: 0.24290, top1: 0.76589, throughput: 1300.82 | 2022-04-11 00:38:53.309 [rank:7] [train], epoch: 44/50, iter: 100/834, loss: 0.24405, top1: 0.75844, throughput: 1294.25 | 2022-04-11 00:38:53.310 [rank:6] [train], epoch: 44/50, iter: 200/834, loss: 0.24087, top1: 0.76875, throughput: 1315.99 | 2022-04-11 00:39:07.896 [rank:5] [train], epoch: 44/50, iter: 200/834, loss: 0.24176, top1: 0.76740, throughput: 1316.10 | 2022-04-11 00:39:07.896 [rank:4] [train], epoch: 44/50, iter: 200/834, loss: 0.24335, top1: 0.76266, throughput: 1316.21 | 2022-04-11 00:39:07.896 [rank:3] [train], epoch: 44/50, iter: 200/834, loss: 0.24183, top1: 0.76672, throughput: 1316.01 | 2022-04-11 00:39:07.897 [rank:2] [train], epoch: 44/50, iter: 200/834, loss: 0.24226, top1: 0.76573, throughput: 1316.14 | 2022-04-11 00:39:07.897 [rank:0] [train], epoch: 44/50, iter: 200/834, loss: 0.24241, top1: 0.76594, throughput: 1316.00 | 2022-04-11 00:39:07.898 [rank:1] [train], epoch: 44/50, iter: 200/834, loss: 0.24227, 
top1: 0.76417, throughput: 1316.17 | 2022-04-11 00:39:07.897 [rank:7] [train], epoch: 44/50, iter: 200/834, loss: 0.24129, top1: 0.77042, throughput: 1316.16 | 2022-04-11 00:39:07.898 [rank:4] [train], epoch: 44/50, iter: 300/834, loss: 0.24256, top1: 0.76672, throughput: 1317.69 | 2022-04-11 00:39:22.467 [rank:6] [train], epoch: 44/50, iter: 300/834, loss: 0.24306, top1: 0.76786, throughput: 1317.70 | 2022-04-11 00:39:22.467 [rank:2] [train], epoch: 44/50, iter: 300/834, loss: 0.24287, top1: 0.76177, throughput: 1317.67 | 2022-04-11 00:39:22.468 [rank:7] [train], epoch: 44/50, iter: 300/834, loss: 0.24277, top1: 0.76286, throughput: 1317.74 | 2022-04-11 00:39:22.468 [rank:3] [train], epoch: 44/50, iter: 300/834, loss: 0.24247, top1: 0.76750, throughput: 1317.41 | 2022-04-11 00:39:22.471 [rank:5] [train], epoch: 44/50, iter: 300/834, loss: 0.24233, top1: 0.76823, throughput: 1317.49 | 2022-04-11 00:39:22.469 [rank:1] [train], epoch: 44/50, iter: 300/834, loss: 0.24287, top1: 0.76380, throughput: 1317.27 | 2022-04-11 00:39:22.473 [rank:0] [train], epoch: 44/50, iter: 300/834, loss: 0.24336, top1: 0.76547, throughput: 1317.25 | 2022-04-11 00:39:22.473 [rank:2] [train], epoch: 44/50, iter: 400/834, loss: 0.23823, top1: 0.77359, throughput: 1307.47 | 2022-04-11 00:39:37.153 [rank:6] [train], epoch: 44/50, iter: 400/834, loss: 0.24241, top1: 0.76589, throughput: 1307.22 | 2022-04-11 00:39:37.154 [rank:4] [train], epoch: 44/50, iter: 400/834, loss: 0.24118, top1: 0.76526, throughput: 1307.36 | 2022-04-11 00:39:37.153 [rank:1] [train], epoch: 44/50, iter: 400/834, loss: 0.24376, top1: 0.76203, throughput: 1307.88 | 2022-04-11 00:39:37.153 [rank:5] [train], epoch: 44/50, iter: 400/834, loss: 0.24329, top1: 0.76505, throughput: 1307.61 | 2022-04-11 00:39:37.152 [rank:0] [train], epoch: 44/50, iter: 400/834, loss: 0.24293, top1: 0.76526, throughput: 1307.60 | 2022-04-11 00:39:37.157 [rank:3] [train], epoch: 44/50, iter: 400/834, loss: 0.24266, top1: 0.76609, throughput: 
1307.44 | 2022-04-11 00:39:37.157 [rank:7] [train], epoch: 44/50, iter: 400/834, loss: 0.24291, top1: 0.76776, throughput: 1307.16 | 2022-04-11 00:39:37.157 [rank:5] [train], epoch: 44/50, iter: 500/834, loss: 0.24323, top1: 0.76599, throughput: 1317.99 | 2022-04-11 00:39:51.720 [rank:2] [train], epoch: 44/50, iter: 500/834, loss: 0.24290, top1: 0.76510, throughput: 1317.78 | 2022-04-11 00:39:51.723 [rank:3] [train], epoch: 44/50, iter: 500/834, loss: 0.24178, top1: 0.76854, throughput: 1318.20 | 2022-04-11 00:39:51.722 [rank:4] [train], epoch: 44/50, iter: 500/834, loss: 0.24351, top1: 0.76401, throughput: 1317.98 | 2022-04-11 00:39:51.721 [rank:6] [train], epoch: 44/50, iter: 500/834, loss: 0.24411, top1: 0.76042, throughput: 1317.94 | 2022-04-11 00:39:51.722 [rank:1] [train], epoch: 44/50, iter: 500/834, loss: 0.24294, top1: 0.76349, throughput: 1317.74 | 2022-04-11 00:39:51.724 [rank:7] [train], epoch: 44/50, iter: 500/834, loss: 0.24375, top1: 0.76526, throughput: 1318.03 | 2022-04-11 00:39:51.724 [rank:0] [train], epoch: 44/50, iter: 500/834, loss: 0.24346, top1: 0.75964, throughput: 1318.17 | 2022-04-11 00:39:51.723 [rank:2] [train], epoch: 44/50, iter: 600/834, loss: 0.24219, top1: 0.76672, throughput: 1313.23 | 2022-04-11 00:40:06.343 [rank:5] [train], epoch: 44/50, iter: 600/834, loss: 0.24228, top1: 0.76297, throughput: 1312.94 | 2022-04-11 00:40:06.344 [rank:6] [train], epoch: 44/50, iter: 600/834, loss: 0.24177, top1: 0.76802, throughput: 1313.05 | 2022-04-11 00:40:06.345 [rank:4] [train], epoch: 44/50, iter: 600/834, loss: 0.24340, top1: 0.76401, throughput: 1313.12 | 2022-04-11 00:40:06.342 [rank:3] [train], epoch: 44/50, iter: 600/834, loss: 0.24265, top1: 0.76776, throughput: 1313.02 | 2022-04-11 00:40:06.345 [rank:7] [train], epoch: 44/50, iter: 600/834, loss: 0.23899, top1: 0.77302, throughput: 1313.20 | 2022-04-11 00:40:06.345 [rank:1] [train], epoch: 44/50, iter: 600/834, loss: 0.24173, top1: 0.76573, throughput: 1313.03 | 2022-04-11 
00:40:06.346 [rank:0] [train], epoch: 44/50, iter: 600/834, loss: 0.24371, top1: 0.76464, throughput: 1313.14 | 2022-04-11 00:40:06.344 [rank:3] [train], epoch: 44/50, iter: 700/834, loss: 0.24115, top1: 0.76651, throughput: 1316.00 | 2022-04-11 00:40:20.934 [rank:4] [train], epoch: 44/50, iter: 700/834, loss: 0.24299, top1: 0.76625, throughput: 1315.80 | 2022-04-11 00:40:20.934 [rank:2] [train], epoch: 44/50, iter: 700/834, loss: 0.24108, top1: 0.76818, throughput: 1316.04 | 2022-04-11 00:40:20.933 [rank:5] [train], epoch: 44/50, iter: 700/834, loss: 0.24116, top1: 0.76729, throughput: 1316.10 | 2022-04-11 00:40:20.932 [rank:0] [train], epoch: 44/50, iter: 700/834, loss: 0.24276, top1: 0.76734, throughput: 1316.06 | 2022-04-11 00:40:20.933 [rank:1] [train], epoch: 44/50, iter: 700/834, loss: 0.24352, top1: 0.76437, throughput: 1316.00 | 2022-04-11 00:40:20.936 [rank:6] [train], epoch: 44/50, iter: 700/834, loss: 0.24370, top1: 0.76432, throughput: 1315.84 | 2022-04-11 00:40:20.936 [rank:7] [train], epoch: 44/50, iter: 700/834, loss: 0.24170, top1: 0.76724, throughput: 1315.55 | 2022-04-11 00:40:20.939 [rank:2] [train], epoch: 44/50, iter: 800/834, loss: 0.24018, top1: 0.77203, throughput: 1316.70 | 2022-04-11 00:40:35.514 [rank:6] [train], epoch: 44/50, iter: 800/834, loss: 0.24215, top1: 0.76620, throughput: 1317.10 | 2022-04-11 00:40:35.514 [rank:5] [train], epoch: 44/50, iter: 800/834, loss: 0.24422, top1: 0.76302, throughput: 1316.73 | 2022-04-11 00:40:35.514 [rank:4] [train], epoch: 44/50, iter: 800/834, loss: 0.24052, top1: 0.77339, throughput: 1316.89 | 2022-04-11 00:40:35.514 [rank:3] [train], epoch: 44/50, iter: 800/834, loss: 0.24220, top1: 0.76432, throughput: 1316.76 | 2022-04-11 00:40:35.516 [rank:7] [train], epoch: 44/50, iter: 800/834, loss: 0.24281, top1: 0.76328, throughput: 1317.29 | 2022-04-11 00:40:35.515 [rank:1] [train], epoch: 44/50, iter: 800/834, loss: 0.24222, top1: 0.76229, throughput: 1316.87 | 2022-04-11 00:40:35.516 [rank:0] [train], 
epoch: 44/50, iter: 800/834, loss: 0.24230, top1: 0.76609, throughput: 1316.55 | 2022-04-11 00:40:35.517 [rank:6] [train], epoch: 44/50, iter: 834/834, loss: 0.24400, top1: 0.76057, throughput: 1311.26 | 2022-04-11 00:40:40.492 [rank:1] [train], epoch: 44/50, iter: 834/834, loss: 0.24163, top1: 0.77053, throughput: 1311.40 | 2022-04-11 00:40:40.494 [rank:5] [train], epoch: 44/50, iter: 834/834, loss: 0.24312, top1: 0.75628, throughput: 1311.07 | 2022-04-11 00:40:40.493 [rank:3] [train], epoch: 44/50, iter: 834/834, loss: 0.24081, top1: 0.76134, throughput: 1311.44 | 2022-04-11 00:40:40.493 [rank:2] [train], epoch: 44/50, iter: 834/834, loss: 0.24147, top1: 0.76562, throughput: 1311.03 | 2022-04-11 00:40:40.494 [rank:7] [train], epoch: 44/50, iter: 834/834, loss: 0.24308, top1: 0.76379, throughput: 1310.99 | 2022-04-11 00:40:40.494 [rank:4] [train], epoch: 44/50, iter: 834/834, loss: 0.24241, top1: 0.77114, throughput: 1310.91 | 2022-04-11 00:40:40.494 [rank:0] [train], epoch: 44/50, iter: 834/834, loss: 0.24062, top1: 0.77436, throughput: 1311.15 | 2022-04-11 00:40:40.495 [rank:0] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.75584, throughput: 584.48 | 2022-04-11 00:40:51.189 [rank:7] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.75904, throughput: 584.16 | 2022-04-11 00:40:51.193 [rank:2] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.74304, throughput: 577.55 | 2022-04-11 00:40:51.315 [rank:6] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.76000, throughput: 573.72 | 2022-04-11 00:40:51.386 [rank:3] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.75616, throughput: 572.06 | 2022-04-11 00:40:51.419 [rank:1] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.75392, throughput: 569.66 | 2022-04-11 00:40:51.465 [rank:5] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, top1: 0.74400, throughput: 566.58 | 2022-04-11 00:40:51.524 [rank:4] [eval], epoch: 44/50, iter: 125/125, loss: 0.00000, 
top1: 0.75056, throughput: 565.73 | 2022-04-11 00:40:51.542 [rank:5] [train], epoch: 45/50, iter: 100/834, loss: 0.23951, top1: 0.77339, throughput: 1317.79 | 2022-04-11 00:41:06.094 [rank:7] [train], epoch: 45/50, iter: 100/834, loss: 0.23737, top1: 0.77844, throughput: 1288.46 | 2022-04-11 00:41:06.095 [rank:4] [train], epoch: 45/50, iter: 100/834, loss: 0.23876, top1: 0.77432, throughput: 1319.19 | 2022-04-11 00:41:06.096 [rank:6] [train], epoch: 45/50, iter: 100/834, loss: 0.23799, top1: 0.77438, throughput: 1305.27 | 2022-04-11 00:41:06.096 [rank:3] [train], epoch: 45/50, iter: 100/834, loss: 0.23846, top1: 0.77120, throughput: 1308.01 | 2022-04-11 00:41:06.097 [rank:1] [train], epoch: 45/50, iter: 100/834, loss: 0.23800, top1: 0.77729, throughput: 1312.15 | 2022-04-11 00:41:06.098 [rank:2] [train], epoch: 45/50, iter: 100/834, loss: 0.23980, top1: 0.76797, throughput: 1298.87 | 2022-04-11 00:41:06.098 [rank:0] [train], epoch: 45/50, iter: 100/834, loss: 0.23734, top1: 0.77495, throughput: 1287.83 | 2022-04-11 00:41:06.097 [rank:4] [train], epoch: 45/50, iter: 200/834, loss: 0.24022, top1: 0.77010, throughput: 1314.33 | 2022-04-11 00:41:20.704 [rank:6] [train], epoch: 45/50, iter: 200/834, loss: 0.23934, top1: 0.77609, throughput: 1314.38 | 2022-04-11 00:41:20.703 [rank:5] [train], epoch: 45/50, iter: 200/834, loss: 0.23905, top1: 0.77411, throughput: 1314.18 | 2022-04-11 00:41:20.704 [rank:2] [train], epoch: 45/50, iter: 200/834, loss: 0.23855, top1: 0.77359, throughput: 1314.40 | 2022-04-11 00:41:20.705 [rank:3] [train], epoch: 45/50, iter: 200/834, loss: 0.24010, top1: 0.76750, throughput: 1314.23 | 2022-04-11 00:41:20.707 [rank:1] [train], epoch: 45/50, iter: 200/834, loss: 0.23896, top1: 0.77177, throughput: 1314.38 | 2022-04-11 00:41:20.705 [rank:7] [train], epoch: 45/50, iter: 200/834, loss: 0.23918, top1: 0.77026, throughput: 1314.09 | 2022-04-11 00:41:20.705 [rank:0] [train], epoch: 45/50, iter: 200/834, loss: 0.24174, top1: 0.76891, throughput: 
1314.39 | 2022-04-11 00:41:20.705 [rank:5] [train], epoch: 45/50, iter: 300/834, loss: 0.24009, top1: 0.76792, throughput: 1313.19 | 2022-04-11 00:41:35.325 [rank:6] [train], epoch: 45/50, iter: 300/834, loss: 0.23904, top1: 0.77229, throughput: 1313.17 | 2022-04-11 00:41:35.325 [rank:2] [train], epoch: 45/50, iter: 300/834, loss: 0.23748, top1: 0.77453, throughput: 1313.31 | 2022-04-11 00:41:35.324 [rank:7] [train], epoch: 45/50, iter: 300/834, loss: 0.23909, top1: 0.77391, throughput: 1313.23 | 2022-04-11 00:41:35.326 [rank:3] [train], epoch: 45/50, iter: 300/834, loss: 0.24036, top1: 0.77094, throughput: 1313.01 | 2022-04-11 00:41:35.330 [rank:4] [train], epoch: 45/50, iter: 300/834, loss: 0.23925, top1: 0.77234, throughput: 1313.20 | 2022-04-11 00:41:35.325 [rank:0] [train], epoch: 45/50, iter: 300/834, loss: 0.23958, top1: 0.77620, throughput: 1313.13 | 2022-04-11 00:41:35.326 [rank:1] [train], epoch: 45/50, iter: 300/834, loss: 0.24022, top1: 0.77135, throughput: 1312.96 | 2022-04-11 00:41:35.329 [rank:6] [train], epoch: 45/50, iter: 400/834, loss: 0.23972, top1: 0.76812, throughput: 1317.15 | 2022-04-11 00:41:49.901 [rank:2] [train], epoch: 45/50, iter: 400/834, loss: 0.23743, top1: 0.77344, throughput: 1317.10 | 2022-04-11 00:41:49.902 [rank:4] [train], epoch: 45/50, iter: 400/834, loss: 0.23907, top1: 0.77271, throughput: 1317.13 | 2022-04-11 00:41:49.902 [rank:3] [train], epoch: 45/50, iter: 400/834, loss: 0.23854, top1: 0.77104, throughput: 1317.40 | 2022-04-11 00:41:49.904 [rank:1] [train], epoch: 45/50, iter: 400/834, loss: 0.23813, top1: 0.77609, throughput: 1317.35 | 2022-04-11 00:41:49.904 [rank:5] [train], epoch: 45/50, iter: 400/834, loss: 0.24219, top1: 0.76823, throughput: 1317.03 | 2022-04-11 00:41:49.903 [rank:7] [train], epoch: 45/50, iter: 400/834, loss: 0.23943, top1: 0.77281, throughput: 1317.00 | 2022-04-11 00:41:49.904 [rank:0] [train], epoch: 45/50, iter: 400/834, loss: 0.23917, top1: 0.77198, throughput: 1317.16 | 2022-04-11 
00:41:49.903 [rank:5] [train], epoch: 45/50, iter: 500/834, loss: 0.23865, top1: 0.77599, throughput: 1315.05 | 2022-04-11 00:42:04.503 [rank:4] [train], epoch: 45/50, iter: 500/834, loss: 0.23994, top1: 0.77104, throughput: 1315.17 | 2022-04-11 00:42:04.501 [rank:6] [train], epoch: 45/50, iter: 500/834, loss: 0.23892, top1: 0.77255, throughput: 1315.08 | 2022-04-11 00:42:04.501 [rank:3] [train], epoch: 45/50, iter: 500/834, loss: 0.24060, top1: 0.76906, throughput: 1315.10 | 2022-04-11 00:42:04.503 [rank:1] [train], epoch: 45/50, iter: 500/834, loss: 0.23969, top1: 0.77224, throughput: 1315.09 | 2022-04-11 00:42:04.503 [rank:2] [train], epoch: 45/50, iter: 500/834, loss: 0.23803, top1: 0.77182, throughput: 1314.61 | 2022-04-11 00:42:04.507 [rank:7] [train], epoch: 45/50, iter: 500/834, loss: 0.24151, top1: 0.76792, throughput: 1315.04 | 2022-04-11 00:42:04.505 [rank:0] [train], epoch: 45/50, iter: 500/834, loss: 0.23658, top1: 0.77646, throughput: 1314.55 | 2022-04-11 00:42:04.509 [rank:6] [train], epoch: 45/50, iter: 600/834, loss: 0.23903, top1: 0.77380, throughput: 1315.48 | 2022-04-11 00:42:19.097 [rank:2] [train], epoch: 45/50, iter: 600/834, loss: 0.23877, top1: 0.77411, throughput: 1315.75 | 2022-04-11 00:42:19.099 [rank:4] [train], epoch: 45/50, iter: 600/834, loss: 0.23773, top1: 0.77339, throughput: 1315.21 | 2022-04-11 00:42:19.099 [rank:7] [train], epoch: 45/50, iter: 600/834, loss: 0.23928, top1: 0.77318, throughput: 1315.74 | 2022-04-11 00:42:19.097 [rank:3] [train], epoch: 45/50, iter: 600/834, loss: 0.23910, top1: 0.77479, throughput: 1315.54 | 2022-04-11 00:42:19.098 [rank:5] [train], epoch: 45/50, iter: 600/834, loss: 0.23839, top1: 0.77906, throughput: 1315.66 | 2022-04-11 00:42:19.097 [rank:0] [train], epoch: 45/50, iter: 600/834, loss: 0.23799, top1: 0.77557, throughput: 1315.91 | 2022-04-11 00:42:19.100 [rank:1] [train], epoch: 45/50, iter: 600/834, loss: 0.23779, top1: 0.77385, throughput: 1315.35 | 2022-04-11 00:42:19.100 [rank:4] [train], 
epoch: 45/50, iter: 700/834, loss: 0.23783, top1: 0.77771, throughput: 1316.15 | 2022-04-11 00:42:33.687 [rank:7] [train], epoch: 45/50, iter: 700/834, loss: 0.23991, top1: 0.77234, throughput: 1315.88 | 2022-04-11 00:42:33.688 [rank:5] [train], epoch: 45/50, iter: 700/834, loss: 0.23698, top1: 0.78094, throughput: 1315.76 | 2022-04-11 00:42:33.689 [rank:2] [train], epoch: 45/50, iter: 700/834, loss: 0.23903, top1: 0.77135, throughput: 1316.07 | 2022-04-11 00:42:33.688 [rank:3] [train], epoch: 45/50, iter: 700/834, loss: 0.23775, top1: 0.77354, throughput: 1315.89 | 2022-04-11 00:42:33.689 [rank:6] [train], epoch: 45/50, iter: 700/834, loss: 0.23957, top1: 0.77036, throughput: 1315.78 | 2022-04-11 00:42:33.689 [rank:1] [train], epoch: 45/50, iter: 700/834, loss: 0.23747, top1: 0.77714, throughput: 1315.89 | 2022-04-11 00:42:33.691 [rank:0] [train], epoch: 45/50, iter: 700/834, loss: 0.23807, top1: 0.77443, throughput: 1316.05 | 2022-04-11 00:42:33.689 [rank:2] [train], epoch: 45/50, iter: 800/834, loss: 0.23739, top1: 0.77828, throughput: 1314.13 | 2022-04-11 00:42:48.299 [rank:7] [train], epoch: 45/50, iter: 800/834, loss: 0.24067, top1: 0.76750, throughput: 1314.08 | 2022-04-11 00:42:48.299 [rank:3] [train], epoch: 45/50, iter: 800/834, loss: 0.23851, top1: 0.77302, throughput: 1314.00 | 2022-04-11 00:42:48.301 [rank:4] [train], epoch: 45/50, iter: 800/834, loss: 0.23773, top1: 0.77391, throughput: 1314.13 | 2022-04-11 00:42:48.298 [rank:6] [train], epoch: 45/50, iter: 800/834, loss: 0.23846, top1: 0.77172, throughput: 1314.21 | 2022-04-11 00:42:48.298 [rank:0] [train], epoch: 45/50, iter: 800/834, loss: 0.23964, top1: 0.77286, throughput: 1314.10 | 2022-04-11 00:42:48.300 [rank:1] [train], epoch: 45/50, iter: 800/834, loss: 0.23882, top1: 0.77302, throughput: 1314.23 | 2022-04-11 00:42:48.300 [rank:5] [train], epoch: 45/50, iter: 800/834, loss: 0.24017, top1: 0.76990, throughput: 1314.06 | 2022-04-11 00:42:48.300 [rank:6] [train], epoch: 45/50, iter: 834/834, 
loss: 0.23859, top1: 0.76945, throughput: 1313.41 | 2022-04-11 00:42:53.269 [rank:4] [train], epoch: 45/50, iter: 834/834, loss: 0.24092, top1: 0.76716, throughput: 1313.21 | 2022-04-11 00:42:53.269 [rank:1] [train], epoch: 45/50, iter: 834/834, loss: 0.23866, top1: 0.77359, throughput: 1313.55 | 2022-04-11 00:42:53.270 [rank:7] [train], epoch: 45/50, iter: 834/834, loss: 0.24017, top1: 0.77344, throughput: 1313.34 | 2022-04-11 00:42:53.270 [rank:5] [train], epoch: 45/50, iter: 834/834, loss: 0.23711, top1: 0.78217, throughput: 1313.10 | 2022-04-11 00:42:53.272 [rank:2] [train], epoch: 45/50, iter: 834/834, loss: 0.23933, top1: 0.77007, throughput: 1313.04 | 2022-04-11 00:42:53.270 [rank:0] [train], epoch: 45/50, iter: 834/834, loss: 0.23887, top1: 0.77037, throughput: 1312.84 | 2022-04-11 00:42:53.272 [rank:3] [train], epoch: 45/50, iter: 834/834, loss: 0.23769, top1: 0.77374, throughput: 1313.05 | 2022-04-11 00:42:53.273 [rank:0] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.76032, throughput: 588.88 | 2022-04-11 00:43:03.885 [rank:7] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.76128, throughput: 587.28 | 2022-04-11 00:43:03.912 [rank:6] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.76176, throughput: 582.04 | 2022-04-11 00:43:04.007 [rank:2] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.75040, throughput: 579.64 | 2022-04-11 00:43:04.053 [rank:3] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.75728, throughput: 579.64 | 2022-04-11 00:43:04.055 [rank:4] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.75264, throughput: 575.58 | 2022-04-11 00:43:04.127 [rank:5] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.74720, throughput: 572.03 | 2022-04-11 00:43:04.198 [rank:1] [eval], epoch: 45/50, iter: 125/125, loss: 0.00000, top1: 0.76016, throughput: 571.45 | 2022-04-11 00:43:04.207 [rank:6] [train], epoch: 46/50, iter: 100/834, loss: 0.23845, top1: 0.77740, throughput: 
1300.18 | 2022-04-11 00:43:18.774 [rank:5] [train], epoch: 46/50, iter: 100/834, loss: 0.23710, top1: 0.77766, throughput: 1317.19 | 2022-04-11 00:43:18.774 [rank:1] [train], epoch: 46/50, iter: 100/834, loss: 0.23611, top1: 0.77964, throughput: 1317.74 | 2022-04-11 00:43:18.778 [rank:2] [train], epoch: 46/50, iter: 100/834, loss: 0.23616, top1: 0.77906, throughput: 1304.07 | 2022-04-11 00:43:18.776 [rank:3] [train], epoch: 46/50, iter: 100/834, loss: 0.23645, top1: 0.77911, throughput: 1304.26 | 2022-04-11 00:43:18.776 [rank:0] [train], epoch: 46/50, iter: 100/834, loss: 0.23506, top1: 0.78120, throughput: 1289.40 | 2022-04-11 00:43:18.776 [rank:4] [train], epoch: 46/50, iter: 100/834, loss: 0.23824, top1: 0.77562, throughput: 1310.35 | 2022-04-11 00:43:18.780 [rank:7] [train], epoch: 46/50, iter: 100/834, loss: 0.23617, top1: 0.77995, throughput: 1291.28 | 2022-04-11 00:43:18.781 [rank:6] [train], epoch: 46/50, iter: 200/834, loss: 0.23631, top1: 0.78115, throughput: 1316.35 | 2022-04-11 00:43:33.360 [rank:0] [train], epoch: 46/50, iter: 200/834, loss: 0.23810, top1: 0.77359, throughput: 1316.55 | 2022-04-11 00:43:33.360 [rank:2] [train], epoch: 46/50, iter: 200/834, loss: 0.23885, top1: 0.77411, throughput: 1316.70 | 2022-04-11 00:43:33.358 [rank:1] [train], epoch: 46/50, iter: 200/834, loss: 0.23558, top1: 0.78047, throughput: 1316.69 | 2022-04-11 00:43:33.360 [rank:4] [train], epoch: 46/50, iter: 200/834, loss: 0.23626, top1: 0.77885, throughput: 1316.92 | 2022-04-11 00:43:33.359 [rank:7] [train], epoch: 46/50, iter: 200/834, loss: 0.23617, top1: 0.78073, throughput: 1317.03 | 2022-04-11 00:43:33.359 [rank:5] [train], epoch: 46/50, iter: 200/834, loss: 0.23715, top1: 0.77635, throughput: 1316.38 | 2022-04-11 00:43:33.360 [rank:3] [train], epoch: 46/50, iter: 200/834, loss: 0.23597, top1: 0.77969, throughput: 1316.33 | 2022-04-11 00:43:33.362 [rank:5] [train], epoch: 46/50, iter: 300/834, loss: 0.23547, top1: 0.77859, throughput: 1304.91 | 2022-04-11 
00:43:48.073 [rank:3] [train], epoch: 46/50, iter: 300/834, loss: 0.23785, top1: 0.77953, throughput: 1304.84 | 2022-04-11 00:43:48.077 [rank:4] [train], epoch: 46/50, iter: 300/834, loss: 0.23730, top1: 0.77755, throughput: 1304.88 | 2022-04-11 00:43:48.073 [rank:6] [train], epoch: 46/50, iter: 300/834, loss: 0.23800, top1: 0.77771, throughput: 1304.85 | 2022-04-11 00:43:48.074 [rank:7] [train], epoch: 46/50, iter: 300/834, loss: 0.23440, top1: 0.78448, throughput: 1304.74 | 2022-04-11 00:43:48.075 [rank:1] [train], epoch: 46/50, iter: 300/834, loss: 0.23670, top1: 0.77656, throughput: 1304.74 | 2022-04-11 00:43:48.075 [rank:2] [train], epoch: 46/50, iter: 300/834, loss: 0.23692, top1: 0.77656, throughput: 1304.50 | 2022-04-11 00:43:48.076 [rank:0] [train], epoch: 46/50, iter: 300/834, loss: 0.23404, top1: 0.78474, throughput: 1304.70 | 2022-04-11 00:43:48.076 [rank:6] [train], epoch: 46/50, iter: 400/834, loss: 0.23478, top1: 0.78359, throughput: 1314.03 | 2022-04-11 00:44:02.686 [rank:5] [train], epoch: 46/50, iter: 400/834, loss: 0.23673, top1: 0.77990, throughput: 1313.90 | 2022-04-11 00:44:02.686 [rank:4] [train], epoch: 46/50, iter: 400/834, loss: 0.23625, top1: 0.77620, throughput: 1313.87 | 2022-04-11 00:44:02.687 [rank:2] [train], epoch: 46/50, iter: 400/834, loss: 0.23362, top1: 0.78422, throughput: 1314.16 | 2022-04-11 00:44:02.686 [rank:7] [train], epoch: 46/50, iter: 400/834, loss: 0.23434, top1: 0.78443, throughput: 1313.88 | 2022-04-11 00:44:02.688 [rank:1] [train], epoch: 46/50, iter: 400/834, loss: 0.23707, top1: 0.77328, throughput: 1313.89 | 2022-04-11 00:44:02.688 [rank:3] [train], epoch: 46/50, iter: 400/834, loss: 0.23696, top1: 0.77875, throughput: 1313.96 | 2022-04-11 00:44:02.689 [rank:0] [train], epoch: 46/50, iter: 400/834, loss: 0.23576, top1: 0.77870, throughput: 1313.79 | 2022-04-11 00:44:02.690 [rank:4] [train], epoch: 46/50, iter: 500/834, loss: 0.23543, top1: 0.77651, throughput: 1316.67 | 2022-04-11 00:44:17.269 [rank:2] [train], 
epoch: 46/50, iter: 500/834, loss: 0.23626, top1: 0.77719, throughput: 1316.52 | 2022-04-11 00:44:17.270 [rank:6] [train], epoch: 46/50, iter: 500/834, loss: 0.23574, top1: 0.78109, throughput: 1316.60 | 2022-04-11 00:44:17.269 [rank:3] [train], epoch: 46/50, iter: 500/834, loss: 0.23438, top1: 0.78182, throughput: 1316.71[rank:7] [train], epoch: 46/50, iter: 500/834, loss: 0.23709, top1: 0.77698, throughput: 1316.73 | 2022-04-11 00:44:17.270| 2022-04-11 00:44:17.271 [rank:5] [train], epoch: 46/50, iter: 500/834, loss: 0.23753, top1: 0.77323, throughput: 1316.49 | 2022-04-11 00:44:17.270 [rank:1] [train], epoch: 46/50, iter: 500/834, loss: 0.23683, top1: 0.77776, throughput: 1316.76 | 2022-04-11 00:44:17.270 [rank:0] [train], epoch: 46/50, iter: 500/834, loss: 0.23459, top1: 0.78198, throughput: 1316.85 | 2022-04-11 00:44:17.270 [rank:5] [train], epoch: 46/50, iter: 600/834, loss: 0.23699, top1: 0.77823, throughput: 1314.45 | 2022-04-11 00:44:31.877 [rank:7] [train], epoch: 46/50, iter: 600/834, loss: 0.23541, top1: 0.78651, throughput: 1314.20 | 2022-04-11 00:44:31.879 [rank:6] [train], epoch: 46/50, iter: 600/834, loss: 0.23501, top1: 0.78130, throughput: 1314.21 | 2022-04-11 00:44:31.878 [rank:3] [train], epoch: 46/50, iter: 600/834, loss: 0.23593, top1: 0.77870, throughput: 1314.34 | 2022-04-11 00:44:31.879 [rank:1] [train], epoch: 46/50, iter: 600/834, loss: 0.23604, top1: 0.78219, throughput: 1314.15 | 2022-04-11 00:44:31.880 [rank:0] [train], epoch: 46/50, iter: 600/834, loss: 0.23705, top1: 0.77807, throughput: 1314.31 | 2022-04-11 00:44:31.879 [rank:4] [train], epoch: 46/50, iter: 600/834, loss: 0.23576, top1: 0.78099, throughput: 1314.14 | 2022-04-11 00:44:31.879 [rank:2] [train], epoch: 46/50, iter: 600/834, loss: 0.23697, top1: 0.78146, throughput: 1314.24 | 2022-04-11 00:44:31.879 [rank:4] [train], epoch: 46/50, iter: 700/834, loss: 0.23838, top1: 0.77318, throughput: 1315.10 | 2022-04-11 00:44:46.479 [rank:5] [train], epoch: 46/50, iter: 700/834, 
loss: 0.23643, top1: 0.77661, throughput: 1314.94 | 2022-04-11 00:44:46.479 [rank:6] [train], epoch: 46/50, iter: 700/834, loss: 0.23718, top1: 0.78000, throughput: 1315.00 | 2022-04-11 00:44:46.479 [rank:2] [train], epoch: 46/50, iter: 700/834, loss: 0.23661, top1: 0.78057, throughput: 1314.81 | 2022-04-11 00:44:46.482 [rank:1] [train], epoch: 46/50, iter: 700/834, loss: 0.23761, top1: 0.77719, throughput: 1315.05 | 2022-04-11 00:44:46.480 [rank:3] [train], epoch: 46/50, iter: 700/834, loss: 0.23517, top1: 0.77953, throughput: 1314.86[rank:0] [train], epoch: 46/50, iter: 700/834, loss: 0.23325, top1: 0.78370, throughput: 1314.96 | 2022-04-11 00:44:46.481| 2022-04-11 00:44:46.480 [rank:7] [train], epoch: 46/50, iter: 700/834, loss: 0.23793, top1: 0.77458, throughput: 1315.00 | 2022-04-11 00:44:46.480 [rank:7] [train], epoch: 46/50, iter: 800/834, loss: 0.23561, top1: 0.77911, throughput: 1316.14 | 2022-04-11 00:45:01.068 [rank:6] [train], epoch: 46/50, iter: 800/834, loss: 0.23878, top1: 0.77740, throughput: 1316.13 | 2022-04-11 00:45:01.067 [rank:4] [train], epoch: 46/50, iter: 800/834, loss: 0.23759, top1: 0.77729, throughput: 1316.13 | 2022-04-11 00:45:01.067 [rank:1] [train], epoch: 46/50, iter: 800/834, loss: 0.23501, top1: 0.78172, throughput: 1316.07 | 2022-04-11 00:45:01.069 [rank:3] [train], epoch: 46/50, iter: 800/834, loss: 0.23581, top1: 0.78047, throughput: 1315.90 | 2022-04-11 00:45:01.072 [rank:2] [train], epoch: 46/50, iter: 800/834, loss: 0.23441, top1: 0.78359, throughput: 1316.37 | 2022-04-11 00:45:01.068 [rank:5] [train], epoch: 46/50, iter: 800/834, loss: 0.23432, top1: 0.78260, throughput: 1315.93 | 2022-04-11 00:45:01.069 [rank:0] [train], epoch: 46/50, iter: 800/834, loss: 0.23531, top1: 0.78068, throughput: 1316.03 | 2022-04-11 00:45:01.069 [rank:6] [train], epoch: 46/50, iter: 834/834, loss: 0.23821, top1: 0.77466, throughput: 1313.59 | 2022-04-11 00:45:06.037 [rank:7] [train], epoch: 46/50, iter: 834/834, loss: 0.23784, top1: 0.77528, 
throughput: 1313.47 | 2022-04-11 00:45:06.038 [rank:4] [train], epoch: 46/50, iter: 834/834, loss: 0.23739, top1: 0.77865, throughput: 1312.94 | 2022-04-11 00:45:06.039 [rank:1] [train], epoch: 46/50, iter: 834/834, loss: 0.23579, top1: 0.77604, throughput: 1313.54 | 2022-04-11 00:45:06.039 [rank:5] [train], epoch: 46/50, iter: 834/834, loss: 0.23496, top1: 0.77757, throughput: 1313.56 | 2022-04-11 00:45:06.039 [rank:2] [train], epoch: 46/50, iter: 834/834, loss: 0.23395, top1: 0.78294, throughput: 1313.19 | 2022-04-11 00:45:06.039 [rank:0] [train], epoch: 46/50, iter: 834/834, loss: 0.23642, top1: 0.77528, throughput: 1313.48 | 2022-04-11 00:45:06.039 [rank:3] [train], epoch: 46/50, iter: 834/834, loss: 0.23349, top1: 0.78631, throughput: 1314.00 | 2022-04-11 00:45:06.040 [rank:4] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.75376, throughput: 579.04 | 2022-04-11 00:45:16.833 [rank:7] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.76384, throughput: 578.82 | 2022-04-11 00:45:16.836 [rank:0] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.76480, throughput: 578.05 | 2022-04-11 00:45:16.851 [rank:6] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.76368, throughput: 574.48 | 2022-04-11 00:45:16.916 [rank:2] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.75328, throughput: 574.45 | 2022-04-11 00:45:16.919 [rank:3] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.76256, throughput: 574.07 | 2022-04-11 00:45:16.927 [rank:1] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.75840, throughput: 564.20 | 2022-04-11 00:45:17.116 [rank:5] [eval], epoch: 46/50, iter: 125/125, loss: 0.00000, top1: 0.74656, throughput: 560.62 | 2022-04-11 00:45:17.187 [rank:6] [train], epoch: 47/50, iter: 100/834, loss: 0.23574, top1: 0.78161, throughput: 1295.91 | 2022-04-11 00:45:31.732 [rank:4] [train], epoch: 47/50, iter: 100/834, loss: 0.23500, top1: 0.78104, throughput: 1288.62 | 2022-04-11 00:45:31.733 
[rank:3] [train], epoch: 47/50, iter: 100/834, loss: 0.23420, top1: 0.78167, throughput: 1296.70 | 2022-04-11 00:45:31.734 [rank:1] [train], epoch: 47/50, iter: 100/834, loss: 0.23301, top1: 0.78714, throughput: 1313.50 | 2022-04-11 00:45:31.734 [rank:7] [train], epoch: 47/50, iter: 100/834, loss: 0.23569, top1: 0.78286, throughput: 1288.74 | 2022-04-11 00:45:31.734 [rank:5] [train], epoch: 47/50, iter: 100/834, loss: 0.23390, top1: 0.78458, throughput: 1319.95 | 2022-04-11 00:45:31.733 [rank:2] [train], epoch: 47/50, iter: 100/834, loss: 0.23448, top1: 0.77984, throughput: 1295.97 | 2022-04-11 00:45:31.734 [rank:0] [train], epoch: 47/50, iter: 100/834, loss: 0.23357, top1: 0.78370, throughput: 1289.92 | 2022-04-11 00:45:31.736 [rank:6] [train], epoch: 47/50, iter: 200/834, loss: 0.23425, top1: 0.78224, throughput: 1315.20 | 2022-04-11 00:45:46.331 [rank:4] [train], epoch: 47/50, iter: 200/834, loss: 0.23484, top1: 0.78193, throughput: 1315.29 | 2022-04-11 00:45:46.330 [rank:5] [train], epoch: 47/50, iter: 200/834, loss: 0.23584, top1: 0.77896, throughput: 1315.31 | 2022-04-11 00:45:46.331 [rank:2] [train], epoch: 47/50, iter: 200/834, loss: 0.23288, top1: 0.78714, throughput: 1315.28 | 2022-04-11 00:45:46.332 [rank:0] [train], epoch: 47/50, iter: 200/834, loss: 0.23507, top1: 0.78271, throughput: 1315.37 | 2022-04-11 00:45:46.333 [rank:1] [train], epoch: 47/50, iter: 200/834, loss: 0.23444, top1: 0.78474, throughput: 1315.20 | 2022-04-11 00:45:46.332 [rank:3] [train], epoch: 47/50, iter: 200/834, loss: 0.23517, top1: 0.78281, throughput: 1314.96 | 2022-04-11 00:45:46.335 [rank:7] [train], epoch: 47/50, iter: 200/834, loss: 0.23387, top1: 0.78406, throughput: 1314.94 | 2022-04-11 00:45:46.336 [rank:4] [train], epoch: 47/50, iter: 300/834, loss: 0.23481, top1: 0.78219, throughput: 1315.56 | 2022-04-11 00:46:00.925 [rank:5] [train], epoch: 47/50, iter: 300/834, loss: 0.23369, top1: 0.78573, throughput: 1315.61 | 2022-04-11 00:46:00.925 [rank:0] [train], epoch: 47/50, 
iter: 300/834, loss: 0.23466, top1: 0.77797, throughput: 1315.73 | 2022-04-11 00:46:00.925 [rank:2] [train], epoch: 47/50, iter: 300/834, loss: 0.23406, top1: 0.78453, throughput: 1315.62 | 2022-04-11 00:46:00.926 [rank:1] [train], epoch: 47/50, iter: 300/834, loss: 0.23335, top1: 0.78536, throughput: 1315.50 | 2022-04-11 00:46:00.928 [rank:7] [train], epoch: 47/50, iter: 300/834, loss: 0.23609, top1: 0.78005, throughput: 1315.78 | 2022-04-11 00:46:00.928 [rank:6] [train], epoch: 47/50, iter: 300/834, loss: 0.23514, top1: 0.78391, throughput: 1315.34 | 2022-04-11 00:46:00.928 [rank:3] [train], epoch: 47/50, iter: 300/834, loss: 0.23323, top1: 0.78911, throughput: 1315.52 | 2022-04-11 00:46:00.930 [rank:6] [train], epoch: 47/50, iter: 400/834, loss: 0.23545, top1: 0.78276, throughput: 1315.36 | 2022-04-11 00:46:15.524 [rank:4] [train], epoch: 47/50, iter: 400/834, loss: 0.23446, top1: 0.78385, throughput: 1315.07 | 2022-04-11 00:46:15.525 [rank:2] [train], epoch: 47/50, iter: 400/834, loss: 0.23492, top1: 0.78286, throughput: 1314.86 | 2022-04-11 00:46:15.528 [rank:7] [train], epoch: 47/50, iter: 400/834, loss: 0.23461, top1: 0.78505, throughput: 1315.22 | 2022-04-11 00:46:15.526 [rank:3] [train], epoch: 47/50, iter: 400/834, loss: 0.23393, top1: 0.78156, throughput: 1315.28 | 2022-04-11 00:46:15.528 [rank:1] [train], epoch: 47/50, iter: 400/834, loss: 0.23590, top1: 0.78234, throughput: 1315.12 | 2022-04-11 00:46:15.527 [rank:5] [train], epoch: 47/50, iter: 400/834, loss: 0.23460, top1: 0.78115, throughput: 1314.87 | 2022-04-11 00:46:15.527 [rank:0] [train], epoch: 47/50, iter: 400/834, loss: 0.23288, top1: 0.78719, throughput: 1314.97 | 2022-04-11 00:46:15.526 [rank:5] [train], epoch: 47/50, iter: 500/834, loss: 0.23334, top1: 0.78656, throughput: 1315.45 | 2022-04-11 00:46:30.123 [rank:4] [train], epoch: 47/50, iter: 500/834, loss: 0.23340, top1: 0.78557, throughput: 1315.24 | 2022-04-11 00:46:30.123 [rank:1] [train], epoch: 47/50, iter: 500/834, loss: 0.23308, 
top1: 0.78552, throughput: 1315.26 | 2022-04-11 00:46:30.125 [rank:6] [train], epoch: 47/50, iter: 500/834, loss: 0.23534, top1: 0.78255, throughput: 1315.11 | 2022-04-11 00:46:30.124 [rank:3] [train], epoch: 47/50, iter: 500/834, loss: 0.23479, top1: 0.78448, throughput: 1315.29 | 2022-04-11 00:46:30.125 [rank:0] [train], epoch: 47/50, iter: 500/834, loss: 0.23603, top1: 0.77911, throughput: 1315.26 | 2022-04-11 00:46:30.124 [rank:2] [train], epoch: 47/50, iter: 500/834, loss: 0.23329, top1: 0.78609, throughput: 1315.26 | 2022-04-11 00:46:30.126 [rank:7] [train], epoch: 47/50, iter: 500/834, loss: 0.23382, top1: 0.78505, throughput: 1314.88 | 2022-04-11 00:46:30.128 [rank:5] [train], epoch: 47/50, iter: 600/834, loss: 0.23652, top1: 0.77786, throughput: 1317.16 | 2022-04-11 00:46:44.699 [rank:2] [train], epoch: 47/50, iter: 600/834, loss: 0.23651, top1: 0.77943, throughput: 1317.47 | 2022-04-11 00:46:44.699 [rank:6] [train], epoch: 47/50, iter: 600/834, loss: 0.23379, top1: 0.78333, throughput: 1317.44 | 2022-04-11 00:46:44.698 [rank:4] [train], epoch: 47/50, iter: 600/834, loss: 0.23323, top1: 0.78380, throughput: 1317.25 | 2022-04-11 00:46:44.699 [rank:1] [train], epoch: 47/50, iter: 600/834, loss: 0.23398, top1: 0.78297, throughput: 1317.31 | 2022-04-11 00:46:44.700 [rank:3] [train], epoch: 47/50, iter: 600/834, loss: 0.23393, top1: 0.78490, throughput: 1317.45 | 2022-04-11 00:46:44.699 [rank:7] [train], epoch: 47/50, iter: 600/834, loss: 0.23461, top1: 0.78240, throughput: 1317.62 | 2022-04-11 00:46:44.700 [rank:0] [train], epoch: 47/50, iter: 600/834, loss: 0.23366, top1: 0.78391, throughput: 1317.26 | 2022-04-11 00:46:44.700 [rank:5] [train], epoch: 47/50, iter: 700/834, loss: 0.23419, top1: 0.78411, throughput: 1308.94 | 2022-04-11 00:46:59.368 [rank:3] [train], epoch: 47/50, iter: 700/834, loss: 0.23357, top1: 0.78474, throughput: 1308.71 | 2022-04-11 00:46:59.370 [rank:1] [train], epoch: 47/50, iter: 700/834, loss: 0.23598, top1: 0.78000, throughput: 
1308.89 | 2022-04-11 00:46:59.369 [rank:7] [train], epoch: 47/50, iter: 700/834, loss: 0.23465, top1: 0.78094, throughput: 1308.86 | 2022-04-11 00:46:59.369 [rank:6] [train], epoch: 47/50, iter: 700/834, loss: 0.23368, top1: 0.78495, throughput: 1308.63 | 2022-04-11 00:46:59.369 [rank:2] [train], epoch: 47/50, iter: 700/834, loss: 0.23261, top1: 0.78667, throughput: 1308.78 | 2022-04-11 00:46:59.369 [rank:4] [train], epoch: 47/50, iter: 700/834, loss: 0.23382, top1: 0.78464, throughput: 1308.60 | 2022-04-11 00:46:59.371 [rank:0] [train], epoch: 47/50, iter: 700/834, loss: 0.23288, top1: 0.78521, throughput: 1308.77 | 2022-04-11 00:46:59.370 [rank:5] [train], epoch: 47/50, iter: 800/834, loss: 0.23388, top1: 0.78255, throughput: 1313.45 | 2022-04-11 00:47:13.986 [rank:4] [train], epoch: 47/50, iter: 800/834, loss: 0.23075, top1: 0.79151, throughput: 1313.70 | 2022-04-11 00:47:13.986 [rank:7] [train], epoch: 47/50, iter: 800/834, loss: 0.23702, top1: 0.77854, throughput: 1313.50 | 2022-04-11 00:47:13.986 [rank:2] [train], epoch: 47/50, iter: 800/834, loss: 0.23275, top1: 0.78635, throughput: 1313.29 | 2022-04-11 00:47:13.989 [rank:0] [train], epoch: 47/50, iter: 800/834, loss: 0.23570, top1: 0.77708, throughput: 1313.62 | 2022-04-11 00:47:13.986 [rank:1] [train], epoch: 47/50, iter: 800/834, loss: 0.23367, top1: 0.78474, throughput: 1313.24 | 2022-04-11 00:47:13.989 [rank:3] [train], epoch: 47/50, iter: 800/834, loss: 0.23414, top1: 0.78495, throughput: 1313.32 | 2022-04-11 00:47:13.989 [rank:6] [train], epoch: 47/50, iter: 800/834, loss: 0.23224, top1: 0.78771, throughput: 1313.42 | 2022-04-11 00:47:13.988 [rank:5] [train], epoch: 47/50, iter: 834/834, loss: 0.23159, top1: 0.78738, throughput: 1316.69 | 2022-04-11 00:47:18.944 [rank:4] [train], epoch: 47/50, iter: 834/834, loss: 0.23460, top1: 0.78064, throughput: 1316.59 | 2022-04-11 00:47:18.944 [rank:1] [train], epoch: 47/50, iter: 834/834, loss: 0.23467, top1: 0.78339, throughput: 1317.14 | 2022-04-11 
00:47:18.945 [rank:6] [train], epoch: 47/50, iter: 834/834, loss: 0.23160, top1: 0.78539, throughput: 1316.70 | 2022-04-11 00:47:18.946 [rank:2] [train], epoch: 47/50, iter: 834/834, loss: 0.23075, top1: 0.79366, throughput: 1316.71 | 2022-04-11 00:47:18.947 [rank:7] [train], epoch: 47/50, iter: 834/834, loss: 0.23460, top1: 0.78232, throughput: 1315.93 | 2022-04-11 00:47:18.947 [rank:0] [train], epoch: 47/50, iter: 834/834, loss: 0.23919, top1: 0.77834, throughput: 1315.31 | 2022-04-11 00:47:18.949 [rank:3] [train], epoch: 47/50, iter: 834/834, loss: 0.23726, top1: 0.77390, throughput: 1315.90 | 2022-04-11 00:47:18.950 [rank:7] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.76688, throughput: 578.28 | 2022-04-11 00:47:29.755 [rank:0] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.76384, throughput: 575.75 | 2022-04-11 00:47:29.805 [rank:4] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.75536, throughput: 572.42 | 2022-04-11 00:47:29.863 [rank:3] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.76208, throughput: 569.69 | 2022-04-11 00:47:29.921 [rank:2] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.75440, throughput: 569.50 | 2022-04-11 00:47:29.921 [rank:6] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.76560, throughput: 568.86 | 2022-04-11 00:47:29.933 [rank:5] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.75264, throughput: 566.50 | 2022-04-11 00:47:29.976 [rank:1] [eval], epoch: 47/50, iter: 125/125, loss: 0.00000, top1: 0.76144, throughput: 560.00 | 2022-04-11 00:47:30.106 [rank:2] [train], epoch: 48/50, iter: 100/834, loss: 0.23429, top1: 0.78172, throughput: 1305.28 | 2022-04-11 00:47:44.631 [rank:1] [train], epoch: 48/50, iter: 100/834, loss: 0.23191, top1: 0.79068, throughput: 1321.66 | 2022-04-11 00:47:44.633 [rank:7] [train], epoch: 48/50, iter: 100/834, loss: 0.23169, top1: 0.78776, throughput: 1290.50 | 2022-04-11 00:47:44.633 [rank:6] [train], epoch: 48/50, 
iter: 100/834, loss: 0.23377, top1: 0.78729, throughput: 1305.94 | 2022-04-11 00:47:44.635 [rank:5] [train], epoch: 48/50, iter: 100/834, loss: 0.23258, top1: 0.78859, throughput: 1310.05 | 2022-04-11 00:47:44.632 [rank:0] [train], epoch: 48/50, iter: 100/834, loss: 0.23200, top1: 0.78771, throughput: 1294.77 | 2022-04-11 00:47:44.634 [rank:4] [train], epoch: 48/50, iter: 100/834, loss: 0.23292, top1: 0.78901, throughput: 1299.76 | 2022-04-11 00:47:44.635 [rank:3] [train], epoch: 48/50, iter: 100/834, loss: 0.23432, top1: 0.78609, throughput: 1304.90 | 2022-04-11 00:47:44.635 [rank:6] [train], epoch: 48/50, iter: 200/834, loss: 0.23392, top1: 0.78292, throughput: 1316.90 | 2022-04-11 00:47:59.214 [rank:2] [train], epoch: 48/50, iter: 200/834, loss: 0.23475, top1: 0.78062, throughput: 1316.53 | 2022-04-11 00:47:59.215 [rank:7] [train], epoch: 48/50, iter: 200/834, loss: 0.23472, top1: 0.78057, throughput: 1316.68 | 2022-04-11 00:47:59.215 [rank:5] [train], epoch: 48/50, iter: 200/834, loss: 0.23159, top1: 0.79198, throughput: 1316.53 | 2022-04-11 00:47:59.216 [rank:4] [train], epoch: 48/50, iter: 200/834, loss: 0.23024, top1: 0.79188, throughput: 1316.90 | 2022-04-11 00:47:59.214 [rank:3] [train], epoch: 48/50, iter: 200/834, loss: 0.23408, top1: 0.78484, throughput: 1316.76 | 2022-04-11 00:47:59.216 [rank:1] [train], epoch: 48/50, iter: 200/834, loss: 0.23269, top1: 0.78552, throughput: 1316.51 | 2022-04-11 00:47:59.217 [rank:0] [train], epoch: 48/50, iter: 200/834, loss: 0.23288, top1: 0.78849, throughput: 1316.66 | 2022-04-11 00:47:59.216 [rank:6] [train], epoch: 48/50, iter: 300/834, loss: 0.23278, top1: 0.78641, throughput: 1314.96 | 2022-04-11 00:48:13.816 [rank:5] [train], epoch: 48/50, iter: 300/834, loss: 0.23423, top1: 0.78552, throughput: 1315.06 | 2022-04-11 00:48:13.816 [rank:1] [train], epoch: 48/50, iter: 300/834, loss: 0.23200, top1: 0.79109, throughput: 1315.08 | 2022-04-11 00:48:13.817 [rank:2] [train], epoch: 48/50, iter: 300/834, loss: 0.23354, 
top1: 0.78276, throughput: 1314.91 | 2022-04-11 00:48:13.817 [rank:7] [train], epoch: 48/50, iter: 300/834, loss: 0.23322, top1: 0.78906, throughput: 1314.90 | 2022-04-11 00:48:13.817 [rank:4] [train], epoch: 48/50, iter: 300/834, loss: 0.23431, top1: 0.78260, throughput: 1314.78 | 2022-04-11 00:48:13.817 [rank:0] [train], epoch: 48/50, iter: 300/834, loss: 0.23379, top1: 0.78292, throughput: 1314.95 | 2022-04-11 00:48:13.817 [rank:3] [train], epoch: 48/50, iter: 300/834, loss: 0.23294, top1: 0.78312, throughput: 1314.84 | 2022-04-11 00:48:13.819 [rank:4] [train], epoch: 48/50, iter: 400/834, loss: 0.23209, top1: 0.78802, throughput: 1315.92 | 2022-04-11 00:48:28.408 [rank:5] [train], epoch: 48/50, iter: 400/834, loss: 0.23325, top1: 0.78365, throughput: 1315.80 | 2022-04-11 00:48:28.408 [rank:6] [train], epoch: 48/50, iter: 400/834, loss: 0.23160, top1: 0.78880, throughput: 1315.47 | 2022-04-11 00:48:28.411 [rank:3] [train], epoch: 48/50, iter: 400/834, loss: 0.23365, top1: 0.78328, throughput: 1315.69 | 2022-04-11 00:48:28.412 [rank:0] [train], epoch: 48/50, iter: 400/834, loss: 0.23408, top1: 0.78292, throughput: 1315.69 | 2022-04-11 00:48:28.410 [rank:1] [train], epoch: 48/50, iter: 400/834, loss: 0.23261, top1: 0.78927, throughput: 1315.44 | 2022-04-11 00:48:28.413 [rank:2] [train], epoch: 48/50, iter: 400/834, loss: 0.23478, top1: 0.78156, throughput: 1315.46 | 2022-04-11 00:48:28.412 [rank:7] [train], epoch: 48/50, iter: 400/834, loss: 0.23555, top1: 0.78234, throughput: 1315.51 | 2022-04-11 00:48:28.412 [rank:5] [train], epoch: 48/50, iter: 500/834, loss: 0.23355, top1: 0.78479, throughput: 1315.23 | 2022-04-11 00:48:43.006 [rank:2] [train], epoch: 48/50, iter: 500/834, loss: 0.23199, top1: 0.78885, throughput: 1315.37 | 2022-04-11 00:48:43.009 [rank:6] [train], epoch: 48/50, iter: 500/834, loss: 0.23093, top1: 0.79094, throughput: 1315.56 | 2022-04-11 00:48:43.006 [rank:1] [train], epoch: 48/50, iter: 500/834, loss: 0.23148, top1: 0.79047, throughput: 
1315.54 | 2022-04-11 00:48:43.008 [rank:4] [train], epoch: 48/50, iter: 500/834, loss: 0.23135, top1: 0.79026, throughput: 1315.04 | 2022-04-11 00:48:43.008 [rank:3] [train], epoch: 48/50, iter: 500/834, loss: 0.23428, top1: 0.78328, throughput: 1315.29 | 2022-04-11 00:48:43.009 [rank:7] [train], epoch: 48/50, iter: 500/834, loss: 0.23249, top1: 0.78750, throughput: 1315.28 | 2022-04-11 00:48:43.010 [rank:0] [train], epoch: 48/50, iter: 500/834, loss: 0.23291, top1: 0.78536, throughput: 1315.06 | 2022-04-11 00:48:43.011 [rank:5] [train], epoch: 48/50, iter: 600/834, loss: 0.23335, top1: 0.78333, throughput: 1314.01 | 2022-04-11 00:48:57.618 [rank:6] [train], epoch: 48/50, iter: 600/834, loss: 0.23383, top1: 0.78620, throughput: 1313.96 | 2022-04-11 00:48:57.618 [rank:0] [train], epoch: 48/50, iter: 600/834, loss: 0.23397, top1: 0.78807, throughput: 1314.32 | 2022-04-11 00:48:57.619 [rank:7] [train], epoch: 48/50, iter: 600/834, loss: 0.23151, top1: 0.79443, throughput: 1314.20 | 2022-04-11 00:48:57.620 [rank:4] [train], epoch: 48/50, iter: 600/834, loss: 0.23434, top1: 0.78224, throughput: 1314.13 | 2022-04-11 00:48:57.619 [rank:1] [train], epoch: 48/50, iter: 600/834, loss: 0.23401, top1: 0.78542, throughput: 1314.00 | 2022-04-11 00:48:57.620 [rank:3] [train], epoch: 48/50, iter: 600/834, loss: 0.23310, top1: 0.78589, throughput: 1314.05 | 2022-04-11 00:48:57.620 [rank:2] [train], epoch: 48/50, iter: 600/834, loss: 0.23354, top1: 0.78677, throughput: 1313.99 | 2022-04-11 00:48:57.621 [rank:4] [train], epoch: 48/50, iter: 700/834, loss: 0.23164, top1: 0.78854, throughput: 1312.73 | 2022-04-11 00:49:12.245 [rank:5] [train], epoch: 48/50, iter: 700/834, loss: 0.23270, top1: 0.78542, throughput: 1312.68 | 2022-04-11 00:49:12.245 [rank:1] [train], epoch: 48/50, iter: 700/834, loss: 0.23361, top1: 0.78651, throughput: 1312.70 | 2022-04-11 00:49:12.246 [rank:3] [train], epoch: 48/50, iter: 700/834, loss: 0.23298, top1: 0.78578, throughput: 1312.63 | 2022-04-11 
00:49:12.248 [rank:0] [train], epoch: 48/50, iter: 700/834, loss: 0.23198, top1: 0.78776, throughput: 1312.73 | 2022-04-11 00:49:12.245 [rank:7] [train], epoch: 48/50, iter: 700/834, loss: 0.23266, top1: 0.78510, throughput: 1312.67 | 2022-04-11 00:49:12.246 [rank:2] [train], epoch: 48/50, iter: 700/834, loss: 0.23479, top1: 0.78495, throughput: 1312.50 | 2022-04-11 00:49:12.249 [rank:6] [train], epoch: 48/50, iter: 700/834, loss: 0.23229, top1: 0.79208, throughput: 1312.43 | 2022-04-11 00:49:12.247 [rank:6] [train], epoch: 48/50, iter: 800/834, loss: 0.23101, top1: 0.79063, throughput: 1311.31 | 2022-04-11 00:49:26.889 [rank:2] [train], epoch: 48/50, iter: 800/834, loss: 0.23228, top1: 0.78620, throughput: 1311.36 | 2022-04-11 00:49:26.891 [rank:7] [train], epoch: 48/50, iter: 800/834, loss: 0.23059, top1: 0.79297, throughput: 1311.07 | 2022-04-11 00:49:26.891 [rank:5] [train], epoch: 48/50, iter: 800/834, loss: 0.23251, top1: 0.79094, throughput: 1310.86 | 2022-04-11 00:49:26.891 [rank:3] [train], epoch: 48/50, iter: 800/834, loss: 0.23355, top1: 0.78766, throughput: 1311.08 | 2022-04-11 00:49:26.892 [rank:4] [train], epoch: 48/50, iter: 800/834, loss: 0.23413, top1: 0.78807, throughput: 1310.87 | 2022-04-11 00:49:26.891 [rank:1] [train], epoch: 48/50, iter: 800/834, loss: 0.23312, top1: 0.78448, throughput: 1310.99 | 2022-04-11 00:49:26.892 [rank:0] [train], epoch: 48/50, iter: 800/834, loss: 0.23043, top1: 0.79099, throughput: 1310.96 | 2022-04-11 00:49:26.891 [rank:6] [train], epoch: 48/50, iter: 834/834, loss: 0.23253, top1: 0.78676, throughput: 1308.83 | 2022-04-11 00:49:31.877 [rank:5] [train], epoch: 48/50, iter: 834/834, loss: 0.23289, top1: 0.78646, throughput: 1309.26 | 2022-04-11 00:49:31.877 [rank:4] [train], epoch: 48/50, iter: 834/834, loss: 0.23595, top1: 0.78033, throughput: 1309.44 | 2022-04-11 00:49:31.877 [rank:2] [train], epoch: 48/50, iter: 834/834, loss: 0.23479, top1: 0.78385, throughput: 1309.07 | 2022-04-11 00:49:31.877 [rank:0] [train], 
epoch: 48/50, iter: 834/834, loss: 0.23042, top1: 0.79488, throughput: 1308.94 | 2022-04-11 00:49:31.878 [rank:1] [train], epoch: 48/50, iter: 834/834, loss: 0.23233, top1: 0.78401, throughput: 1309.05 | 2022-04-11 00:49:31.878 [rank:3] [train], epoch: 48/50, iter: 834/834, loss: 0.23242, top1: 0.78646, throughput: 1309.08 | 2022-04-11 00:49:31.879 [rank:7] [train], epoch: 48/50, iter: 834/834, loss: 0.23455, top1: 0.78140, throughput: 1308.45 | 2022-04-11 00:49:31.880 [rank:0] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.76480, throughput: 568.50 | 2022-04-11 00:49:42.872 [rank:7] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.76512, throughput: 568.55 | 2022-04-11 00:49:42.873 [rank:4] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.75904, throughput: 565.54 | 2022-04-11 00:49:42.928 [rank:5] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.75152, throughput: 562.14 | 2022-04-11 00:49:42.996 [rank:3] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.76032, throughput: 562.19 | 2022-04-11 00:49:42.996 [rank:2] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.75152, throughput: 561.67 | 2022-04-11 00:49:43.005 [rank:6] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.76560, throughput: 561.38 | 2022-04-11 00:49:43.010 [rank:1] [eval], epoch: 48/50, iter: 125/125, loss: 0.00000, top1: 0.76304, throughput: 552.27 | 2022-04-11 00:49:43.195 [rank:5] [train], epoch: 49/50, iter: 100/834, loss: 0.23212, top1: 0.78938, throughput: 1300.78 | 2022-04-11 00:49:57.756 [rank:4] [train], epoch: 49/50, iter: 100/834, loss: 0.23196, top1: 0.78891, throughput: 1294.79 | 2022-04-11 00:49:57.757 [rank:6] [train], epoch: 49/50, iter: 100/834, loss: 0.23315, top1: 0.78604, throughput: 1301.99 | 2022-04-11 00:49:57.757 [rank:2] [train], epoch: 49/50, iter: 100/834, loss: 0.23380, top1: 0.78573, throughput: 1301.44 | 2022-04-11 00:49:57.758 [rank:7] [train], epoch: 49/50, iter: 100/834, loss: 0.23026, 
top1: 0.79589, throughput: 1289.79 | 2022-04-11 00:49:57.759 [rank:1] [train], epoch: 49/50, iter: 100/834, loss: 0.23275, top1: 0.78745, throughput: 1318.36 | 2022-04-11 00:49:57.759 [rank:3] [train], epoch: 49/50, iter: 100/834, loss: 0.23366, top1: 0.78677, throughput: 1300.31 | 2022-04-11 00:49:57.762 [rank:0] [train], epoch: 49/50, iter: 100/834, loss: 0.23245, top1: 0.78714, throughput: 1289.73 | 2022-04-11 00:49:57.759 [rank:0] [train], epoch: 49/50, iter: 200/834, loss: 0.23185, top1: 0.78792, throughput: 1312.87 | 2022-04-11 00:50:12.383 [rank:6] [train], epoch: 49/50, iter: 200/834, loss: 0.23222, top1: 0.78823, throughput: 1312.71 | 2022-04-11 00:50:12.383 [rank:1] [train], epoch: 49/50, iter: 200/834, loss: 0.23351, top1: 0.78688, throughput: 1312.78 | 2022-04-11 00:50:12.384 [rank:5] [train], epoch: 49/50, iter: 200/834, loss: 0.23128, top1: 0.79083, throughput: 1312.78 | 2022-04-11 00:50:12.382 [rank:7] [train], epoch: 49/50, iter: 200/834, loss: 0.23155, top1: 0.79182, throughput: 1312.89 | 2022-04-11 00:50:12.383 [rank:4] [train], epoch: 49/50, iter: 200/834, loss: 0.23016, top1: 0.79417, throughput: 1312.63 | 2022-04-11 00:50:12.384 [rank:3] [train], epoch: 49/50, iter: 200/834, loss: 0.23172, top1: 0.78917, throughput: 1312.98 | 2022-04-11 00:50:12.385 [rank:2] [train], epoch: 49/50, iter: 200/834, loss: 0.23401, top1: 0.78760, throughput: 1312.74 | 2022-04-11 00:50:12.384 [rank:4] [train], epoch: 49/50, iter: 300/834, loss: 0.23146, top1: 0.79146, throughput: 1304.45 | 2022-04-11 00:50:27.103 [rank:0] [train], epoch: 49/50, iter: 300/834, loss: 0.23286, top1: 0.78583, throughput: 1304.41 | 2022-04-11 00:50:27.102 [rank:5] [train], epoch: 49/50, iter: 300/834, loss: 0.23123, top1: 0.79135, throughput: 1304.29 | 2022-04-11 00:50:27.102 [rank:1] [train], epoch: 49/50, iter: 300/834, loss: 0.23358, top1: 0.78333, throughput: 1304.41 | 2022-04-11 00:50:27.104 [rank:3] [train], epoch: 49/50, iter: 300/834, loss: 0.23051, top1: 0.79214, throughput: 
1304.44 | 2022-04-11 00:50:27.104 [rank:7] [train], epoch: 49/50, iter: 300/834, loss: 0.23131, top1: 0.79031, throughput: 1304.30 | 2022-04-11 00:50:27.104 [rank:2] [train], epoch: 49/50, iter: 300/834, loss: 0.23222, top1: 0.78755, throughput: 1304.34 | 2022-04-11 00:50:27.104 [rank:6] [train], epoch: 49/50, iter: 300/834, loss: 0.23444, top1: 0.78094, throughput: 1304.40 | 2022-04-11 00:50:27.103 [rank:6] [train], epoch: 49/50, iter: 400/834, loss: 0.23276, top1: 0.78943, throughput: 1315.58 | 2022-04-11 00:50:41.697 [rank:4] [train], epoch: 49/50, iter: 400/834, loss: 0.23208, top1: 0.78979, throughput: 1315.59 | 2022-04-11 00:50:41.697 [rank:5] [train], epoch: 49/50, iter: 400/834, loss: 0.23086, top1: 0.78984, throughput: 1315.45 | 2022-04-11 00:50:41.698 [rank:1] [train], epoch: 49/50, iter: 400/834, loss: 0.23321, top1: 0.78547, throughput: 1315.54 | 2022-04-11 00:50:41.698 [rank:0] [train], epoch: 49/50, iter: 400/834, loss: 0.23104, top1: 0.78901, throughput: 1315.35 | 2022-04-11 00:50:41.699 [rank:2] [train], epoch: 49/50, iter: 400/834, loss: 0.23204, top1: 0.78974, throughput: 1315.54 | 2022-04-11 00:50:41.699 [rank:3] [train], epoch: 49/50, iter: 400/834, loss: 0.23313, top1: 0.78490, throughput: 1315.51 | 2022-04-11 00:50:41.699 [rank:7] [train], epoch: 49/50, iter: 400/834, loss: 0.23285, top1: 0.78625, throughput: 1315.36 | 2022-04-11 00:50:41.701 [rank:2] [train], epoch: 49/50, iter: 500/834, loss: 0.23251, top1: 0.78437, throughput: 1309.33 | 2022-04-11 00:50:56.363 [rank:4] [train], epoch: 49/50, iter: 500/834, loss: 0.23397, top1: 0.78177, throughput: 1309.30 | 2022-04-11 00:50:56.361 [rank:5] [train], epoch: 49/50, iter: 500/834, loss: 0.23287, top1: 0.78802, throughput: 1309.39 | 2022-04-11 00:50:56.361 [rank:0] [train], epoch: 49/50, iter: 500/834, loss: 0.23080, top1: 0.79193, throughput: 1309.38 | 2022-04-11 00:50:56.363 [rank:6] [train], epoch: 49/50, iter: 500/834, loss: 0.23326, top1: 0.78583, throughput: 1309.14 | 2022-04-11 
00:50:56.363 [rank:7] [train], epoch: 49/50, iter: 500/834, loss: 0.23314, top1: 0.78578, throughput: 1309.48 | 2022-04-11 00:50:56.363 [rank:1] [train], epoch: 49/50, iter: 500/834, loss: 0.23223, top1: 0.78776, throughput: 1309.20 | 2022-04-11 00:50:56.364 [rank:3] [train], epoch: 49/50, iter: 500/834, loss: 0.23020, top1: 0.79323, throughput: 1309.01 | 2022-04-11 00:50:56.366 [rank:6] [train], epoch: 49/50, iter: 600/834, loss: 0.23233, top1: 0.78807, throughput: 1315.32 | 2022-04-11 00:51:10.960 [rank:5] [train], epoch: 49/50, iter: 600/834, loss: 0.23407, top1: 0.78495, throughput: 1314.97 | 2022-04-11 00:51:10.962 [rank:2] [train], epoch: 49/50, iter: 600/834, loss: 0.23203, top1: 0.78766, throughput: 1315.09 | 2022-04-11 00:51:10.962 [rank:3] [train], epoch: 49/50, iter: 600/834, loss: 0.23202, top1: 0.78594, throughput: 1315.39 | 2022-04-11 00:51:10.963 [rank:4] [train], epoch: 49/50, iter: 600/834, loss: 0.23482, top1: 0.78193, throughput: 1314.96[rank:7] [train], epoch: 49/50, iter: 600/834, loss: 0.23304, top1: 0.78854, throughput: 1314.99 | 2022-04-11 00:51:10.964 | 2022-04-11 00:51:10.963 [rank:1] [train], epoch: 49/50, iter: 600/834, loss: 0.23245, top1: 0.78786, throughput: 1315.11 | 2022-04-11 00:51:10.963 [rank:0] [train], epoch: 49/50, iter: 600/834, loss: 0.23147, top1: 0.78776, throughput: 1314.66 | 2022-04-11 00:51:10.967 [rank:6] [train], epoch: 49/50, iter: 700/834, loss: 0.23237, top1: 0.78708, throughput: 1314.92 | 2022-04-11 00:51:25.562 [rank:2] [train], epoch: 49/50, iter: 700/834, loss: 0.23081, top1: 0.79167, throughput: 1314.97 | 2022-04-11 00:51:25.564 [rank:5] [train], epoch: 49/50, iter: 700/834, loss: 0.23211, top1: 0.78958, throughput: 1315.14 | 2022-04-11 00:51:25.562 [rank:3] [train], epoch: 49/50, iter: 700/834, loss: 0.23215, top1: 0.78818, throughput: 1314.73 | 2022-04-11 00:51:25.567 [rank:0] [train], epoch: 49/50, iter: 700/834, loss: 0.23206, top1: 0.78844, throughput: 1315.51 | 2022-04-11 00:51:25.562 [rank:1] [train], 
epoch: 49/50, iter: 700/834, loss: 0.23393, top1: 0.78469, throughput: 1314.81 | 2022-04-11 00:51:25.566 [rank:4] [train], epoch: 49/50, iter: 700/834, loss: 0.23189, top1: 0.78995, throughput: 1314.89 | 2022-04-11 00:51:25.565 [rank:7] [train], epoch: 49/50, iter: 700/834, loss: 0.23350, top1: 0.78672, throughput: 1314.71 | 2022-04-11 00:51:25.568 [rank:6] [train], epoch: 49/50, iter: 800/834, loss: 0.23224, top1: 0.78682, throughput: 1312.79 | 2022-04-11 00:51:40.187 [rank:5] [train], epoch: 49/50, iter: 800/834, loss: 0.23221, top1: 0.79047, throughput: 1312.76 | 2022-04-11 00:51:40.187 [rank:4] [train], epoch: 49/50, iter: 800/834, loss: 0.23215, top1: 0.78932, throughput: 1313.00 | 2022-04-11 00:51:40.188 [rank:2] [train], epoch: 49/50, iter: 800/834, loss: 0.23266, top1: 0.78391, throughput: 1312.88 | 2022-04-11 00:51:40.188 [rank:7] [train], epoch: 49/50, iter: 800/834, loss: 0.23314, top1: 0.78135, throughput: 1313.22 | 2022-04-11 00:51:40.188 [rank:3] [train], epoch: 49/50, iter: 800/834, loss: 0.23338, top1: 0.78750, throughput: 1312.99 | 2022-04-11 00:51:40.190 [rank:1] [train], epoch: 49/50, iter: 800/834, loss: 0.23164, top1: 0.78943, throughput: 1312.88 | 2022-04-11 00:51:40.190 [rank:0] [train], epoch: 49/50, iter: 800/834, loss: 0.23163, top1: 0.79083, throughput: 1312.66 | 2022-04-11 00:51:40.189 [rank:5] [train], epoch: 49/50, iter: 834/834, loss: 0.23188, top1: 0.78876, throughput: 1313.58 | 2022-04-11 00:51:45.157 [rank:2] [train], epoch: 49/50, iter: 834/834, loss: 0.23251, top1: 0.78631, throughput: 1313.42 | 2022-04-11 00:51:45.158 [rank:1] [train], epoch: 49/50, iter: 834/834, loss: 0.23103, top1: 0.78937, throughput: 1313.95 | 2022-04-11 00:51:45.159 [rank:0] [train], epoch: 49/50, iter: 834/834, loss: 0.23268, top1: 0.78983, throughput: 1313.76 | 2022-04-11 00:51:45.158 [rank:6] [train], epoch: 49/50, iter: 834/834, loss: 0.22690, top1: 0.80101, throughput: 1312.98 | 2022-04-11 00:51:45.159 [rank:4] [train], epoch: 49/50, iter: 834/834, 
loss: 0.23251, top1: 0.79013, throughput: 1313.01 | 2022-04-11 00:51:45.159 [rank:3] [train], epoch: 49/50, iter: 834/834, loss: 0.23413, top1: 0.78799, throughput: 1313.38 | 2022-04-11 00:51:45.160 [rank:7] [train], epoch: 49/50, iter: 834/834, loss: 0.23179, top1: 0.78661, throughput: 1312.94 | 2022-04-11 00:51:45.160 [rank:0] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.76304, throughput: 572.40 | 2022-04-11 00:51:56.077 [rank:7] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.76576, throughput: 570.73 | 2022-04-11 00:51:56.111 [rank:4] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.75808, throughput: 568.20 | 2022-04-11 00:51:56.159 [rank:2] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.75088, throughput: 566.26 | 2022-04-11 00:51:56.196 [rank:3] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.76032, throughput: 564.91 | 2022-04-11 00:51:56.224 [rank:6] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.76528, throughput: 564.33 | 2022-04-11 00:51:56.234 [rank:5] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.75024, throughput: 559.28 | 2022-04-11 00:51:56.332 [rank:1] [eval], epoch: 49/50, iter: 125/125, loss: 0.00000, top1: 0.76368, throughput: 553.63 | 2022-04-11 00:51:56.448