lucataco / ssd-lora-training

POC for cheaper and faster LoRA training using SSD-1B

  • Public
  • 249 runs
  • L40S
  • GitHub

Input

*file

A .zip or .tar file containing the image files that will be used for fine-tuning

integer

Random seed for reproducible training. Leave empty to use a random seed

integer

Square pixel resolution to which your images will be resized for training

Default: 768

integer

Batch size (per device) for training

Default: 4

integer

Number of epochs to loop through your training dataset

Default: 2000

integer

Number of individual training steps. Takes precedence over num_train_epochs

Default: 500

boolean

Whether to use LoRA training. If set to False, full fine-tuning will be used instead

Default: true

number

Learning rate for the U-Net. We recommend a value between `1e-6` and `1e-5`.

Default: 0.000001

number

Scaling of learning rate for training textual inversion embeddings. Don't alter unless you know what you're doing.

Default: 0.0003

number

Scaling of learning rate for training LoRA embeddings. Don't alter unless you know what you're doing.

Default: 0.0001

integer

Rank of LoRA embeddings. Don't alter unless you know what you're doing.

Default: 32

string

Learning rate scheduler to use for training

Default: "constant"

integer

Number of warmup steps for lr schedulers with warmups.

Default: 100

string

A unique string that will be trained to refer to the concept in the input images. Can be anything, but TOK works well

Default: "TOK"

string

Text which will be used as prefix during automatic captioning. Must contain the `token_string`. For example, if caption text is 'a photo of TOK', automatic captioning will expand to 'a photo of TOK under a bridge', 'a photo of TOK holding a cup', etc.

Default: "a photo of TOK, "

string

Prompt that describes the important part of the image. For example, if you are fine-tuning on your pet, `photo of a dog` would be a good prompt. Prompt-based masking is used to focus the fine-tuning process on the important/salient parts of the image

boolean

If you want to crop the image to `target_size` based on the important parts of the image, set this to True. If you want to crop the image based on face detection, set this to False

Default: true

boolean

Whether to use face detection instead of CLIPSeg for masking. For face applications, we recommend enabling this option.

Default: false

number

How blurry you want the CLIPSeg mask to be. We recommend a value between `0.5` and `1.0`. For a sharper (but more error-prone) mask, decrease this value.

Default: 1

boolean

Verbose output

Default: true

integer

Number of steps between saving checkpoints. Set to a very high number to disable checkpointing if you don't need it.

Default: 999999

string

Filetype of the input images. Can be either `zip` or `tar`. The default is `infer`, which infers the type from the extension of the input file.

Default: "infer"

Run time and cost

This model costs approximately $0.26 to run on Replicate, or 3 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 5 minutes. The predict time for this model varies significantly based on the inputs.

Readme

About

This is a hacked-together, proof-of-concept model to train your own SSD-1B LoRAs. At this time, it does not support the standard Replicate training method (`replicate.trainings.create`).

I took the original SDXL parameters and simply halved `num_train_epochs` and `max_train_steps`.

Goal: to create SSD-1B LoRAs in half the time and at half the cost of SDXL, with no loss in quality.

All you need are your training images in a zip file; select `use_face_detection_instead` if you are training on faces. A sketch of preparing the zip follows below.
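For reference, packaging a folder of images with Python's standard library might look like this (the directory and file names are illustrative):

```python
import zipfile
from pathlib import Path

# Bundle every JPEG/PNG in a folder into the archive the trainer expects.
image_dir = Path("my_training_photos")  # illustrative path
with zipfile.ZipFile("training_images.zip", "w") as zf:
    for img in sorted(image_dir.iterdir()):
        if img.suffix.lower() in {".jpg", ".jpeg", ".png"}:
            zf.write(img, arcname=img.name)
```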

Run inference on SSD-1B LoRAs here
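Assuming the companion inference model follows the same prediction pattern, a call might look like the sketch below. The model name, version, and input fields here are all assumptions, so check the linked model page for the actual schema:

```python
import replicate

# Hypothetical: model name, version hash, and input fields are assumptions.
image = replicate.run(
    "lucataco/ssd-lora-inference:<version>",
    input={
        "prompt": "a photo of TOK wearing a red hat",
        # URL of the trained weights archive returned by the training run
        "lora_url": "https://replicate.delivery/.../trained_model.tar",
    },
)
print(image)
```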