A note about BrushData! #86

Description

@zhanjiahui

I downloaded BrushData from the following link:
BrushData on Hugging Face from ramdom123123

Then my training terminated abnormally because one of the tar files was corrupted.
I asked ChatGPT to write a script to help me detect corrupted tar files:

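  import os
  import tarfile
  
  tar_dir = '/your/path/BrushData/'
  
  # Check every .tar file in the directory, in name order.
  for tar_file in sorted(os.listdir(tar_dir)):
      if not tar_file.endswith('.tar'):
          continue
      path = os.path.join(tar_dir, tar_file)
      try:
          with tarfile.open(path, 'r') as tar:
              # Walking all member headers fails on a truncated or damaged archive.
              tar.getmembers()
          print(f"{path} is OK.")
      except (tarfile.TarError, EOFError, OSError) as e:
          # ReadError is a subclass of TarError; EOFError and OSError are caught
          # as well in case a damaged file or I/O failure surfaces that way.
          print(f"{tar_file} is corrupted: {e}")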

In the end, I found that 00185.tar was corrupted.

Additionally, according to a discussion, the 00182.tar file is missing as well.
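
Since a shard can be missing outright and not only corrupted, a small sketch like the one below could list gaps in the numbering before training starts. It assumes the shards are named as zero-padded integers (as in 00185.tar); the function name `find_missing_shards` and its `first`/`last` parameters are hypothetical and not part of BrushData or any library.

```python
import os
import re


def find_missing_shards(tar_dir, first=None, last=None):
    """Return the names of numbered .tar shards that are absent from tar_dir.

    Assumption: shards are named like 00185.tar (zero-padded integer + .tar).
    If first/last are not given, the range is inferred from the files present,
    so a gap at the very start or end of the dataset will not be noticed.
    """
    pattern = re.compile(r"^(\d+)\.tar$")
    present = {}  # shard number -> digit width of its file name
    for name in os.listdir(tar_dir):
        match = pattern.match(name)
        if match:
            present[int(match.group(1))] = len(match.group(1))
    if not present:
        return []
    width = max(present.values())
    lo = min(present) if first is None else first
    hi = max(present) if last is None else last
    return [f"{i:0{width}d}.tar" for i in range(lo, hi + 1) if i not in present]


# Example usage (the path is a placeholder):
# for name in find_missing_shards('/your/path/BrushData/'):
#     print(f"{name} is missing.")
```

If the total number of shards is known, passing `first` and `last` explicitly makes the check independent of which files happen to be present.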
