SOCA metadata extraction fails for repositories larger than 200 MB due to SOMEF download limit #1036

Description

@SergioZSZ

Summary

When extracting metadata for some repositories with SOCA, the extraction fails because SOMEF skips repository archives larger than a hardcoded 200 MB download limit.

This affects the following SOCA command:

soca extract -i <repos-file> -o <output-dir>

For repositories whose GitHub archive is larger than 200 MB, SOCA/SOMEF logs warnings like:

 WARNING - Repository archive skipped due to size limit: 200 MB or not content lenght.
  ERROR - Error processing the target repository

As a result, the repository metadata is not extracted. The cause is that SOMEF hardcodes the download limit in its constants:
SIZE_DOWNLOAD_LIMIT_MB = 200
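
To illustrate one possible direction for a fix, here is a minimal sketch of a size check with a configurable limit. It is not SOMEF's actual code: the environment variable `SOMEF_SIZE_DOWNLOAD_LIMIT_MB` and the helper names `resolve_limit_mb` and `should_skip_archive` are hypothetical, and the skip-on-missing-header behavior is only inferred from the warning message above.

```python
import os

# Default taken from the constant quoted in this issue.
DEFAULT_SIZE_DOWNLOAD_LIMIT_MB = 200


def resolve_limit_mb(env=None):
    """Return the download limit in MB, allowing an optional override.

    The variable name SOMEF_SIZE_DOWNLOAD_LIMIT_MB is hypothetical; it is
    not an existing SOMEF setting.
    """
    env = os.environ if env is None else env
    raw = env.get("SOMEF_SIZE_DOWNLOAD_LIMIT_MB")
    if raw is None:
        return DEFAULT_SIZE_DOWNLOAD_LIMIT_MB
    try:
        value = int(raw)
    except ValueError:
        # Fall back to the default on a malformed override.
        return DEFAULT_SIZE_DOWNLOAD_LIMIT_MB
    return value if value > 0 else DEFAULT_SIZE_DOWNLOAD_LIMIT_MB


def should_skip_archive(content_length, limit_mb):
    """Return True when the archive should be skipped.

    Mirrors the behavior suggested by the warning: skip when the
    Content-Length header is missing or invalid, or when the size
    exceeds the limit.
    """
    if content_length is None:
        return True
    try:
        size_bytes = int(content_length)
    except (TypeError, ValueError):
        return True
    return size_bytes > limit_mb * 1024 * 1024
```

With such an override, a 250 MB archive would be skipped under the default limit but accepted if the limit were raised to 500, for example `should_skip_archive(str(250 * 1024 * 1024), resolve_limit_mb({"SOMEF_SIZE_DOWNLOAD_LIMIT_MB": "500"}))`.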

Labels: bug (Something isn't working)
