Skip to content

acm_scrape

Module: src.scrapers.acm_scrape
Category: Scrapers

Usage

python -m src.scrapers.acm_scrape [options]

Options

usage: acm_scrape.py [-h] --conference CONFERENCE --years YEARS
                     [--format {json,yaml,summary}]
                     [--max-workers MAX_WORKERS] [--delay DELAY]
                     [--all-papers]

Scrape ACM DL proceedings for paper titles and artifact badges.

options:
  -h, --help            show this help message and exit
  --conference CONFERENCE, -c CONFERENCE
                        Conference key(s), comma-separated (e.g. ccs, sosp)
  --years YEARS, -y YEARS
                        Year(s) to scrape, comma-separated (e.g. 2023,2024)
  --format {json,yaml,summary}, -f {json,yaml,summary}
                        Output format
  --max-workers MAX_WORKERS
                        Max parallel requests (default: 4)
  --delay DELAY         Delay between requests in seconds (default: 0.5)
  --all-papers          Include papers without badges