generate_paper_index¶
src.generators.generate_paper_index
¶
Generate a canonical paper index (papers.json) from authors.yml.
Each unique paper gets a stable integer ID. Co-authored papers are stored once in the index and referenced by ID from each author entry.
Usage
python -m src.generators.generate_paper_index --data_dir ../reprodb.github.io
load_existing_index(path)
¶
Load existing paper index to preserve IDs across runs.
Source code in src/generators/generate_paper_index.py
22 23 24 25 26 27 28 29 30 31 32 33 | |
build_paper_index(authors_data, existing_by_title, max_id)
¶
Build paper index from authors.yml data.
Returns (papers_list, norm_title_to_id dict).
Source code in src/generators/generate_paper_index.py
36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 | |