feat(01-01): create Python package scaffold with config system

- pyproject.toml: installable package with bioinformatics dependencies
- Pydantic config schema with validation (ensembl_release >= 100, directory creation)
- YAML config loader with override support
- Default config with Ensembl 113, gnomAD v4.1
- 5 passing tests for config validation and hashing
This commit is contained in:
2026-02-11 16:24:35 +08:00
parent cab2f5fc66
commit 4a80a0398e
8 changed files with 459 additions and 0 deletions

25
config/default.yaml Normal file
View File

@@ -0,0 +1,25 @@
# Default pipeline configuration
data_dir: data
cache_dir: data/cache
duckdb_path: data/pipeline.duckdb
versions:
ensembl_release: 113
gnomad_version: v4.1
gtex_version: v8
hpa_version: "23.0"
api:
rate_limit_per_second: 5
max_retries: 5
cache_ttl_seconds: 86400
timeout_seconds: 30
scoring:
gnomad: 0.20
expression: 0.20
annotation: 0.15
localization: 0.15
animal_model: 0.15
literature: 0.15