Buildable, machine-checkable scientific knowledge.
Quick start · Documentation · Repository status · Contributing
Scientific Memory turns mathematically structured science from prose into machine-checkable, executable, composable artifacts with full provenance. It is not a paper summarizer or a theorem leaderboard: it is a knowledge-upgrading pipeline built for durable scientific inheritance.
| You get | How |
|---|---|
| Traceable claims | Every claim anchored with `source_span` and schema-valid JSON |
| Formal layer | Lean 4 + mathlib, linked from the corpus via mapping and theorem cards |
| Executable witnesses | Kernels with explicit verification boundaries |
| Inspectable output | Portal rendered only from canonical manifests and exports |
| Reproducible gates | Unified validation, CI, benchmarks, and signed releases |
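As a rough illustration of what "traceable claims" means in practice, here is a minimal sketch of a claim record carrying a source span. The field names (`claim_id`, `source_span`, `char_start`, `char_end`) are hypothetical stand-ins; the real shapes are defined by the JSON Schemas under `schemas/`.

```python
# Hypothetical shape of a traceable claim record; the real field names
# are defined by the schemas in schemas/ and may differ.
claim = {
    "claim_id": "langmuir_1918_adsorption.claim_01",
    "statement": "Surface coverage follows theta = K*p / (1 + K*p).",
    "source_span": {"page": 3, "char_start": 1180, "char_end": 1245},
    "assumptions": ["monolayer adsorption", "equivalent, independent sites"],
}

def span_is_valid(record: dict) -> bool:
    """Minimal check: every claim must carry a well-ordered source span."""
    span = record.get("source_span")
    return (
        isinstance(span, dict)
        and span.get("char_start", -1) >= 0
        and span.get("char_end", -1) > span["char_start"]
    )

print(span_is_valid(claim))  # True
```

The point is that the span, not the prose, is the anchor: a claim without a valid span cannot pass the gates.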
The project optimizes for:
| Pillar | Meaning |
|---|---|
| Explicit claims & assumptions | No silent hand-waving between text and formal code |
| Formal declarations | Machine-checked where the project commits to it |
| Executable kernels | Where numerical or computational alignment matters |
| Versioned provenance | Artifacts you can audit and rebuild |
| Reproducible builds | Lean, Python, and portal all part of one bar |
| Area | Role |
|---|---|
| `corpus/` | Schema-first papers: metadata, claims, assumptions, symbols, manifests |
| `formal/` | Lean 4 library (`ScientificMemory`) linked to the corpus |
| `schemas/` | Canonical JSON Schema for all public artifacts |
| `pipeline/` | `sm_pipeline`: ingest, extract, validate (gate engine), publish, portal export |
| `kernels/` | Executable kernels + shared `kernels/conformance/` test helpers |
| `portal/` | Next.js UI rendered from `corpus-export.json` and corpus data |
| `benchmarks/` | Regression tasks, gold labels, thresholds, proof-success trends |
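To make "executable kernels with explicit verification boundaries" concrete, here is a hedged sketch using the Langmuir isotherm (the corpus includes `langmuir_1918_adsorption`). The `Witness` wrapper and its boundary labels are illustrative only, not the repo's actual kernel API.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Witness:
    value: float
    boundary: str  # "proof" | "witness" | "heuristic" -- illustrative labels

def langmuir_coverage(K: float, p: float) -> Witness:
    """Langmuir isotherm theta = K*p / (1 + K*p).

    Numeric evaluation is a *witness*, not a proof: the formal statement
    about the isotherm lives in the Lean layer, and this kernel only
    provides a checkable numerical counterpart."""
    if K < 0 or p < 0:
        raise ValueError("K and p must be non-negative")
    return Witness(value=(K * p) / (1.0 + K * p), boundary="witness")

theta = langmuir_coverage(K=2.0, p=1.0)
print(theta.value)  # 2/3 coverage at K*p = 2
```

Tagging every result with its boundary is what keeps "proof vs witness vs heuristic" visible downstream.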
```bash
git clone https://github.com/fraware/scientific-memory.git
cd scientific-memory
just bootstrap   # toolchains and dependencies
just build       # Lean + portal + Python tests
just validate    # full corpus / schema / graph gates
just portal      # local dev server (see terminal for URL)
```

| Situation | Command |
|---|---|
| Something failed early | `just doctor` (uv, pnpm, Lean, Lake) |
| Lean only | `just lake-build` or `just lake-build-verbose LOG=lake-build.log` |
| Full pre-PR sweep | `just check` |
| Without `just` (or Git Bash missing) | Contributor playbook – Local CI |
Use this as the single local path before opening a PR:
```bash
just bootstrap
just check
just benchmark
```
If you cannot use `just`, run the equivalent uv/lake/pnpm commands from Contributor playbook – Local CI; that playbook section is the canonical non-`just` path. On Windows, `just` uses Git for Windows Bash (see the same section).
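When something fails early, the first question is whether the required executables are even on PATH. This is a rough sketch of the kind of check an environment diagnostic like `just doctor` might perform; the real recipe may also check versions and configuration, so treat the tool list and behavior as assumptions.

```python
import shutil

def doctor(tools=("uv", "pnpm", "lean", "lake")) -> dict:
    """Report which required executables are found on PATH.

    A sketch of what an environment check such as `just doctor` might
    cover; the actual recipe is defined in the repo's justfile."""
    return {tool: shutil.which(tool) is not None for tool in tools}

for tool, found in sorted(doctor().items()):
    print(f"{tool}: {'ok' if found else 'MISSING'}")
```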
Current tree (corpus, pipeline, CI, metrics)
- Corpus: Eight indexed papers in `corpus/index.json`: six formalized core slices plus two hard-dimension stress scaffolds (`stress_units_dimensional_2024`, `stress_approx_asymptotic_2024`). Scaffold `test_new_paper` was retired from the index. Per-paper machine-checked counts and manifest `build_hash_version` / dependency-graph edge counts are generated in docs/status/repo-snapshot.md (`just repo-snapshot`); do not treat README prose as the live source of those numbers.
- Pipeline: Ingest through publish and portal export; unified validation in `gate_engine`. Trust boundary (canonical JSON vs LLM/suggestion sidecars, publish integrity, build hash v2): docs/reference/trust-boundary-and-extraction.md. Tests: run `just test` or `uv run pytest --collect-only -q` for current counts (pipeline + kernel packages).
- CI: All seven gates in place (Lean build; schema + graph + migration checks; provenance; coverage; portal build + smoke test; benchmark regression with proof-success snapshot, per-paper slices, trend history, runtime budgets, minimum thresholds and `tasks_ceiling` upper bounds in `benchmarks/baseline_thresholds.json` (e.g. source-span alignment error rate on `tasks.gold`); release integrity). Gate 7: checksums plus Sigstore (cosign) keyless signing; tagged releases publish a GitHub Release with changelog, checksums, signatures, and `release-bundle.zip`. Verify script: `scripts/verify_release_checksums.sh`. Quality: Ruff on pipeline; tests as above.
- Infra: Policy docs in `docs/infra/` (README, cache-policy, release-policy); CI and release under `.github/workflows/` and repo root.
- Contributor tooling: `just doctor` for environment diagnostics; stage banners in `just check`; `just lake-build` / `just lake-build-verbose LOG=...` for Lean build logs. SPEC and playbook use the real paper ID `langmuir_1918_adsorption` in examples.
- Metrics: `just metrics` (median intake, dependency, symbol conflict, proof completion, axiom count, research-value including literature_errors, claims_with_clarified_assumptions, kernels_with_formally_linked_invariants, source-span alignment, normalization visibility, assumption-suggestions, dimension-visibility, dimension-suggestions). `just benchmark` writes `benchmarks/reports/latest.json` with `proof_success_snapshot`, `proof_success_summary.md`, and task outputs including `tasks.gold` (precision/recall/F1, `papers_with_gold`, and source-span alignment fields), `tasks.llm_suggestions` / `tasks.llm_lean_suggestions` (optional LLM sidecar footprint metrics), `tasks.llm_eval` (reviewed reference bundles under `benchmarks/llm_eval/`), and `llm_prompt_templates` (declared prompt SHA-256 map). Gate 6 compares against `benchmarks/baseline_thresholds.json` (`tasks` minima and `tasks_ceiling`). Use `just scaffold-gold <paper_id>` when admitting a paper (all indexed papers currently have gold).
- LLM integration: Optional Prime Intellect inference for claims, mapping, and Lean proposals (suggest-only, human-gated apply). Full end-to-end pipeline run validated on `math_sum_evens` with `allenai/olmo-3.1-32b-instruct`; evaluation infrastructure includes prompt versioning, reference fixtures, benchmark task `llm_eval`, and a human review rubric. See docs/tooling/prime-intellect-llm.md and docs/testing/llm-lean-live-test-matrix.md.
- Optional: Blueprint check (`just check-paper-blueprint`), check-tooling (pandoc), extract-from-source, build-verso, mcp-server. Blueprints under `docs/blueprints/` cover Langmuir, Freundlich, `temkin_1941_adsorption`, and `physics_kinematics_uniform` (mapping mirror where present). Role playbooks: `docs/playbooks/` (formalizer, reviewer, domain-expander, release-manager). Portal dependencies pinned (Next.js ^14.2, React ^18.3) for reproducible builds. Also: Hypothesis-based property tests for adsorption kernels; shared kernel test helpers (`kernels/conformance/` workspace package); theorem-card reviewer lifecycle (contributor-playbook.md); `batch-admit --dry-run`; snapshot baseline quality validation; `validate-all --report-json` for gate reports; `just repo-snapshot` for docs/status/repo-snapshot.md.
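Gate 6's comparison of a benchmark report against minima and `tasks_ceiling` upper bounds can be sketched as follows. The metric names here (`gold_f1`, `span_alignment_error_rate`) are illustrative; the real keys and layout are defined by `benchmarks/baseline_thresholds.json` and `benchmarks/reports/latest.json`.

```python
def gate6_check(report: dict, minima: dict, ceilings: dict) -> list:
    """Compare benchmark metrics against minimum thresholds and
    ceiling (upper-bound) thresholds; return human-readable failures.

    Metric names are illustrative placeholders, not the repo's schema."""
    failures = []
    for metric, floor in minima.items():
        value = report.get(metric, 0.0)
        if value < floor:
            failures.append(f"{metric}={value} below minimum {floor}")
    for metric, cap in ceilings.items():
        value = report.get(metric, 0.0)
        if value > cap:
            failures.append(f"{metric}={value} above ceiling {cap}")
    return failures

report = {"gold_f1": 0.91, "span_alignment_error_rate": 0.03}
print(gate6_check(report,
                  minima={"gold_f1": 0.85},
                  ceilings={"span_alignment_error_rate": 0.05}))  # []
```

Minima catch regressions (e.g. F1 dropping); ceilings catch the inverse failure mode, such as an error rate that should stay bounded.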
```mermaid
flowchart TD
  subgraph intake [Intake]
    Add[Add paper]
    Extract[Extract claims and context]
  end
  subgraph optional [Optional LLM assistance]
    LLM[LLM proposals]
    Review[Human review]
    Apply[Apply after review]
  end
  subgraph canonical [Canonical work]
    Norm[Normalize and link]
    Map[Map to Lean]
    Formal[Formalize in Lean]
  end
  subgraph validation [Validation and publish]
    Gates[Gate engine validate-all]
    Publish[Publish manifests and theorem cards]
    Export[Portal export]
  end
  subgraph outputs [Outputs]
    Portal[Portal pages]
    Bench[Benchmarks and regression]
    Manifests[Published artifacts]
  end
  Add --> Extract
  Extract --> Norm
  Norm --> Map
  Map --> Formal
  Norm --> Gates
  Formal --> Gates
  Gates --> Publish
  Publish --> Manifests
  Publish --> Export
  Export --> Portal
  Publish --> Bench
  Formal --> Bench
  Extract -.-> LLM
  LLM --> Review
  Review --> Apply
  Apply --> Norm
  Apply --> Map
  Apply --> Formal
  Apply --> Gates
```
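The flow above can be sketched as composable stages, with the gate engine as the single choke point before publish. The function names mirror the diagram, not the real `sm_pipeline` API, and the stage bodies are placeholder transforms.

```python
# Stage names mirror the diagram; the real sm_pipeline API differs,
# so treat this as a shape sketch, not the actual interface.
def add(paper):         return {**paper, "status": "added"}
def extract(paper):     return {**paper, "claims": ["claim_01"]}
def normalize(paper):   return {**paper, "normalized": True}
def map_to_lean(paper): return {**paper, "mapping": {"claim_01": "thm_claim_01"}}
def formalize(paper):   return {**paper, "lean_checked": True}
def validate(paper):    return bool(paper.get("normalized") and paper.get("lean_checked"))
def publish(paper):     return {**paper, "status": "published"}

def run_pipeline(paper):
    """Nothing reaches publish or portal export without passing the gates."""
    staged = formalize(map_to_lean(normalize(extract(add(paper)))))
    if not validate(staged):
        raise RuntimeError("gate engine failed; nothing is published")
    return publish(staged)

print(run_pipeline({"paper_id": "langmuir_1918_adsorption"})["status"])  # published
```

The design point encoded here is that publish depends on validation, never the reverse: a paper that fails the gates leaves no published artifacts behind.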
| Topic | Link |
|---|---|
| Index | docs/README.md |
| Contributor playbook (setup, paper workflow, local CI, reuse, review, verification, Verso, schema migrations, Gate 7) | docs/contributor-playbook.md |
| Architecture | docs/architecture.md |
| Roadmap | ROADMAP.md |
| Paper intake (SPEC 8.1) | docs/paper-intake.md |
| Metrics (SPEC 12) | docs/metrics.md |
| ADRs | docs/adr/README.md |
| Infra / CI policy | docs/infra/README.md |
| Repo snapshot | docs/status/repo-snapshot.md (just repo-snapshot) |
| Maintainers (public push, CI, triage, launch) | docs/maintainers.md |
| MCP tooling (optional) | docs/tooling/mcp-lean-tooling.md |
| Prime Intellect LLM (optional, suggest-only) | docs/tooling/prime-intellect-llm.md |
| Trust boundary and manual E2E scenarios | docs/reference/trust-boundary-and-extraction.md · docs/testing/trust-hardening-e2e-scenarios.md · LLM Lean live test matrix |
| Pandoc / LaTeX (optional) | docs/tooling/pandoc-latex-integration.md |
| Resource | Link |
|---|---|
| How to contribute | CONTRIBUTING.md |
| Step-by-step playbook | docs/contributor-playbook.md |
| Pipeline extension points | docs/pipeline-extension-points.md |
- Artifact-first, model-second — durable JSON and Lean, not one-off prose.
- Provenance is mandatory — claims and cards stay tied to sources.
- Verification boundaries are explicit — proof vs witness vs heuristic is visible.
- Claim bundles are the core unit — not isolated theorems in a void.
- Full buildability is the minimum bar — no merge without the agreed gates.
Licensed under Apache-2.0 — see LICENSE.