Ingest
Raw Verasight files (.sav, .csv, .dta, .rds, .xlsx) become long-format intermediate tables with metadata sidecars.
Verasight source reports are the canonical record. The Data Library presents key findings from them.
Source of truth Verasight.io
Source of truth
verasight.io/reportsVerasight source reports are the canonical record. Each one carries the full survey instrument, weighting design, and field methodology in one place.
The Data Library presents key findings from those reports. Every featured topic and indexed question on this site cites back to the source. To see how the full methodology works for any wave, click through to its report.
01 / Vocabulary
Every page in this library resolves to one of four editorial primitives. Categories group, featured topics tell, indexed questions reference, supporting data justifies.
02 / Pipeline
The pipeline runs as a fixed sequence of inspectable stages. Each stage owns one job and writes a checkable artifact. A failure halts the run with a descriptive error, not silent drift.
Raw Verasight files (.sav, .csv, .dta, .rds, .xlsx) become long-format intermediate tables with metadata sidecars.
Null handling, PII scrub, demographic normalization, and respondent context propagation.
Row-level weight validation and reusable weighted summaries.
Banner and extra demographic breakdowns with low-N flags and canonical dimension names.
Questions package into a wave-scoped bundle with toplines, crosstabs, methodology, citation, and slug.
Canonical question JSON, long-format per-question CSV, per-wave summary, and a site-wide index.
The pipeline run ends at stage 06 emit. Editorial curation happens after, on the committed canonical artifacts, and always passes through human review before publication.
03 / Publication
Canonical data is regenerable. Editorial decisions are committed as artifacts. The site is the read view over both.
One JSON and one CSV per question, plus a per-wave summary and a site-wide index. Stable contract, regenerable from raw inputs.
Category mappings, topic proposals, and curated featured topics commit on top of the canonical layer. Editorial decisions are tracked as artifacts, not implicit.
The site reads committed artifacts. Published featured topics drive home, category, and search surfaces. Absorbed questions never duplicate as standalone pages.
A canonical question absorbed into a published featured topic does not duplicate as a standalone page. Indexed-question pages are exactly the canonical questions that are not absorbed.
04 / Sources
The Data Library reads, presents, and cites. The full source is upstream at Verasight. Every featured topic links back to the report and, where available, to the question anchor inside it.
05 / Citations
Accurate citation requires structure on the page, structure in the source, and a stable path back to canonical. The library publishes all three.
06 / Upstream
Field methodology, weighting design, and verbatim instrument text are authored by Verasight and cited from the source report. The Data Library carries enough methodology inline to read a topic, and points to the source for everything else.
Mode-by-mode field methodology, weighting variable lists, and per-wave demographic summaries appear in the underlying report and in the per-question canonical record. Site pages surface a stable subset.
Citations
SourceWhen in doubt, cite the canonical source report.