The childes-db project aims to make CHILDES transcripts more accessible by reducing the amount of preprocessing necessary (e.g., CLAN or specific preprocessing libraries) and by making the individual tokens and utterances available in a tidy, tabular format. In addition, we plan to release new dated versions periodically to facilitate reproducibility. We are also supporting an API via the childesr R package, which allows users to access the transcripts in childes-db without having to write complex SQL queries.

Citation policy

If you use childes-db to access CHILDES in your research, please note the database version you used (i.e., 2018.1) and cite:

  1. CHILDES itself – both the database and the corpora you use – following the Talkbank policy.
  2. The childes-db paper:
Sanchez, A., Meylan, S., Braginsky, M., MacDonald, K. E., Yurovsky, D., & Frank, M. C. (2018, April 23). childes-db: a flexible and reproducible interface to the Child Language Data Exchange System. Retrieved from

Meet the childes-db team

Alessandro Sanchez

Stanford University

Michael C. Frank

Stanford University

Stephan Meylan

UC Berkeley

Daniel Yurovsky

University of Chicago

Kyle MacDonald

Stanford University