The childes-db project aims to make CHILDES transcripts more accessible by reducing the amount of preprocessing required (e.g., with CLAN or specialized preprocessing libraries) and by making individual tokens and utterances available in a tidy, tabular format. In addition, we plan to release new dated versions periodically to facilitate reproducibility. We also support an API via the childesr R package, which allows users to access the transcripts in childes-db without having to write complex SQL queries.
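To make "tidy, tabular format" concrete, here is a minimal sketch using Python's built-in `sqlite3` module. The table and column names (`utterance`, `token`, `speaker_role`, `gloss`, `token_order`) and the two toy utterances are assumptions made for illustration only; they are a simplified stand-in, not the actual childes-db schema or data. The sketch stores one row per token and then runs the kind of join that a wrapper API is meant to spare users from writing by hand.

```python
import sqlite3


def build_toy_db() -> sqlite3.Connection:
    """Create an in-memory database with a hypothetical, simplified schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE utterance (
            id INTEGER PRIMARY KEY,
            speaker_role TEXT,
            gloss TEXT
        );
        CREATE TABLE token (
            id INTEGER PRIMARY KEY,
            utterance_id INTEGER REFERENCES utterance(id),
            token_order INTEGER,
            gloss TEXT
        );
        """
    )
    # Toy utterances invented for this example (not CHILDES data).
    utterances = [
        (1, "Mother", "where is the ball"),
        (2, "Target_Child", "ball"),
    ]
    conn.executemany("INSERT INTO utterance VALUES (?, ?, ?)", utterances)

    # Tidy layout: one row per token, linked back to its utterance.
    token_rows = [
        (utt_id, order, word)
        for utt_id, _role, gloss in utterances
        for order, word in enumerate(gloss.split(), start=1)
    ]
    conn.executemany(
        "INSERT INTO token (utterance_id, token_order, gloss) VALUES (?, ?, ?)",
        token_rows,
    )
    return conn


def token_counts_by_role(conn: sqlite3.Connection) -> dict:
    """Count tokens per speaker role by joining tokens to their utterances."""
    rows = conn.execute(
        """
        SELECT u.speaker_role, COUNT(*)
        FROM token AS t
        JOIN utterance AS u ON t.utterance_id = u.id
        GROUP BY u.speaker_role
        ORDER BY u.speaker_role
        """
    ).fetchall()
    return dict(rows)


if __name__ == "__main__":
    print(token_counts_by_role(build_toy_db()))
```

Because every token is its own row, per-speaker or per-word summaries reduce to ordinary grouping operations; no transcript-specific parsing is needed once the data is in this shape.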
If you use childes-db to access CHILDES in your research, please note the database version you used (e.g., 2018.1) and cite:
Sanchez, A., Meylan, S., Braginsky, M., MacDonald, K. E., Yurovsky, D., & Frank, M. C. (2018, April 23). childes-db: a flexible and reproducible interface to the Child Language Data Exchange System. Retrieved from psyarxiv.com/93mwx