Data Resources
Affective ratings for nearly 14 thousand English words
Age of Acquisition Norms & Concreteness norms for 30,000 Dutch Words
Age of Acquisition Norms (English)
ARC Nonword Database – Generate nonwords using specified lexical characteristics.
Baldey: a database of auditory lexical decision – 5541 Dutch words, includes auditory stimuli and associated Praat textgrids.
CLEARPOND Database – Cross-Linguistic Easy-Access Resource for Phonological and Orthographic Neighborhood Densities. Dutch, English, French, German, Spanish.
Concreteness Ratings in English – 40 thousand English lemmas.
Dutch Lexicon Project & British Lexicon Project
Dutch Word Association Database
English Lexicon Project – Database of lexical decision and naming reaction times and error rates. Visual lexical decision experiment uses 40481 words and and equal number of nonwords.
Form Priming Project: A behavioral database for masked form priming.
International Picture Naming Project – Studies, database of results, stimuli available for use.
MRC Psycholinguistic Database – Generate stimuli lists based on lexical characteristics.
Norms of Valence, Arousal, Dominance, and AoA in English and Dutch.
University of South Florida Free Association Norms
Wortschatz/Leipzig Corpora Collection – A project maintained by the Natural Language Processing Group at the Institute of Computer Science at Leipzig University. Includes searchable corpora for over 250 languages and tools for corpus linguistic analysis.
Corpora & Lexical Databases
Celex Lexical Database (WebCelex) – Online version of Celex. English, Dutch, German.
Child Language Data Exchange System (CHILDES)
Children’s Printed Word Database (University of Essex)
Corpora Collection, Leipzig University – searchable in 252 languages. Part of Wortschatz.
corpus.byu.edu – Series of corpora with site-based search functions, including: Corpus of Contemporary American English, NOW Corpus, BNC, Strathy Corpus (Canada).
dlexDB – German word frequencies. See also The Digital Dictionary of German (www.dwds.de).
Irvine Phonotactic Online Dictionary (IPhOD) – Words & pseudowords, organized by phonemes (i.e., as search criteria, number of phonological neighbours, etc.).
Lexique – French corpus, including word frequencies, phonological and grammatical information, and more. Version 3.
SUBTITLE FREQUENCIES
SUBTLEX-CH – Chinese, includes part of speech information
SUBTLEX-DE – German
SUBTLEX-ESP – Spanish
SUBTLEX-GR – Greek
SUBTLEX-NL – Dutch, includes part of speech information
SUBTLEX-PL – Polish, includes part of speech information
SUBTLEX-PT – Portuguese
SUBTLEX-PT-BR – Brazilian Portuguese
SUBTLEX-UK – British English, includes part of speech information
SUBTLEX-US – American English
University of Oxford Text Archives (OTA) – Corpora in Arabic, Chinese, English, French, German, Scots, Swedish, Welsh.
WordNet – Lexical database organized by word senses, synsets, and word relations.
Software & Tools
ASV Online Toolbox – A collection of tools that can be used to explore written language data. Part of Wortschatz.
Coh-Metrix – A tool for evaluating the difficulty of written text.
DMDX – Experiment presentation software for PC.
Linger – Experiment presentation software for language processing experiments.
PsychoPy – Open-source experiment presentation software written in Python. Includes a visual Builder interface and a Coder interface.
PsyScope X – Experiment presentation software for Mac.
Phonotactic Probability Calculator (University of Kansas).
Worldlikeness: A Web-based Tool for Typological Psycholinguistics – Online wordlikeness judgments for experimenters, participants, and researchers.