Word frequency data



You can also purchase the pre-2020 data (see sample). Note that this is based on a much smaller corpus (400 million words vs the current one billion words) with fewer genres. In addition, the pre-2020 data is just a single wordlist (compared to the four wordlists in the current data).

rank   lemma / word PoS freq dispersion
7309   attic n 2711 0.91
17311   tearful j 542 0.93
27303   tailgate v 198 0.85
37310   hydraulically r 78 0.83
47309   unsparing j 35 0.83
57309   embryogenesis n 22 0.66

Purchase:
 
# words Academic license Commercial license
20,000 $60 $120
60,000 $90 $180