International banner

International banner

Tuesday, October 18, 2011

Vocabulary profiler

There is a tool with which you can calculate how many academic words are written in a text. It calculates the number of families of words and also how many of these families are Anglo-Sax originated and from Greek or Latin origin. You can find this tool on http://www.lextutor.ca/vp/eng/.




K1 words are the most frequent 1000 words of English. K2 words are the second most frequent used words in English, so word 1001 to 2000. AWL words are 570 most frequently used words in academic texts. Finally, Off-list are words that are not found on K1, K2 or the Academic word list (AWL). A typical educated native English speaker's result is 70% from K1, 10% from K2, 10% academic and 10% less frequent words.

I submitted the Konopnicka memo report through this vocabulary tool as well and below is the result. The words I used were mostly from K1 (73%) and the K2 and AWL words were not as frequent as I may have wanted (8% and 7%). Of the words I used in the memo report, 36 percent are from Greek and Latin origin. 


I hope to make some progress this year in shifting my texts more towards academic texts, thus using more AWL words. 



  FamiliesTypesTokensPercent
K1 Words (1-1000):9611130473.25%
  Function:......(184)(44.34%)
  Content:......(120)(28.92%)
>   Anglo-Sax    
=Not Greco-Lat/Fr Cog:
......(41)(9.88%)
K2 Words (1001-2000):1820337.95%
>   Anglo-Sax:     ......(9)(2.17%)
    1k+2k      ......(81.20%)
AWL Words (academic):1822296.99%
>   Anglo-Sax:     ......()(0.00%)
Off-List Words:?214911.81%
132+?174415100%
Words in text (tokens):415
Different words (types):174
Type-token ratio:0.42
Tokens per type:2.39
Lex density (content words/total)0.56

Pertaining to onlist only
Tokens:366
Types:153
Families:132
Tokens per family:2.77
Types per family:1.16
Anglo-Sax Index:
(A-Sax tokens + functors / onlist tokens)
63.93%
Greco-Lat/Fr-Cognate Index: (Inverse of above)36.07%

No comments:

Post a Comment