Key | Element of Grammar | Example(s) |
---|---|---|
CC | Coordinating conjunction | and,but,or |
CD | Cardinal number | one,two |
DT | Determiner | the,some |
EX | Existential there | there |
FW | Foreign Word | mon dieu |
IN | Preposition | of,in,by |
JJ | Adjective | big |
JJR | Adj., comparative | bigger |
JJS | Adj., superlative | biggest |
LS | List item marker | 1,One |
MD | Modal | can,should |
NN | Noun, sing. or mass | dog |
NNP | Proper noun, sing. | Edinburgh |
NNPS | Proper noun, plural | Smiths |
NNS | Noun, plural | dogs |
POS | Possessive ending | 's |
PDT | Predeterminer | all, both |
PP$ | Possessive pronoun | my,one's |
PRP | Personal pronoun | I,you,she |
RB | Adverb | quickly |
RBR | Adverb, comparative | faster |
RBS | Adverb, superlative | fastest |
RP | Particle | up,off |
SYM | Symbol | +,%,& |
TO | 'to' | to |
UH | Interjection | oh, oops |
VB | Verb, base form | eat |
VBD | Verb, past tense | ate |
VBG | Verb, gerund | eating |
VBN | Verb, past participle | eaten |
VBP | Verb, present, non-3rd person | eat |
VBZ | Verb, present, 3rd person singular | eats |
WDT | Wh-determiner | which,that |
WP | Wh pronoun | who,what |
WP$ | Possessive-Wh | whose |
WRB | Wh-adverb | how,where |
, | Comma | , |
. | Sent-final punct | . ! ? |
: | Mid-sent punct. | : ; |
$ | Dollar sign | $ |
# | Pound sign | # |
" | Quote | " |
( | Left paren | ( |
) | Right paren | ) |
The breakdown tool is located at http://linguisti.cc/breakdown
The URL accepts a text string (parameter name 'text') submitted through either HTTP GET or POST. The result is a JSON-encoded two-dimensional array that pairs each element of the input text with its grammatical classification (see the table above).
http://linguisti.cc/breakdown?text=I am the very model of a modern Major-General, I've information vegetable, animal, and mineral, I know the kings of England, and I quote the fights historical, from Marathon to Waterloo, in order categorical.
Submitting the preceding text to the breakdown tool returns the following JSON-encoded two-dimensional array:
[["I","NN"],["am","VBP"],["the","DT"],["very","RB"],["model","NN"],["of","IN"],["a","DT"],["modern","JJ"],["Major-General","NN"],[",",","],["I've","NN"],["information","NN"],["vegetable","NN"],[",",","],["animal","NN"],[",",","],["and","CC"],["mineral","NN"],[",",","],["I","NN"],["know","VB"],["the","DT"],["kings","NNS"],["of","IN"],["England","NNP"],[",",","],["and","CC"],["I","NN"],["quote","VB"],["the","DT"],["fights","NNS"],["historical","JJ"],[",",","],["from","IN"],["Marathon","NNP"],["to","TO"],["Waterloo","NN"],[",",","],["in","IN"],["order","NN"],["categorical","JJ"],[".","."]]
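As a rough sketch of how a client might consume this result, the Python snippet below decodes a short excerpt of the response above and attaches the table's description to each tag. The names `TAG_NAMES`, `describe`, and `tag_counts` are made up for illustration, and the lookup table is only a partial copy of the keys table.

```python
import json
from collections import Counter

# Excerpt of the sample response above (the last few words of the verse).
sample = ('[["from","IN"],["Marathon","NNP"],["to","TO"],["Waterloo","NN"],'
          '[",",","],["in","IN"],["order","NN"]]')

# Partial copy of the keys table; extend it with the remaining keys as needed.
TAG_NAMES = {
    "IN": "Preposition",
    "NNP": "Proper noun, sing.",
    "TO": "'to'",
    "NN": "Noun, sing. or mass",
    ",": "Comma",
}

def describe(response_text):
    """Decode the JSON array and label every [word, tag] pair."""
    pairs = json.loads(response_text)
    return [(word, tag, TAG_NAMES.get(tag, "unknown")) for word, tag in pairs]

def tag_counts(response_text):
    """Count how often each tag occurs in the response."""
    return Counter(tag for _, tag in json.loads(response_text))

for word, tag, label in describe(sample):
    print(f"{word}\t{tag}\t{label}")
```

Because each inner array is always a two-element `[word, tag]` pair, it unpacks directly in a loop or comprehension.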
You may also use POST to submit larger blocks of text.
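Here is a minimal Python sketch of both submission styles, using only the standard library. It assumes the POST body is ordinary form encoding with the single 'text' field, which is a guess, since only the parameter name is documented here. The helper names are hypothetical, and the service may not be online when you try it.

```python
import json
import urllib.parse
import urllib.request

BREAKDOWN_URL = "http://linguisti.cc/breakdown"

def build_get_url(text):
    """Return the GET URL with the text percent-encoded in the 'text' parameter."""
    return BREAKDOWN_URL + "?" + urllib.parse.urlencode({"text": text})

def build_post_request(text):
    """Build (but do not send) a POST request carrying the 'text' field.

    Assumption: the server accepts a form-encoded body.
    """
    body = urllib.parse.urlencode({"text": text}).encode("utf-8")
    return urllib.request.Request(BREAKDOWN_URL, data=body, method="POST")

def breakdown(text, timeout=10):
    """Send the text via POST and decode the JSON reply into [word, tag] pairs."""
    with urllib.request.urlopen(build_post_request(text), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))

print(build_get_url("I am the very model"))
# prints http://linguisti.cc/breakdown?text=I+am+the+very+model
```

Calling `breakdown("some longer passage")` would perform the actual network request. Encoding the text rather than pasting it raw into the URL keeps spaces, commas, and apostrophes from being mangled along the way.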
All of the hard work of this tool is performed by the glorious jspos (Javascript Part of Speech Tagger) library (over at Google Code), which is itself a port of other work. All I did was give it a web interface, mostly because clients can't be expected to download the entire lexicon, so hosting it on the web is a natural fit. Much appreciation to Percy Wegmann for developing this tool in JavaScript and making it free to use. The keys table is lifted directly from the project's README.
These tools are free to use. They are for fun and exploratory use, provided as-is without any warranty. They may be taken offline at any time. Let me know if you are using them, or are planning on hammering the servers.