Web & JSON API for Language Tools

Breakdown

This is a simple tool that parses text and returns it broken down into parts of English grammar.

Classification key for the *breakdown* result set:
Key	Element of Grammar	Example(s)
CC	Coordinating conjunction	and,but,or
CD	Cardinal number	one,two
DT	Determiner	the,some
EX	Existential there	there
FW	Foreign Word	mon dieu
IN	Preposition	of,in,by
JJ	Adjective	big
JJR	Adj., comparative	bigger
JJS	Adj., superlative	biggest
LS	List item marker	1,One
MD	Modal	can,should
NN	Noun, sing. or mass	dog
NNP	Proper noun, sing.	Edinburgh
NNPS	Proper noun, plural	Smiths
NNS	Noun, plural	dogs
POS	Possessive ending	's
PDT	Predeterminer	all, both
PP$	Possessive pronoun	my,one's
PRP	Personal pronoun	I,you,she
RB	Adverb	quickly
RBR	Adverb, comparative	faster
RBS	Adverb, superlative	fastest
RP	Particle	up,off
SYM	Symbol	+,%,&
TO	'to'	to
UH	Interjection	oh, oops
VB	Verb, base form	eat
VBD	Verb, past tense	ate
VBG	Verb, gerund	eating
VBN	Verb, past participle	eaten
VBP	Verb, present (non-3rd person singular)	eat
VBZ	Verb, present (3rd person singular)	eats
WDT	Wh-determiner	which,that
WP	Wh pronoun	who,what
WP$	Possessive-Wh	whose
WRB	Wh-adverb	how,where
,	Comma	,
.	Sent-final punct	. ! ?
:	Mid-sent punct.	: ;
$	Dollar sign	$
#	Pound sign	#
"	quote	"
(	Left paren	(
)	Right paren	)

The URL accepts a text string (parameter name 'text') submitted through either HTTP 'POST' or 'GET'. The result is a JSON-encoded two-dimensional array in which each entry pairs one element of the input text with its grammatical classification (see the table above).

The breakdown tool is located at http://linguisti.cc/breakdown
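
As a minimal sketch of a GET request, the snippet below builds the query URL with Python's standard library. The helper names build_breakdown_url and fetch_breakdown are illustrative only, and the sketch assumes the endpoint accepts standard form encoding of the 'text' parameter and replies with UTF-8 JSON as described above.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

BREAKDOWN_URL = "http://linguisti.cc/breakdown"


def build_breakdown_url(text):
    """Return the GET URL for `text`, form-encoding the 'text' parameter."""
    return BREAKDOWN_URL + "?" + urlencode({"text": text})


def fetch_breakdown(text, timeout=10):
    """Send the GET request and decode the JSON array of [element, key] pairs.

    Assumption: the service is reachable and replies with UTF-8 JSON.
    """
    with urlopen(build_breakdown_url(text), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling fetch_breakdown("I am the very model") would then return a list of [element, key] pairs, provided the service responds as documented.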

Examples

http://linguisti.cc/breakdown?text=I am the very model of a modern Major-General, I've information vegetable, animal, and mineral, I know the kings of England, and I quote the fights historical, from Marathon to Waterloo, in order categorical.
Submitting the preceding text to the breakdown tool returns the following JSON-encoded two-dimensional array:
[["I","NN"],["am","VBP"],["the","DT"],["very","RB"],["model","NN"],["of","IN"],["a","DT"],["modern","JJ"],["Major-General","NN"],[",",","],["I've","NN"],["information","NN"],["vegetable","NN"],[",",","],["animal","NN"],[",",","],["and","CC"],["mineral","NN"],[",",","],["I","NN"],["know","VB"],["the","DT"],["kings","NNS"],["of","IN"],["England","NNP"],[",",","],["and","CC"],["I","NN"],["quote","VB"],["the","DT"],["fights","NNS"],["historical","JJ"],[",",","],["from","IN"],["Marathon","NNP"],["to","TO"],["Waterloo","NN"],[",",","],["in","IN"],["order","NN"],["categorical","JJ"],[".","."]]
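
Once the result is decoded, each entry is simply an [element, key] pair, so it is easy to tally or filter by classification key. The sketch below uses the first five pairs of the result above as sample input; the function names count_keys and elements_with_key are hypothetical helpers, not part of the service.

```python
import json
from collections import Counter

# First five pairs of the documented result above, used as sample input.
SAMPLE_RESPONSE = '[["I","NN"],["am","VBP"],["the","DT"],["very","RB"],["model","NN"]]'


def count_keys(response_body):
    """Count how often each classification key occurs in a breakdown result."""
    pairs = json.loads(response_body)
    return Counter(key for _element, key in pairs)


def elements_with_key(response_body, wanted_key):
    """Return the input elements tagged with `wanted_key`, in input order."""
    pairs = json.loads(response_body)
    return [element for element, key in pairs if key == wanted_key]
```

For the sample above, elements_with_key(SAMPLE_RESPONSE, "NN") picks out "I" and "model", matching the keys shown in the result.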

You may also use POST to submit larger blocks of text, such as:

I am the very model of a modern Major-General,
I've information vegetable, animal, and mineral,
I know the kings of England, and I quote the fights historical,
From Marathon to Waterloo, in order categorical;
I'm very well acquainted too with matters mathematical,
I understand equations, both the simple and quadratical,
About binomial theorem I'm teeming with a lot o' news---
With many cheerful facts about the square of the hypotenuse.

I'm very good at integral and differential calculus,
I know the scientific names of beings animalculous;
In short, in matters vegetable, animal, and mineral,
I am the very model of a modern Major-General.

I know our mythic history, King Arthur's and Sir Caradoc's,
I answer hard acrostics, I've a pretty taste for paradox,
I quote in elegiacs all the crimes of Heliogabalus,
In conics I can floor peculiarities parabolous.
I can tell undoubted Raphaels from Gerard Dows and Zoffanies,
I know the croaking chorus from the Frogs of Aristophanes,
Then I can hum a fugue of which I've heard the music's din afore,
And whistle all the airs from that infernal nonsense Pinafore.
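
A block of text like the one above can be sent in the request body rather than the URL. This is a rough sketch using Python's standard library; build_post_request and post_breakdown are made-up names, and it assumes the service reads 'text' from a form-encoded POST body and returns the same JSON array as it does for GET.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

BREAKDOWN_URL = "http://linguisti.cc/breakdown"


def build_post_request(text):
    """Build a POST request carrying `text` as a form-encoded 'text' field."""
    # urlencode percent-encodes everything outside ASCII, so "ascii" is safe.
    body = urlencode({"text": text}).encode("ascii")
    return Request(
        BREAKDOWN_URL,
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


def post_breakdown(text, timeout=10):
    """Send the POST request and decode the JSON array of [element, key] pairs.

    Assumption: the service is reachable and replies with UTF-8 JSON.
    """
    with urlopen(build_post_request(text), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Keeping the request construction separate from the network call makes it possible to inspect the encoded body without contacting the service.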

Credits

All of the hard work of this tool is performed by the glorious jspos (JavaScript Part of Speech Tagger) library (hosted on Google Code), which is itself a port of other work. All I did was give it a web interface, mostly because clients couldn't be expected to download the entire lexicon, so hosting it on the web is a natural fit. Much appreciation to Percy Wegmann for developing this tool in JavaScript and making it free to use. The keys table is lifted directly from the project's README.
