List Info

Thread: Tagger tags defined




Tagger tags defined
user name
2006-02-21 05:28:31
The tags used in the Brill/Hepple tagger are derived from
the Penn 
Treebank Project tags. The original definition seems to be 
"Part-of-Speech Tagging Guidelines for the Penn
Treebank Project" by 
Beatrice Santorini. It might be a good idea to archive it: 
ftp://ftp.cis.upenn.edu/pub/treebank/doc/tagguide.ps.gz 
ftp://ftp.cis.upenn.edu/pub/treebank/doc/tagguide.tex

I added descriptions to the GATE list (and sorted the list).
Here are my 
modifications in case they are useful for the documentation,
although in 
a simple text format.

I notice some things are described as "unknown"
although I don't know if 
that reflects GATE not using them or the author of the GATE
doc not 
knowing their meaning.

http://gate.ac
.uk/sale/tao/#x1-368000D

Appendix D
Part-of-Speech Tags used in the Hepple Tagger

CC - coordinating conjunction: "and",
"but", "nor", "or",
"yet", plus, 
minus, less, times (multiplication), over (division). Also
"for" 
(because) and "so" (i.e., "so
that").
CD - cardinal number
DT - determiner: Articles including "a",
"an", "every", "no",
"the", 
"another", "any",
"some", "those".
EX - existential ’there’: Unstressed "there"
that triggers inversion of 
the inflected verb and the logical subject; "There was
a party in progress".
FW - foreign word
IN - preposition or subordinating conjunction
JJ - adjective: Hyphenated compounds that are used as
modifiers; 
happy-go-lucky.
JJR - adjective - comparative: Adjectives with the
comparative ending 
"-er" and a comparative meaning. Sometimes
"more" and "less".
JJS - adjective - superlative: Adjectives with the
superlative ending 
"-est" (and "worst"). Sometimes
"most"and "least".
JJSS - -unknown-, but probably a variant of JJS
-LRB- - -unknown-
LS - list item marker: Numbers and letters used as
identifiers of items 
in a list.
MD - modal: All verbs that don't take an "-s"
ending in the third person 
singular present: "can", "could",
"dare", "may", "might",
"must", 
"ought", "shall",
"should", "will",
"would".
NN - noun - singular or mass
NNP - proper noun - singular: All words in names usually are
capitalized 
but titles might not be.
NNPS - proper noun - plural: All words in names usually are
capitalized 
but titles might not be.
NNS - noun - plural
NP - proper noun - singular
NPS - proper noun - plural
PDT - predeterminer: Determinerlike elements preceding an
article or 
possessive pronoun; "all/PDT his marbles",
"quite/PDT a mess".
POS - possesive ending: Nouns ending in "'s" or
"'".
PP - personal pronoun
PRPR$ - -unknown-, but probably possesive pronoun
PRP - -unknown-, but probably possesive pronoun: Personal
pronouns such 
as "I", "me", "you",
"he", "him", "ours",
"theirs".
PRP$ - -unknown-, but probably possesive pronoun: Possessive
pronouns 
such as "my", "your",
"his", "his", "its",
"one's", "our", and
"their".
RB - adverb: Most words ending in "-ly". Also
"quite", "too", "very", 
"enough", "indeed",
"not", "-n't",
and"never".
RBR - adverb - comparative: Adverbs ending with
"-er" with a comparative 
meaning.
RBS - adverb - superlative
RP - particle: Mostly monosyllabic words that also double as
directional 
adverbs.
STAART - start state marker (used internally)
SYM - symbol: Technical symbols or expressions that aren't
English words.
TO - literal ”to”
UH - interjection: Such as "my",
"oh", "please", "uh",
"well", "yes".
VBD - verb - past tense: Includes conditional form of the
verb "to be"; 
"If I were/VBD rich...".
VBG - verb - gerund or present participle
VBN - verb - past participle
VBP - verb - non-3rd person singular present
VB - verb - base form: Subsumes imperatives, infinitives and
subjunctives.
VBZ - verb - 3rd person singular present
WDT - ’wh’-determiner
WP$ - possesive ’wh’-pronoun: Includes "whose"
WP - ’wh’-pronoun: Includes "what",
"who", and "whom".
WRB - ’wh’-adverb: Includes "how",
"where", "why". Includes
"when" when 
used in a temporal sense.

:: - literal colon
, - literal comma
$ - literal dollar sign
- - literal double-dash
” - literal double quotes
‘ - literal grave
( - literal left parenthesis
. - literal period
# - literal pound sign
) - literal right parenthesis
’ - literal single quote or apostrophe


Tagger tags defined
user name
2006-02-21 10:07:13
Thanks Scot, I'll update the documentation.
The unknown tags were indeed unknown to us - they were also
classified as 
unknown in the original PTB documentation we used. If you
have any further 
information about them then do let us know please!
Diana


Scot Wilcoxon wrote:
> The tags used in the Brill/Hepple tagger are derived
from the Penn 
> Treebank Project tags. The original definition seems to
be 
> "Part-of-Speech Tagging Guidelines for the Penn
Treebank Project" by 
> Beatrice Santorini. It might be a good idea to archive
it: 
> ftp://ftp.cis.upenn.edu/pub/treebank/doc/tagguide.ps.gz

> ftp://ftp.cis.upenn.edu/pub/treebank/doc/tagguide.tex
> 
> I added descriptions to the GATE list (and sorted the
list). Here are my 
> modifications in case they are useful for the
documentation, although in 
> a simple text format.
> 
> I notice some things are described as
"unknown" although I don't know if 
> that reflects GATE not using them or the author of the
GATE doc not 
> knowing their meaning.
> 
> http://gate.ac
.uk/sale/tao/#x1-368000D
> 
> Appendix D
> Part-of-Speech Tags used in the Hepple Tagger
> 
> CC - coordinating conjunction: "and",
"but", "nor", "or",
"yet", plus, 
> minus, less, times (multiplication), over (division).
Also "for" 
> (because) and "so" (i.e., "so
that").
> CD - cardinal number
> DT - determiner: Articles including "a",
"an", "every", "no",
"the", 
> "another", "any",
"some", "those".
> EX - existential ’there’: Unstressed
"there" that triggers inversion of 
> the inflected verb and the logical subject;
"There was a party in 
> progress".
> FW - foreign word
> IN - preposition or subordinating conjunction
> JJ - adjective: Hyphenated compounds that are used as
modifiers; 
> happy-go-lucky.
> JJR - adjective - comparative: Adjectives with the
comparative ending 
> "-er" and a comparative meaning. Sometimes
"more" and "less".
> JJS - adjective - superlative: Adjectives with the
superlative ending 
> "-est" (and "worst"). Sometimes
"most"and "least".
> JJSS - -unknown-, but probably a variant of JJS
> -LRB- - -unknown-
> LS - list item marker: Numbers and letters used as
identifiers of items 
> in a list.
> MD - modal: All verbs that don't take an
"-s" ending in the third person 
> singular present: "can",
"could", "dare", "may",
"might", "must", 
> "ought", "shall",
"should", "will",
"would".
> NN - noun - singular or mass
> NNP - proper noun - singular: All words in names
usually are capitalized 
> but titles might not be.
> NNPS - proper noun - plural: All words in names usually
are capitalized 
> but titles might not be.
> NNS - noun - plural
> NP - proper noun - singular
> NPS - proper noun - plural
> PDT - predeterminer: Determinerlike elements preceding
an article or 
> possessive pronoun; "all/PDT his marbles",
"quite/PDT a mess".
> POS - possesive ending: Nouns ending in
"'s" or "'".
> PP - personal pronoun
> PRPR$ - -unknown-, but probably possesive pronoun
> PRP - -unknown-, but probably possesive pronoun:
Personal pronouns such 
> as "I", "me",
"you", "he", "him",
"ours", "theirs".
> PRP$ - -unknown-, but probably possesive pronoun:
Possessive pronouns 
> such as "my", "your",
"his", "his", "its",
"one's", "our", and
"their".
> RB - adverb: Most words ending in "-ly".
Also "quite", "too",
"very", 
> "enough", "indeed",
"not", "-n't",
and"never".
> RBR - adverb - comparative: Adverbs ending with
"-er" with a comparative 
> meaning.
> RBS - adverb - superlative
> RP - particle: Mostly monosyllabic words that also
double as directional 
> adverbs.
> STAART - start state marker (used internally)
> SYM - symbol: Technical symbols or expressions that
aren't English words.
> TO - literal ”to”
> UH - interjection: Such as "my",
"oh", "please", "uh",
"well", "yes".
> VBD - verb - past tense: Includes conditional form of
the verb "to be"; 
> "If I were/VBD rich...".
> VBG - verb - gerund or present participle
> VBN - verb - past participle
> VBP - verb - non-3rd person singular present
> VB - verb - base form: Subsumes imperatives,
infinitives and subjunctives.
> VBZ - verb - 3rd person singular present
> WDT - ’wh’-determiner
> WP$ - possesive ’wh’-pronoun: Includes
"whose"
> WP - ’wh’-pronoun: Includes "what",
"who", and "whom".
> WRB - ’wh’-adverb: Includes "how",
"where", "why". Includes
"when" when 
> used in a temporal sense.
> 
> :: - literal colon
> , - literal comma
> $ - literal dollar sign
> - - literal double-dash
> ” - literal double quotes
> ‘ - literal grave
> ( - literal left parenthesis
> . - literal period
> # - literal pound sign
> ) - literal right parenthesis
> ’ - literal single quote or apostrophe
> 
> 


Tagger tags defined
user name
2006-02-21 19:00:42
hi Scot
Just implementing your changes to the doc, and I wonder if
you have made a 
slight mistake here:
You say:

PRP - unknown-, but probably possessive pronoun: Personal
pronouns such as 
"I", "me", "you",
"he", "him", "ours",
"theirs".\\
PRP\$ - unknown, but probably possessive pronoun,such as
"my", "your", "his", 
"his", "its", "one's",
"our", and "their".\\

Should PRP not be a possessive pronoun of the form
"mine", "yours", "his", 
"its", "ours", "theirs"
I think the personal pronouns of the kinds you described
"I" "me" etc. are 
mostly covered by PP.
Or have you another meaning in mind?
Regards
Diana

Scot Wilcoxon wrote:
> The tags used in the Brill/Hepple tagger are derived
from the Penn 
> Treebank Project tags. The original definition seems to
be 
> "Part-of-Speech Tagging Guidelines for the Penn
Treebank Project" by 
> Beatrice Santorini. It might be a good idea to archive
it: 
> ftp://ftp.cis.upenn.edu/pub/treebank/doc/tagguide.ps.gz

> ftp://ftp.cis.upenn.edu/pub/treebank/doc/tagguide.tex
> 
> I added descriptions to the GATE list (and sorted the
list). Here are my 
> modifications in case they are useful for the
documentation, although in 
> a simple text format.
> 
> I notice some things are described as
"unknown" although I don't know if 
> that reflects GATE not using them or the author of the
GATE doc not 
> knowing their meaning.
> 
> http://gate.ac
.uk/sale/tao/#x1-368000D
> 
> Appendix D
> Part-of-Speech Tags used in the Hepple Tagger
> 
> CC - coordinating conjunction: "and",
"but", "nor", "or",
"yet", plus, 
> minus, less, times (multiplication), over (division).
Also "for" 
> (because) and "so" (i.e., "so
that").
> CD - cardinal number
> DT - determiner: Articles including "a",
"an", "every", "no",
"the", 
> "another", "any",
"some", "those".
> EX - existential ’there’: Unstressed
"there" that triggers inversion of 
> the inflected verb and the logical subject;
"There was a party in 
> progress".
> FW - foreign word
> IN - preposition or subordinating conjunction
> JJ - adjective: Hyphenated compounds that are used as
modifiers; 
> happy-go-lucky.
> JJR - adjective - comparative: Adjectives with the
comparative ending 
> "-er" and a comparative meaning. Sometimes
"more" and "less".
> JJS - adjective - superlative: Adjectives with the
superlative ending 
> "-est" (and "worst"). Sometimes
"most"and "least".
> JJSS - -unknown-, but probably a variant of JJS
> -LRB- - -unknown-
> LS - list item marker: Numbers and letters used as
identifiers of items 
> in a list.
> MD - modal: All verbs that don't take an
"-s" ending in the third person 
> singular present: "can",
"could", "dare", "may",
"might", "must", 
> "ought", "shall",
"should", "will",
"would".
> NN - noun - singular or mass
> NNP - proper noun - singular: All words in names
usually are capitalized 
> but titles might not be.
> NNPS - proper noun - plural: All words in names usually
are capitalized 
> but titles might not be.
> NNS - noun - plural
> NP - proper noun - singular
> NPS - proper noun - plural
> PDT - predeterminer: Determinerlike elements preceding
an article or 
> possessive pronoun; "all/PDT his marbles",
"quite/PDT a mess".
> POS - possesive ending: Nouns ending in
"'s" or "'".
> PP - personal pronoun
> PRPR$ - -unknown-, but probably possesive pronoun
> PRP - -unknown-, but probably possesive pronoun:
Personal pronouns such 
> as "I", "me",
"you", "he", "him",
"ours", "theirs".
> PRP$ - -unknown-, but probably possesive pronoun:
Possessive pronouns 
> such as "my", "your",
"his", "his", "its",
"one's", "our", and
"their".
> RB - adverb: Most words ending in "-ly".
Also "quite", "too",
"very", 
> "enough", "indeed",
"not", "-n't",
and"never".
> RBR - adverb - comparative: Adverbs ending with
"-er" with a comparative 
> meaning.
> RBS - adverb - superlative
> RP - particle: Mostly monosyllabic words that also
double as directional 
> adverbs.
> STAART - start state marker (used internally)
> SYM - symbol: Technical symbols or expressions that
aren't English words.
> TO - literal ”to”
> UH - interjection: Such as "my",
"oh", "please", "uh",
"well", "yes".
> VBD - verb - past tense: Includes conditional form of
the verb "to be"; 
> "If I were/VBD rich...".
> VBG - verb - gerund or present participle
> VBN - verb - past participle
> VBP - verb - non-3rd person singular present
> VB - verb - base form: Subsumes imperatives,
infinitives and subjunctives.
> VBZ - verb - 3rd person singular present
> WDT - ’wh’-determiner
> WP$ - possesive ’wh’-pronoun: Includes
"whose"
> WP - ’wh’-pronoun: Includes "what",
"who", and "whom".
> WRB - ’wh’-adverb: Includes "how",
"where", "why". Includes
"when" when 
> used in a temporal sense.
> 
> :: - literal colon
> , - literal comma
> $ - literal dollar sign
> - - literal double-dash
> ” - literal double quotes
> ‘ - literal grave
> ( - literal left parenthesis
> . - literal period
> # - literal pound sign
> ) - literal right parenthesis
> ’ - literal single quote or apostrophe
> 
> 


Tagger tags defined
user name
2006-02-22 02:07:05
See the definition in the Penn Treebank document.  The stuff
before the 
colon is the GATE text; the "unknown" notes here
seem to be by someone 
who did not find the Penn definitions.  Unless the GATE code
has been 
changed from the Penn/Brill behavior...I did not check the
Penn 
definitions against the GATE code.

Diana Maynard wrote:

> hi Scot
> Just implementing your changes to the doc, and I wonder
if you have 
> made a slight mistake here:
> You say:
>
> PRP - unknown-, but probably possessive pronoun:
Personal pronouns 
> such as "I", "me",
"you", "he", "him",
"ours", "theirs".\\
> PRP\$ - unknown, but probably possessive pronoun,such
as "my", "your", 
> "his", "his",
"its", "one's", "our",
and "their".\\
>
> Should PRP not be a possessive pronoun of the form
"mine", "yours", 
> "his", "its",
"ours", "theirs"
> I think the personal pronouns of the kinds you
described "I" "me" etc. 
> are mostly covered by PP.
> Or have you another meaning in mind?
> Regards
> Diana
>
> Scot Wilcoxon wrote:
>
>> The tags used in the Brill/Hepple tagger are
derived from the Penn 
>> Treebank Project tags. The original definition
seems to be 
>> "Part-of-Speech Tagging Guidelines for the
Penn Treebank Project" by 
>> Beatrice Santorini. It might be a good idea to
archive it: 
>>
ftp://ftp.cis.upenn.edu/pub/treebank/doc/tagguide.ps.gz 
>>
ftp://ftp.cis.upenn.edu/pub/treebank/doc/tagguide.tex
>>
>> I added descriptions to the GATE list (and sorted
the list). Here are 
>> my modifications in case they are useful for the
documentation, 
>> although in a simple text format.
>>
>> I notice some things are described as
"unknown" although I don't know 
>> if that reflects GATE not using them or the author
of the GATE doc 
>> not knowing their meaning.
>>
>> http://gate.ac
.uk/sale/tao/#x1-368000D
>>
>> Appendix D
>> Part-of-Speech Tags used in the Hepple Tagger
>>
>> CC - coordinating conjunction: "and",
"but", "nor", "or",
"yet", 
>> plus, minus, less, times (multiplication), over
(division). Also 
>> "for" (because) and "so"
(i.e., "so that").
>> CD - cardinal number
>> DT - determiner: Articles including
"a", "an", "every",
"no", "the", 
>> "another", "any",
"some", "those".
>> EX - existential ’there’: Unstressed
"there" that triggers inversion 
>> of the inflected verb and the logical subject;
"There was a party in 
>> progress".
>> FW - foreign word
>> IN - preposition or subordinating conjunction
>> JJ - adjective: Hyphenated compounds that are used
as modifiers; 
>> happy-go-lucky.
>> JJR - adjective - comparative: Adjectives with the
comparative ending 
>> "-er" and a comparative meaning.
Sometimes "more" and "less".
>> JJS - adjective - superlative: Adjectives with the
superlative ending 
>> "-est" (and "worst").
Sometimes "most"and "least".
>> JJSS - -unknown-, but probably a variant of JJS
>> -LRB- - -unknown-
>> LS - list item marker: Numbers and letters used as
identifiers of 
>> items in a list.
>> MD - modal: All verbs that don't take an
"-s" ending in the third 
>> person singular present: "can",
"could", "dare", "may",
"might", 
>> "must", "ought",
"shall", "should",
"will", "would".
>> NN - noun - singular or mass
>> NNP - proper noun - singular: All words in names
usually are 
>> capitalized but titles might not be.
>> NNPS - proper noun - plural: All words in names
usually are 
>> capitalized but titles might not be.
>> NNS - noun - plural
>> NP - proper noun - singular
>> NPS - proper noun - plural
>> PDT - predeterminer: Determinerlike elements
preceding an article or 
>> possessive pronoun; "all/PDT his
marbles", "quite/PDT a mess".
>> POS - possesive ending: Nouns ending in
"'s" or "'".
>> PP - personal pronoun
>> PRPR$ - -unknown-, but probably possesive pronoun
>> PRP - -unknown-, but probably possesive pronoun:
Personal pronouns 
>> such as "I", "me",
"you", "he", "him",
"ours", "theirs".
>> PRP$ - -unknown-, but probably possesive pronoun:
Possessive pronouns 
>> such as "my", "your",
"his", "his", "its",
"one's", "our", and
"their".
>> RB - adverb: Most words ending in
"-ly". Also "quite",
"too", "very", 
>> "enough", "indeed",
"not", "-n't",
and"never".
>> RBR - adverb - comparative: Adverbs ending with
"-er" with a 
>> comparative meaning.
>> RBS - adverb - superlative
>> RP - particle: Mostly monosyllabic words that also
double as 
>> directional adverbs.
>> STAART - start state marker (used internally)
>> SYM - symbol: Technical symbols or expressions that
aren't English 
>> words.
>> TO - literal ”to”
>> UH - interjection: Such as "my",
"oh", "please", "uh",
"well", "yes".
>> VBD - verb - past tense: Includes conditional form
of the verb "to 
>> be"; "If I were/VBD rich...".
>> VBG - verb - gerund or present participle
>> VBN - verb - past participle
>> VBP - verb - non-3rd person singular present
>> VB - verb - base form: Subsumes imperatives,
infinitives and 
>> subjunctives.
>> VBZ - verb - 3rd person singular present
>> WDT - ’wh’-determiner
>> WP$ - possesive ’wh’-pronoun: Includes
"whose"
>> WP - ’wh’-pronoun: Includes "what",
"who", and "whom".
>> WRB - ’wh’-adverb: Includes "how",
"where", "why". Includes
"when" 
>> when used in a temporal sense.
>>
>> :: - literal colon
>> , - literal comma
>> $ - literal dollar sign
>> - - literal double-dash
>> ” - literal double quotes
>> ‘ - literal grave
>> ( - literal left parenthesis
>> . - literal period
>> # - literal pound sign
>> ) - literal right parenthesis
>> ’ - literal single quote or apostrophe
>>
>>
>
>


Tagger tags defined
user name
2006-02-22 10:21:46
Ah OK, the reason for the confusion is that when I wrote the
GATE 
documentation, I used a list of tags given by the creator of
the tagger, which 
is clearly not quite identical to the list of tags from the
PTB that you cite 
here. Since I don't have the source for my list, just the
original file, I 
don't know where it originally came from. The list we used
had some unknown 
tags (for example, JJSS doesn't appear in the PTB list you
cite). I have no 
idea why. And I'm not sure if any of those actually get
created by the tagger. 
But we're probably safe to assume the PTB ones you give.
Diana


Scot Wilcoxon wrote:
> See the definition in the Penn Treebank document.  The
stuff before the 
> colon is the GATE text; the "unknown" notes
here seem to be by someone 
> who did not find the Penn definitions.  Unless the GATE
code has been 
> changed from the Penn/Brill behavior...I did not check
the Penn 
> definitions against the GATE code.
> 
> Diana Maynard wrote:
> 
>> hi Scot
>> Just implementing your changes to the doc, and I
wonder if you have 
>> made a slight mistake here:
>> You say:
>>
>> PRP - unknown-, but probably possessive pronoun:
Personal pronouns 
>> such as "I", "me",
"you", "he", "him",
"ours", "theirs".\\
>> PRP\$ - unknown, but probably possessive
pronoun,such as "my", "your", 
>> "his", "his",
"its", "one's", "our",
and "their".\\
>>
>> Should PRP not be a possessive pronoun of the form
"mine", "yours", 
>> "his", "its",
"ours", "theirs"
>> I think the personal pronouns of the kinds you
described "I" "me" etc. 
>> are mostly covered by PP.
>> Or have you another meaning in mind?
>> Regards
>> Diana
>>
>> Scot Wilcoxon wrote:
>>
>>> The tags used in the Brill/Hepple tagger are
derived from the Penn 
>>> Treebank Project tags. The original definition
seems to be 
>>> "Part-of-Speech Tagging Guidelines for
the Penn Treebank Project" by 
>>> Beatrice Santorini. It might be a good idea to
archive it: 
>>>
ftp://ftp.cis.upenn.edu/pub/treebank/doc/tagguide.ps.gz 
>>>
ftp://ftp.cis.upenn.edu/pub/treebank/doc/tagguide.tex
>>>
>>> I added descriptions to the GATE list (and
sorted the list). Here are 
>>> my modifications in case they are useful for
the documentation, 
>>> although in a simple text format.
>>>
>>> I notice some things are described as
"unknown" although I don't know 
>>> if that reflects GATE not using them or the
author of the GATE doc 
>>> not knowing their meaning.
>>>
>>> http://gate.ac
.uk/sale/tao/#x1-368000D
>>>
>>> Appendix D
>>> Part-of-Speech Tags used in the Hepple Tagger
>>>
>>> CC - coordinating conjunction:
"and", "but", "nor",
"or", "yet", 
>>> plus, minus, less, times (multiplication), over
(division). Also 
>>> "for" (because) and
"so" (i.e., "so that").
>>> CD - cardinal number
>>> DT - determiner: Articles including
"a", "an", "every",
"no", "the", 
>>> "another", "any",
"some", "those".
>>> EX - existential ’there’: Unstressed
"there" that triggers inversion 
>>> of the inflected verb and the logical subject;
"There was a party in 
>>> progress".
>>> FW - foreign word
>>> IN - preposition or subordinating conjunction
>>> JJ - adjective: Hyphenated compounds that are
used as modifiers; 
>>> happy-go-lucky.
>>> JJR - adjective - comparative: Adjectives with
the comparative ending 
>>> "-er" and a comparative meaning.
Sometimes "more" and "less".
>>> JJS - adjective - superlative: Adjectives with
the superlative ending 
>>> "-est" (and "worst").
Sometimes "most"and "least".
>>> JJSS - -unknown-, but probably a variant of JJS
>>> -LRB- - -unknown-
>>> LS - list item marker: Numbers and letters used
as identifiers of 
>>> items in a list.
>>> MD - modal: All verbs that don't take an
"-s" ending in the third 
>>> person singular present: "can",
"could", "dare", "may",
"might", 
>>> "must", "ought",
"shall", "should",
"will", "would".
>>> NN - noun - singular or mass
>>> NNP - proper noun - singular: All words in
names usually are 
>>> capitalized but titles might not be.
>>> NNPS - proper noun - plural: All words in names
usually are 
>>> capitalized but titles might not be.
>>> NNS - noun - plural
>>> NP - proper noun - singular
>>> NPS - proper noun - plural
>>> PDT - predeterminer: Determinerlike elements
preceding an article or 
>>> possessive pronoun; "all/PDT his
marbles", "quite/PDT a mess".
>>> POS - possesive ending: Nouns ending in
"'s" or "'".
>>> PP - personal pronoun
>>> PRPR$ - -unknown-, but probably possesive
pronoun
>>> PRP - -unknown-, but probably possesive
pronoun: Personal pronouns 
>>> such as "I", "me",
"you", "he", "him",
"ours", "theirs".
>>> PRP$ - -unknown-, but probably possesive
pronoun: Possessive pronouns 
>>> such as "my", "your",
"his", "his", "its",
"one's", "our", and
"their".
>>> RB - adverb: Most words ending in
"-ly". Also "quite",
"too", "very", 
>>> "enough", "indeed",
"not", "-n't",
and"never".
>>> RBR - adverb - comparative: Adverbs ending with
"-er" with a 
>>> comparative meaning.
>>> RBS - adverb - superlative
>>> RP - particle: Mostly monosyllabic words that
also double as 
>>> directional adverbs.
>>> STAART - start state marker (used internally)
>>> SYM - symbol: Technical symbols or expressions
that aren't English 
>>> words.
>>> TO - literal ”to”
>>> UH - interjection: Such as "my",
"oh", "please", "uh",
"well", "yes".
>>> VBD - verb - past tense: Includes conditional
form of the verb "to 
>>> be"; "If I were/VBD rich...".
>>> VBG - verb - gerund or present participle
>>> VBN - verb - past participle
>>> VBP - verb - non-3rd person singular present
>>> VB - verb - base form: Subsumes imperatives,
infinitives and 
>>> subjunctives.
>>> VBZ - verb - 3rd person singular present
>>> WDT - ’wh’-determiner
>>> WP$ - possesive ’wh’-pronoun: Includes
"whose"
>>> WP - ’wh’-pronoun: Includes "what",
"who", and "whom".
>>> WRB - ’wh’-adverb: Includes "how",
"where", "why". Includes
"when" 
>>> when used in a temporal sense.
>>>
>>> :: - literal colon
>>> , - literal comma
>>> $ - literal dollar sign
>>> - - literal double-dash
>>> ” - literal double quotes
>>> ‘ - literal grave
>>> ( - literal left parenthesis
>>> . - literal period
>>> # - literal pound sign
>>> ) - literal right parenthesis
>>> ’ - literal single quote or apostrophe
>>>
>>>
>>
>>
> 
> 


[1-5]

about | contact  Other archives ( Real Estate discussion Medical topics )