English News Text Treebank: Penn Treebank Revised
- Resource
- URL
- https://dss2.princeton.edu/data/301/
- Blurb
-
English News Text Treebank: Penn Treebank Revised was developed by the Linguistic Data Consortium (LDC) with funding through a gift from Google Inc. It consists of a combination of automated and manual revisions of the Penn Treebank annotation of Wall Street Journal (WSJ) stories. The data is comprised of 1,203,648 word-level tokens in 49,191 sentence-level tokens -- in all 2,312 of the original Penn Treebank WSJ files.
- Link time
- 2020-05-21 19:25:00 UTC
- Sample
- Principal investigator
- Producer
- Distributor
- Version
- More detail URL
- Resource type
- Single study
- Subjects
- Art & Culture
- Regions
- Countries
- United States