Resource: New York Times Annotated Corpus

General Info
Project
Creation
Documentation
Access
Resource-specific information
Data files
About...
Cite as

Data Files

Persistent Identifier (PID) of this digital object: https://hdl.handle.net/11022/0000-0000-2CC1-5

Resource landing page: https://hdl.handle.net/11022/0000-0000-2CC1-5

Packaged files for this dataset:

Data Files
Persistent Identifier (PID) of this digital object:	https://hdl.handle.net/11022/0000-0000-2CC1-5
Resource landing page:	https://hdl.handle.net/11022/0000-0000-2CC1-5
Packaged files for this dataset:

This data set contains the following subordinate data objects:

This data set contains the following data streams:

https://hdl.handle.net/11022/0000-0000-2CC1-5@NYTAnnotatedCorpus.zip (application/zip) - 3 GB

General Information

Resource Name: New York Times Annotated Corpus

Resource Title: New York Times Annotated Corpus

Resource Class: Corpus

Version: archived

Life Cycle Status:

Start Year:

Completion Year:

Publication Date: 2008

Last Update:

Time Coverage:

Legal Owner:

Genre: unspecified

Field of Research:

Location: Deutschland Germany

Description:
English: The New York Times Annotated Corpus contains over 1.8 million articles written and published by the New York Times between January 1, 1987 and June 19, 2007 with article metadata provided by the New York Times Newsroom, the New York Times Indexing Service and the online production staff at nytimes.com.

Tags:

Modality Info: written

General Information
Resource Name:	New York Times Annotated Corpus
Resource Title:	New York Times Annotated Corpus
Resource Class:	Corpus
Version:	archived
Life Cycle Status:
Start Year:
Completion Year:
Publication Date:	2008
Last Update:
Time Coverage:
Legal Owner:
Genre:	unspecified
Field of Research:
Location:	Deutschland Germany
Description:	English: The New York Times Annotated Corpus contains over 1.8 million articles written and published by the New York Times between January 1, 1987 and June 19, 2007 with article metadata provided by the New York Times Newsroom, the New York Times Indexing Service and the online production staff at nytimes.com.
Tags:
Modality Info:	written

Project

Project Name: SFB 833 INF

Project Title: Heterogene Forschungsprimärdaten des SFB 833 – Repräsentation und Verarbeitung Heterogenous Primary Research Data of the SFB 833 - Representation and Processing

Project ID: 75650358

Url: https://www.sfb833.uni-tuebingen.de/infrastrukturprojekt-inf.html?type=0

Funder: Deutsche Forschungsgemeinschaft (DFG)

Institution: Sonderforschungsbereich 833: Bedeutungskonstitution- Dynamik und Adaptivität sprachlicher Strukturen SFB 833: The construction of meaning - the dynamics and adaptivity of linguistic structures Other: Eberhard Karls Universität Tübingen

Cooperations:

Person(s): Erhard Hinrichs

Descriptions:

Duration:

Project
Project Name:	SFB 833 INF
Project Title:	Heterogene Forschungsprimärdaten des SFB 833 – Repräsentation und Verarbeitung Heterogenous Primary Research Data of the SFB 833 - Representation and Processing
Project ID:	75650358
Url:	https://www.sfb833.uni-tuebingen.de/infrastrukturprojekt-inf.html?type=0
Funder:	Deutsche Forschungsgemeinschaft (DFG)
Institution:	Sonderforschungsbereich 833: Bedeutungskonstitution- Dynamik und Adaptivität sprachlicher Strukturen SFB 833: The construction of meaning - the dynamics and adaptivity of linguistic structures Other: Eberhard Karls Universität Tübingen
Cooperations:
Person(s):	Erhard Hinrichs
Descriptions:
Duration:

Creation

Topic:

Creator(s):

Source:

Original Source New York Times (Newswire)

Catalogue Link:

Type:

Format:

Size:

Quality:

Description:

Derivation:

Organisation(s) Linguistic Data Consortium

Derivation Date

Derivation Mode(s)

Derivation Type(s) Information extraction, Information retrieval, Metadata extraction, Summarization

Derivation Workflow(s)

Derivation Tool Info

Documentations

Documentation Type(s):

File Name(s):

URL:

Documentation Language(s): English

Descriptions(s):

Text Corpus

Corpus Type: specialised corpus

Temporal Classification:

Description(s):

Validation:

Subject Language(s):

Type-specific Size Info:

Text Corpus
Corpus Type:	specialised corpus
Temporal Classification:
Description(s):
Validation:
Subject Language(s):
Type-specific Size Info:

Access

Availability: request required

Distribution Medium: Download One DVD

Catalogue Link:

Price:

Licence: Linguistic Data Consortium, The Trustees of the University of Pennsylvania.

Contact: Thorsten Trippel (Archive Manager) , e-mail:thorsten.trippel@uni-tuebingen.de

Deployment Tool Info:

Descriptions:

Access
Availability:	request required
Distribution Medium:	Download One DVD
Catalogue Link:
Price:
Licence:	Linguistic Data Consortium, The Trustees of the University of Pennsylvania.
Contact:	Thorsten Trippel (Archive Manager) , e-mail:thorsten.trippel@uni-tuebingen.de
Deployment Tool Info:
Descriptions:

This digital object contains:

Original File Name Size Checksums

NYTAnnotatedCorpus.zip 3233831237B

2b7685aab3557dd88ae18e480a41ff93070036fe (SHA1)

About

Generation: Automatically generated with an XSL stylesheet from the CMDI file, v.02

Contact: Thorsten Trippel and Claus Zinn, SfS Tuebingen

Original File Name	Size	Checksums
NYTAnnotatedCorpus.zip	3233831237B	2b7685aab3557dd88ae18e480a41ff93070036fe (SHA1)

About
Generation:	Automatically generated with an XSL stylesheet from the CMDI file, v.02
Contact:	Thorsten Trippel and Claus Zinn, SfS Tuebingen

(): New York Times Annotated Corpus Persistent identifier: https://hdl.handle.net/11022/0000-0000-2CC1-5
This resource is provided through the technology partnership with the Tübingen Archive of Language Resources

Documentation Type(s):
File Name(s):
URL:
Documentation Language(s):	English
Descriptions(s):

Resource: New York Times Annotated Corpus

Data Files

General Information

Project

Creation

Documentations

Text Corpus

Access

About

(): New York Times Annotated Corpus Persistent identifier: https://hdl.handle.net/11022/0000-0000-2CC1-5

(): New York Times Annotated Corpus

Persistent identifier: https://hdl.handle.net/11022/0000-0000-2CC1-5