Temporal annotation
Temporal annotation is the study of how to automatically add semantic information regarding time to natural language documents. It plays a role in natural language processing and computational linguistics.
About
Temporal annotation involves the application of a semantic annotation to a document. Significant temporal annotation standards include TimeML, ISO-TimeML and TIDES.[1] These standards typically include annotations for some or all of temporal expressions (or timexes), events, temporal relations, temporal signals,[2] and temporal relation types.
In natural language texts, events may be associated with times; e.g., they may start or end at a given at a time. Events are also associated with other events, like occurring before or after them. We call these relations temporal relations. Temporal relation typing classifies the relation between two arguments, and is an important and difficult sub-task of figuring out all the temporal information in a document. Allen's interval algebra is one scheme for types of temporal relations. Rule-engineering and machine learning approaches to temporal annotation have both been successful, though achieving high performance in temporal relation typing remains a difficult task.
Applications
Successful temporal annotation enables systems to find out when facts asserted in texts are true, to build timelines,[3] to extract plans, and to discover mentions of change. This has had applications in many domains, such as information extraction, digital history,[4] processing social media, and clinical text mining.
Evaluation
The TempEval task series sets a shared temporal annotation task, and has run at SemEval three times, attracting system entries from around the world.[5][6][7] The task originally centred on determining the types of temporal relations only. In TempEval-2 and -3, this expanded to include event and timex annotation. In addition, the i2b2 clinical evaluation shared task was a temporal annotation exercise in 2012, which attracted a great deal of interest.[8]
Further reading
- Boguraev, B. and Ando, R.K. (2005), TimeML-Compliant Text Analysis for Temporal Reasoning. Proceedings of IJCAI.
- Derczynski, L. (2013), Determining the Types of Temporal Relations in Discourse, PhD thesis, University of Sheffield.
- Pustejovsky et al. (2003), The TimeBank Corpus, Proceedings of the Corpus Linguistics Conference.
- Pustejovsky et al. (2005), The specification language TimeML, in 'The Language of Time'. ISBN 9780199268542.
- UzZaman, N. and Allen, J. (2010), Event and Temporal Expression extraction from raw text: first step towards a temporally aware system, International Journal of Semantic Computing 4(4).
References
- Ferro, L (2005). "TIDES 2005 standard for the annotation of temporal expressions" (PDF). MITRE Corporation Technical Report.
- Derczynski, L (2011). "A corpus-based study of temporal signals". Proceedings of the Corpus Linguistics Conference. arXiv:1203.5066.
- Ji, Heng (2013). "Tackling Representation, Annotation and Classification Challenges for Temporal Knowledge Base Population". Journal of Knowledge and Information Systems.
- Cybulska, A (2011). "Historical event extraction from text". Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities.
- Verhagen, Marc (2007). "Semeval-2007 task 15: Tempeval temporal relation identification". Proceedings of the Workshop on Semantic Evaluation.
- Verhagen, Marc (2010). "Semeval-2010 task 13: Tempeval-2". Proceedings of the Workshop on Semantic Evaluation.
- UzZaman, N (2013). "Semeval-2013 task 1: Tempeval-3". Proceedings of the Workshop on Semantic Evaluation.
- Sun, W (2013). "Evaluating temporal relations in clinical text: 2012 i2b2 Challenge". Journal of the American Medical Informatics Association. 20 (5): 806–13. doi:10.1136/amiajnl-2013-001628. PMC 3756273. PMID 23564629.