readReut21578XML {tm} | R Documentation |
Read in a Reuters-21578 XML document.
readReut21578XML(elem, language, id) readReut21578XMLasPlain(elem, language, id)
elem |
a named list with the component |
language |
a string giving the language. |
id |
Not used. |
An XMLTextDocument
for readReut21578XML
, or a
PlainTextDocument
for readReut21578XMLasPlain
,
representing the text and metadata extracted from elem$content
.
Lewis, David (1997) Reuters-21578 Text Categorization Collection Distribution 1.0. http://kdd.ics.uci.edu/databases/reuters21578/reuters21578.html
Luz, Saturnino XML-encoded version of Reuters-21578. http://ronaldo.cs.tcd.ie/esslli07/data/reuters21578-xml/
Reader
for basic information on the reader infrastructure
employed by package tm.