make.phrase.matrix {textreg} | R Documentation |
Generate a n by p phrase count matrix, with n being number of documents and p being number of phrases: \tabularrrrrr 0 \tab 0 \tab 0 \tab 0 \tab 0 \cr 1 \tab 6 \tab 2 \tab 0 \tab 0 \cr 8 \tab 0 \tab 0 \tab 0 \tab 0 This is the phrase equivilent of a document-term matrix.
make.phrase.matrix(phrase_list, corpus)
phrase_list |
List of strings |
corpus |
A corpus object from tm package |
a n X p matrix, n being number of documents, p being number of phrases.
Other textregCounting: make.count.table
,
phrase.count
library( tm ) data( bathtub ) lbl = meta( bathtub )$meth.chl head( make.phrase.matrix( c("bathtub","strip+", "vapor *"), bathtub ) )