make.count.table {textreg} | R Documentation |
Given a list of phrases, count how many documents they appear in and subdivide by positive and negative appearance.
make.count.table(phrases, labeling, corpus)
phrases |
List of strings |
labeling |
Vector of +1/0/-1 labels |
corpus |
A corpus object from tm package |
This method does not consider multiple counts of phrases within documents.
Phrases can have wildcards and stemming notation. See grab.fragments
.
a dataframe of statistics. per.pos is the percent of the documents with the phrase that are positively labeled. per.tag is the percent of the positively labeled documents that have the phrase.
grab.fragments
Other textregCounting: make.phrase.matrix
,
phrase.count
library( tm ) data( bathtub ) lbl = meta( bathtub )$meth.chl make.count.table( c("bathtub","strip+", "vapor *"), lbl, bathtub )