rlda.bernoulli {Rlda} | R Documentation |
This method implements the Latent Dirichlet Allocation with
Stick-Breaking prior for bernoulli data.
rlda.bernoulli
works with binary data.frame.
rlda.bernoulli(data, n_community, alpha0, alpha1, gamma, n_gibbs, ll_prior = TRUE, display_progress = TRUE)
data |
A binary data.frame where each row is a sampling unit (i.e. Plots, Locations, Time, etc.) and each column is a categorical type of element (i.e. Species, Firms, Issues, etc.). The elements inside this data.frame must be Zeros and Ones. |
n_community |
Total number of communities to return. It must be less than
the total number of columns inside the |
alpha0 |
Hyperparameter associated with the Beta prior Beta(alpha0, alpha1). |
alpha1 |
Hyperparameter associated with the Beta prior Beta(alpha0, alpha1). |
gamma |
Hyperparameter associated with the Stick-Breaking prior. |
n_gibbs |
Total number of Gibbs Samples. |
ll_prior |
boolean scalar indicating |
display_progress |
boolean scalar |
rlda.bernoulli
uses a modified Latent Dirichlet Allocation method
to construct Mixed-Membership Clusters using Bayesian Inference.
The data
must be a non-empty data.frame with the binaries values
Zero or Ones for each variable (column) in each observation (row).
A R List with three elements:
Theta |
The individual probability for each observation
(ex: location) belong in each cluster (ex: community). It is a matrix
with dimension equal |
Phi |
The individual probability for each variable
(ex: Specie) belong in each cluster (ex: community). It is a matrix
with dimension equal |
LogLikelihood |
The vector of Log-Likelihoods compute for each Gibbs Sample. |
The Theta
and Phi
matrix can be obtained for the i-th gibbs
sampling using matrix(Theta[i,], nrow = nrow(data), ncol = n_community)
and
matrix(Phi[i,], nrow = n_community, ncol = ncol(data))
, respectively.
Pedro Albuquerque.
pedroa@unb.br
http://pedrounb.blogspot.com/
Denis Valle.
drvalle@ufl.edu
http://denisvalle.weebly.com/
Daijiang Li.
daijianglee@gmail.com
http://daijiang.name/
Blei, David M., Andrew Y. Ng, and Michael I. Jordan.
"Latent dirichlet allocation." Journal of machine Learning research
3.Jan (2003): 993-1022.
http://www.jmlr.org/papers/volume3/blei03a/blei03a.pdf
Valle, Denis, et al.
"Decomposing biodiversity data using the Latent Dirichlet
Allocation model, a probabilistic multivariate statistical
method." Ecology letters 17.12 (2014): 1591-1601.
rlda.multinomial
, rlda.binomial
## Not run: library(Rlda) # Presence data(presence) # Set seed set.seed(9842) # Hyperparameters for each prior distribution gamma <- 0.01 alpha0 <- 0.01 alpha1 <- 0.01 # Execute the LDA for the Bernoulli entry res <- rlda.bernoulli(data = presence, n_community = 10, alpha0 = alpha0, alpha1 = alpha1, gamma = gamma, n_gibbs = 5000,ll_prior = TRUE, display_progress = TRUE) ## End(Not run)