The basic idea of latent semantic analysis (LSA) is, that text do have a higher order (=latent semantic) structure which, however, is obscured by word usage (e.g. through the use of synonyms or polysemy). By using conceptual indices that are derived statistically via a truncated singular value decomposition (a two-mode factor analysis) over a given document-term matrix, this variability problem can be overcome.
| Version: | 0.63-3 |
| Depends: | R (≥ 2.10), Snowball, RWeka |
| Published: | 2011-06-26 |
| Author: | Fridolin Wild |
| Maintainer: | Fridolin Wild <f.wild at open.ac.uk> |
| License: | GPL (≥ 2) |
| NeedsCompilation: | no |
| In views: | NaturalLanguageProcessing |
| CRAN checks: | lsa results |
| Package source: | lsa_0.63-3.tar.gz |
| MacOS X binary: | lsa_0.63-3.tgz |
| Windows binary: | lsa_0.63-3.zip |
| Reference manual: | lsa.pdf |
| Old sources: | lsa archive |