0
I Use This!
Activity Not Available

Project Summary

Large-scale, powerful and battery included!

HadoopLDA can train LDA model with large corpus in parallel on a Hadoop cluster. It use distributed Gibbs Sampling technique, with built-in vocabulary selection. HadoopLDA is easy to use, a single command can turn huge amount of documents into a compact topic model file, and a Java class(LdaModel) is included for use the model easily in your code.

Scientific research? Comparing document similarity? Matching ads to users? Discover corpus structure? You choose. With HadoopLDA and a Hadoop cluster, it will be an easy task.

Source code, binary package and Getting Started doc will be uploaded soon.

Tags

distributed gibbs-sampling hadoop lda parallel topic-model

In a Nutshell, hadooplda...

 No code available to analyze

Open Hub computes statistics on FOSS projects by examining source code and commit history in source code management systems. This project has no code locations, and so Open Hub cannot perform this analysis

Is this project's source code hosted in a publicly available repository? Do you know the URL? If you do, click the button below and tell us so that Open Hub can generate statistics! It's fast and easy - try it and see!

Add a code location

Apache License 2.0
Permitted

Place Warranty

Sub-License

Private Use

Use Patent Claims

Commercial Use

Modify

Distribute

Forbidden

Hold Liable

Use Trademarks

Required

Include Copyright

State Changes

Include License

Include Notice

These details are provided for information only. No information here is legal advice and should not be used as such.

All Licenses

This Project has No vulnerabilities Reported Against it

Did You Know...

  • ...
    Black Duck offers a free trial so you can discover if there are open source vulnerabilities in your code
  • ...
    data presented on the Open Hub is available through our API
  • ...
    nearly 1 in 3 companies have no process for identifying, tracking, or remediating known open source vulnerabilities
  • ...
    learn about Open Hub updates and features on the Open Hub blog

 No code available to analyze

Open Hub computes statistics on FOSS projects by examining source code and commit history in source code management systems. This project has no code locations, and so Open Hub cannot perform this analysis

Is this project's source code hosted in a publicly available repository? Do you know the URL? If you do, click the button below and tell us so that Open Hub can generate statistics! It's fast and easy - try it and see!

Add a code location

Community Rating

Be the first to rate this project
Click to add your rating
   Spinner
Review this Project!
Sample ohloh analysis