Tags : Browse Projects

WEKA

  Analyzed about 10 hours ago

Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes.

780K lines of code

3 current contributors

over 1 year since last commit

38 users on Open Hub

Very Low Activity
3.93333
   
Licenses: No declared licenses

PureData

  Analyzed 4 months ago

Pd (aka Pure Data) is a real-time graphical programming environment for audio, video, and graphical processing. It is the third major branch of the family of patcher programming languages known as Max (Max/FTS, ISPW Max, Max/MSP, jMax, etc.) originally developed by Miller Puckette and company at IRCAM. The core of Pd is written and maintained by Miller Puckette and includes the work of many developers, making the whole package very much a community effort.

304K lines of code

13 current contributors

over 2 years since last commit

36 users on Open Hub

Activity Not Available
5.0
 
Licenses: BSD-3-Clause, GPL

javacc

  No analysis available

Java Compiler Compiler is the most popular parser generator for use with Java applications. A parser generator is a tool that reads a grammar specification and converts it to a Java program that can recognize matches to the grammar. In addition to the parser generator itself, JavaCC provides other standard capabilities related to parser generation such as tree building (via a tool called JJTree included with JavaCC), actions, debugging, etc.

0 lines of code

0 current contributors

0 since last commit

33 users on Open Hub

Activity Not Available
4.54545
   
Mostly written in language not available
Licenses: BSD-3-Clause

Apache Mahout

Claimed by Apache Software Foundation

  Analyzed about 4 hours ago

Apache Mahout's goal is to build scalable machine learning libraries. By scalable we mean scalable to reasonably large data sets. Our core algorithms for clustering, classification, and batch-based collaborative filtering are implemented on top of Apache Hadoop using the map/reduce paradigm. However, we do not restrict contributions to Hadoop-based implementations: contributions that run on a single node or on a non-Hadoop cluster are welcome as well. The core libraries are highly optimized to allow for good performance for non-distributed algorithms as well.

146K lines of code

0 current contributors

2 months since last commit

25 users on Open Hub

Low Activity
3.6
   

NetworkX

Claimed by Los Alamos National Lab

  Analyzed 1 day ago

NetworkX (NX) is a Python package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks.

Features:
* Includes standard graph-theoretic and statistical physics functions
* Easy exchange of network algorithms between applications, disciplines, and platforms
* Includes many classic graphs and synthetic networks
* Nodes and edges can be "anything" (e.g. time-series, text, images, XML records)
* Exploits existing code from high-quality legacy software in C, C++, Fortran, etc.
* Open source (encourages community input)
* Unit-tested

Additional benefits due to Python:
* Allows fast prototyping of new algorithms
* Easy to teach
* Multi-platform
* Allows easy access to almost any database

95.2K lines of code

102 current contributors

4 days since last commit

18 users on Open Hub

High Activity
4.33333
   

STXXL

  Analyzed about 15 hours ago

STXXL - Standard Template Library for Extra Large Data Sets. STXXL is an implementation of the C++ standard template library STL for external memory (out-of-core) computations, containers, and algorithms that can process huge volumes of data that only fit on disks.

70.3K lines of code

3 current contributors

over 5 years since last commit

15 users on Open Hub

Inactive
5.0
 

gCube

  Analyzed about 18 hours ago

gCube is a software system specifically designed and developed to enact the building and operation of *large scale infrastructures* providing their users with a rich array of services suitable for supporting the co-creation of *Virtual Research Environments* and promoting the implementation of *open science* workflows and practices. It is at the heart of the D4Science.org infrastructure (www.d4science.org).

1.49M lines of code

15 current contributors

4 days since last commit

14 users on Open Hub

High Activity
4.66667
   

dlib C++ Library

  Analyzed about 20 hours ago

This project is a modern C++ library with a focus on portability and program correctness. It strives to be easy to use right and hard to use wrong. Thus, it comes with extensive documentation and thorough debugging modes. The library provides a platform abstraction layer for common tasks such as interfacing with network services, handling threads, or creating graphical user interfaces. Additionally, the library implements many useful algorithms such as data compression routines, linked lists, binary search trees, linear algebra and matrix utilities, machine learning algorithms, XML and text parsing, and many other general utilities.

450K lines of code

25 current contributors

1 day since last commit

11 users on Open Hub

Moderate Activity
4.75
   

ScUtil

  Analyzed about 23 hours ago

Hundreds of functions on a variety of topics, from statistics to string parsing, module utilities to network tools. Everyone's pet library accumulates features over time. My Erlang library got big, fast. I often find myself giving functions from it out to other people, and a lot of my other libraries are dependent on ScUtil in various ways, so I figured what the hell, let's give it away. This library is believed to be efficiently implemented at all points. Efficiency tips are, however, both appreciated and taken seriously. ScUtil uses the TestErl library for unit, regression and stochastic testing. ScUtil is free and MIT licensed, because the GPL is evil. ScUtil is written by John Haugeland, from http://fullof.bs/ .

9.39K lines of code

0 current contributors

about 8 years since last commit

11 users on Open Hub

Inactive
4.8
   

Revenue Management Optimisation Library

  Analyzed about 13 hours ago

This project (RMOL) aims to give developers working in the revenue management field a library that provides a reference implementation of most of the published algorithms and methods.

23.6K lines of code

1 current contributor

about 1 month since last commit

8 users on Open Hub

Very Low Activity
5.0
 