Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

OpenBDRE

Compare

  Analyzed 1 day ago

Bigdata Ready Enterprise Open Source Software developed and contributed by Wipro

0 lines of code

0 current contributors

over 8 years since last commit

1 users on Open Hub

Activity Not Available
0.0
 
I Use This
Mostly written in language not available
Licenses: apache_2

DevOps Python Tools

Compare

  Analyzed about 11 hours ago

DevOps CLI Tools for Hadoop, Spark, HBase, Log Anonymizer, Ambari Blueprints, AWS CloudFormation, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Elasticsearch, Solr, Travis CI, Pig, IPython - Python / Jython Tools

18K lines of code

2 current contributors

3 months since last commit

1 users on Open Hub

Very Low Activity
0.0
 
I Use This
Licenses: No declared licenses

DevOps Perl Tools

Compare

  Analyzed about 6 hours ago

DevOps CLI Tools for Hadoop, Hive, HDFS file/snapshot age out, Solr / SolrCloud CLI, Ambari FreeIPA Kerberos, Config / Log Anonymizer, URL watcher for load balanced web farms, SQL ReCaser (Hive, Impala, Cassandra CQL, Couchbase N1QL, MySQL, PostgreSQL, Apache Drill, Microsoft SQL Server, Oracle, Pig ... [More] Latin, Neo4j, InfluxDB, Dockerfiles), Nginx stats watcher, Datameer, Linux tools... [Less]

5.81K lines of code

1 current contributors

18 days since last commit

1 users on Open Hub

Very Low Activity
0.0
 
I Use This
Licenses: No declared licenses

HariSekhon's Dockerfiles

Compare

  Analyzed about 7 hours ago

DockerHub public images - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr / SolrCloud, Presto, Apache Drill, Nifi, Spark, Superset, H2O, Mesos, Serf, Consul, Riak, Alluxio, Jython, Advanced Nagios Plugins Collection / PyTools / Tools repos on CentOS / Ubuntu / Debian / Alpine

7.75K lines of code

2 current contributors

3 months since last commit

1 users on Open Hub

Very Low Activity
0.0
 
I Use This

haproxy-configs

Compare

  Analyzed about 18 hours ago

HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, Hortonworks, Cloudera, MapR, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, ZooKeeper, Graphite, InfluxDB, OpenTSDB, Prometheus, Kibana, SSH, RabbitMQ, Redis, Riak, Rancher etc

299 lines of code

1 current contributors

3 months since last commit

1 users on Open Hub

Very Low Activity
0.0
 
I Use This

Talend Open Studio for Big Data

Compare

  Analyzed 3 months ago

Studio open source projects related to Big Data Open Studio for Big Data Start working with Hadoop and NoSQL databases today using simple, graphical tools and wizards to generate native code that leverages the full power of Hadoop

126K lines of code

43 current contributors

3 months since last commit

0 users on Open Hub

Activity Not Available
0.0
 
I Use This

Elephant Bird

Compare

  Analyzed 1 day ago

Twitter's library of LZO and/or Protocol Buffer-related Hadoop InputFormats, OutputFormats, Writables, Pig LoadFuncs, HBase miscellanea, etc. The majority of these are in production at Twitter running over data every day.

26K lines of code

0 current contributors

about 7 years since last commit

0 users on Open Hub

Inactive
0.0
 
I Use This

Sizzle (Sawzall)

Compare

  Analyzed 1 day ago

A compiler and runtime for Google's Sawzall language, optimized for Hadoop

17K lines of code

0 current contributors

about 11 years since last commit

0 users on Open Hub

Inactive
0.0
 
I Use This
Licenses: No declared licenses
Tags hadoop

mrjob

Compare

  Analyzed 1 day ago

Run MapReduce jobs on Hadoop or Amazon Web Services

48.3K lines of code

8 current contributors

over 3 years since last commit

0 users on Open Hub

Inactive
0.0
 
I Use This

dispy

Compare

  Analyzed about 13 hours ago

dispy is a Python framework for parallel execution of computations by distributing them across multiple processors in a single machine (SMP), among many machines in a cluster or grid. dispy distributes computations (Python functions or standalone programs) and their dependencies (files, Python ... [More] functions, classes, modules) automatically and schedules jobs for parallel execution. dispy supports client-side and server-side fault recovery, SSL for security, and more. dispy is implemented with asyncoro, an independent framework for developing concurrent programs with asynchronous (non-blocking) sockets and coroutines (without threads) using polling mechanisms epoll, kqueue, devpoll and poll, and Windows I/O Completion Ports (IOCP), for high performance and scalability. [Less]

24.3K lines of code

3 current contributors

6 months since last commit

0 users on Open Hub

Very Low Activity
0.0
 
I Use This