ghtorrent: Mirror and index data from the Github API
A library and a collection of scripts used to retrieve data from the Github API and extract metadata in an SQL database, in a modular and scalable manner. The scripts are distributed as a Gem (ghtorrent), but they can also be run by checking out this repository.
GHTorrent can be used for a variety of purposes, such as:
* Mirror the Github API event stream and follow links from events to actual data to gradually build a Github index
* Create a queriable metadata database for a specific repository
* Construct a data source for extracting process analytics (see for example those) for one or more repositories
Use Patent Claims
Include Install Instructions
These details are provided for information only. No information here is legal advice and should not be used as such.