A set of scripts that generate synthetic credit card data for benchmarking data processing applications that require high volumes of data. The scripts generate a single data set distributed across data files on multiple nodes of one or more racks. Data generation can be initiated from a single central node and then runs concurrently on multiple remote nodes of the cluster.
View README.html in the distribution for more details, including information on using the scripts.
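As a rough illustration of the layout described above (one data set, split across nodes, started from a central node), here is a minimal Python sketch. It is not the project's implementation: the function names, the record fields, and the use of local threads as stand-ins for remote nodes are all assumptions made for the example.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def partition(total_records, nodes):
    """Split record ids 0..total_records-1 into one contiguous range per node."""
    base, extra = divmod(total_records, len(nodes))
    ranges, start = {}, 0
    for i, node in enumerate(nodes):
        count = base + (1 if i < extra else 0)
        ranges[node] = range(start, start + count)
        start += count
    return ranges


def generate_records(node, ids, seed=42):
    """Hypothetical per-node worker: builds synthetic rows for its id range."""
    rows = []
    for record_id in ids:
        # Seed per record so the output does not depend on which node runs it.
        rng = random.Random(seed * 1_000_003 + record_id)
        account = rng.randrange(10**15, 10**16)   # made-up 16-digit account number
        amount_cents = rng.randrange(1, 500_000)  # made-up transaction amount
        rows.append((record_id, account, amount_cents))
    return node, rows


def run_from_central_node(total_records, nodes):
    """Start all node workers concurrently and collect their rows by node name."""
    ranges = partition(total_records, nodes)
    # Threads stand in for remote nodes; a real cluster would use remote execution.
    with ThreadPoolExecutor(max_workers=len(nodes)) as pool:
        futures = [pool.submit(generate_records, n, r) for n, r in ranges.items()]
        return dict(f.result() for f in futures)
```

Calling `run_from_central_node(10, ["node1", "node2", "node3"])` returns one list of rows per node. Because each record is seeded by its id, the combined output forms the same data set regardless of how many nodes share the work.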