2
I Use This!
Inactive

Commits : Listings

Analyzed about 4 hours ago. based on code collected about 4 hours ago.
Aug 31, 2024 — Aug 31, 2025
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
add some exceptionally sleazy UDF functions for Pig More... almost 12 years ago
get the OutputCommitter working for the main otuput path in PSE3, the one that goes through the reducer. One Q is does commit happen at the end of the map, the end of the reduce or at both? More... almost 12 years ago
working on #25, i am checking in something that doesn't quite work but is promising. at this point, output is being sent to a _temporary directory, but it never gets copied to the permanent location, which probably is because we're never committing the output, something I'm afraid would happen. No fear, let's understand this one More... almost 12 years ago
issue 25 -- as a first step, I've cut-and-pasted the old MultipleOutput class into my system and then I backported the pse3 code and tests that used the old MultipleOutput, got them running again in integration test More... almost 12 years ago
check in chopper library imports and a preliminary script to compute CDF of URI nodes in all positions More... almost 12 years ago
add debugging for a limited sample of bad triples so we can get some idea of what we are up against More... almost 12 years ago
generalize the freebase prefilter so it handles the new dumps More... almost 12 years ago
register specific version jar; fix bug where jar definition on command line not respected More... almost 12 years ago
add more powerful cluster configs to the system More... almost 12 years ago
better stub value for AWS creds More... about 12 years ago
at this point i am submitting jobs with ease to EMR. Lots of little changes support this including the addition of a mechanism for transmitting POSIX error codes through exceptions. More... about 12 years ago
get the wiring working for the AmazonEMRCluster by blocking out a conflicting dependency on httpconfig that we got from jena More... about 12 years ago
wire up support for finding jar files More... about 12 years ago
and here we catch up with the promise that you can override the spring stuff by writing your own .haruhi/applicationContext.xml file More... about 12 years ago
haruhi shell now has the modern syntax that lets you zip by with the defaults; -clusterId and -jarId are known to fail in the right way when when we specify an alternative but have not seen real use More... about 12 years ago
add MavenManagedJar class and default configuration for bakemono jar More... about 12 years ago
haruhi/src/main/resources/com/ontology2/haruhi/shell/applicationContext.xml More... about 12 years ago
refactor: the clusterId doesn't need to know its cluster id because it is configured by spring More... about 12 years ago
use Redirect.INHERIT instead of the stream copy so we get better interactive feel and less cruft More... about 12 years ago
add path search for hadoop binary More... about 12 years ago
do some fixup of error handling so at least the right thing happens when hadoop fails (just nuked my namenode) More... about 12 years ago
fixed up handling of incomplete inpuit data in the haruhi shell More... about 12 years ago
renamed JobApplication to JobApp because App is the right prefix in centipede, even though JobApplication sounded so much more fun. clean up junk comments. More... about 12 years ago
configure log4 and create shell hack so we can just type 'haruhi run ...' More... about 12 years ago
executable strategy for haruhi binary figured out; we package it as an execuatable jar More... about 12 years ago
delete obsolete documentation file More... about 12 years ago
haruhi shell lights up, but of course it doesn't do anything yet More... about 12 years ago
get rid of maven-generated stubs and check in eclipse files More... about 12 years ago
fix build issue (almost certainly this code will go away because I'm probably going to switch to cascading because Hadoop's model for composability is for the birds More... about 12 years ago
add haruhi project More... about 12 years ago