Black Duck Open Hub
archive-commons
Inactive
Commits: Listings
Analyzed about 19 hours ago, based on code collected 1 day ago.
Apr 18, 2023 — Apr 18, 2024
Showing page 1 of 111
last commit ever to "archive-commons"!
Noah Levitt
about 11 years ago
ia-tools: add AccessControlAllowCapture(url, timestamp) udf which can be used to query an access oracle to see if a given url should be included/excluded. Adding wayback-access-control/access-control lib as a dependency to ia-tools (seems to be best/simplest way to do this)
Ilya Kreymer
about 11 years ago
archive-commons: extend TimestampDedupIterator with TimestampCustomDedupIterator which also checks an additional field (default status field) when doing dedup (will probably be refactored more). ia-tools: fix bug in HttpZipNumDerefLineRecordReader, init cluster before use
Ilya Kreymer
about 11 years ago
getURLString(boolean,boolean,boolean) - omit opening paren when includeScheme is false(); new convenience method getSURTString(boolean includeScheme)
Noah Levitt
about 11 years ago
remove archive-surt dependency
Noah Levitt
about 11 years ago
update guava library to latest 14.0.1
Noah Levitt
about 11 years ago
remove unneeded(?) obsolete(?) archive-surt
Noah Levitt
about 11 years ago
Rename GoogleURLCanonicalizer* to BasicURLCanonicalizer* and DefaultIA*Canonicalizer* to AggressiveIA*Canonicalizer* to better reflect their roles, deprecating the old class names. Elaborate on javadoc for BasicURLCanonicalizer. Remove scheme-lowercasing from BasicURLCanonicalizer. Add rule to IAURLCanonicalizer to support scheme-lowercasing, and add the rule to AggressiveIACanonicalizerRules. Add new OrdinaryIAURLCanonicalizer for non-aggressive canonicalization and a few tests in OrdinaryIAURLCanonicalizerTest.
Noah Levitt
about 11 years ago
add other unicode line terminators to STRAY_SPACING regex
Noah Levitt
about 11 years ago
Merge github.com:internetarchive/archive-commons
Ilya Kreymer
about 11 years ago
archive-commons: Fix bug in SummaryBlockIterator that would reinit block over and over again without use! Not actually leaking, but inefficient nevertheless!
Ilya Kreymer
about 11 years ago
treat pct-encoded strings as encoded utf-8 bytes; encode unicode as pct-encoded utf-8; when decoding pct-encoded, if not valid utf-8, leave undecoded
Noah Levitt
about 11 years ago
avoid NPE when url scheme is null, such as with "opaque" dns urls
Noah Levitt
about 11 years ago
handle urls with uppercase letters in scheme
Noah Levitt
about 11 years ago
archive-commons: Abstracted out HTTPSeekableLineReaders into different possible implementations, currently supporting Apache 3.1 and Java URLConnection.. possible to add (HttpClient 4.x) as well. HTTPSeekableLineReader.getHttpFactory() returns the actual instance, default is HttpClient 3.1 as before.
Ilya Kreymer
about 11 years ago
HTTPSeekableLineReader: log connection pool use if FINER logging setting set, also set timeout for manager getting new connections. DateFilter: Support an empty filter (accept all). CDXMapper: Fix url->surt cdx conversion for cdxs that have hostname as 3rd field, if so, treat http:// + cdx key as original url
Ilya Kreymer
about 11 years ago
ack! accidentally not setting maxTotalConnections on apache!! Huge fix
Ilya Kreymer
about 11 years ago
slr: more fixes to slr classes, null out streams on close
Ilya Kreymer
about 11 years ago
zip blockloading: various fixes, support for turning off stale checking, nio stream improvements
Ilya Kreymer
about 11 years ago
turn off mmap for NIO for now, minor fixes to line readers, null out raf
Ilya Kreymer
about 11 years ago
archive-commons: refactored some archive-commons, blockloader has ThreadLocal storage of all readers, closed at end by wayback
Ilya Kreymer
about 11 years ago
fix typo
Ilya Kreymer
about 11 years ago
additional exception capturing leak detection, close SLR when ioexception occurs, then rethrow
Ilya Kreymer
about 11 years ago
more exception handling in HTTPSeekableLineReader
Ilya Kreymer
about 11 years ago
add get header function
Ilya Kreymer
about 11 years ago
bufferFully support in SeekableLineReader
Ilya Kreymer
about 11 years ago
add buffering support to all SeekableLineReaders
Ilya Kreymer
about 11 years ago
fixes: add catch around multi-iterators so that errors in one don't necessarily disable the whole iterator
Ilya Kreymer
about 11 years ago
slr: add custom input stream to httpseekablelinereader to abort on close if not fully read
Ilya Kreymer
about 11 years ago
archive-commons: Some refactoring of the SeekableLineReader classes (to be renamed) to support generic reading from inputstream, moved common classes to base. When using line buffering iterator, buffer on load
Ilya Kreymer
about 11 years ago