4
I Use This!
Inactive

Commits : Listings

Analyzed about 22 hours ago. based on code collected 1 day ago.
May 17, 2023 — May 17, 2024
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
Fix bug where metaRefresh was used instead of metaLocation. Copy/paste problem, and a bug where meta http-equiv=refresh was incorrectly parsed: when no URL was specified a NULL URL would be added. More... about 9 years ago
Merge pull request #27 from lutaaya/readme-getOutgoingUrls More... over 9 years ago
Update README to use Set for getOutgoingURLs() More... over 9 years ago
Minor refactoring and improving code styles More... over 9 years ago
Update WebCrawler.java More... over 9 years ago
More comments for changes in the shouldVisit method More... over 9 years ago
Remove emails from javadocs More... over 9 years ago
Update dependencies More... over 9 years ago
[style] Code improvements More... over 9 years ago
[style] re-format code More... over 9 years ago
Merge pull request #23 from smtriplett/master More... over 9 years ago
Update BasicCrawler.java More... over 9 years ago
Add maven assembly plugin More... over 9 years ago
Update README.md More... over 9 years ago
[maven-release-plugin] prepare for next development iteration More... over 9 years ago
[maven-release-plugin] prepare release crawler4j-4.1 More... over 9 years ago
Update pom.xml for github More... over 9 years ago
[maven-release-plugin] prepare release crawler4j-4.1 More... over 9 years ago
Update CHANGES.txt More... over 9 years ago
Create README.md More... over 9 years ago
Issue 281: Upgrade try-catch to java7 "try with resources" More... over 9 years ago
Issue 335: robots.txt isn't crawled More... over 9 years ago
Issue 334: Crawling over disallowed paths from robots.txt More... over 9 years ago
Issue 214: Always log if exception happens + Code Refactoring More... over 9 years ago
Issue329: Cleaning leftovers More... over 9 years ago
Issie329: FileNotFoundException: .m2\repository\edu\uci\ics\crawler4j\4.0\crawler4j-4.0.jar!\tld-names.zip More... over 9 years ago
Fixed Issues: 326, 325, 238 More... over 9 years ago
moved logback to test/resources -> a library should not impose a logging framework It should be a test dependency in this case. More... over 9 years ago
Issue 324: NullPointerException when crawling links with no HREF More... over 9 years ago
Issue 149: Proper compression support in the PageFetcher More... over 9 years ago