4
I Use This!
Inactive

Commits : Listings

Analyzed 3 days ago. based on code collected 3 days ago.
May 30, 2023 — May 30, 2024
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
Issue 225: Meta refresh does not work correctly ? More... almost 10 years ago
Issue 297: Add tag name to WebUrl More... almost 10 years ago
Issue 295: Add meta tags into the parsed html object More... almost 10 years ago
Issues 293 & 294 More... almost 10 years ago
Issue 205: Removed eclipse generated files from the repository & updated gitignore More... almost 10 years ago
Issue 133 & Issue 160 More... almost 10 years ago
Fixed bad case of lowercasing the URL, which is wrong as URLs are case sensitive More... almost 10 years ago
Removed unneeded code More... almost 10 years ago
Issue 291: HtmlParseData should hold a unique list of URLs More... almost 10 years ago
Issue 290: We should support all redirect status codes More... almost 10 years ago
Issue 273: Tabbing looks messed up in several places More... almost 10 years ago
Issue 236: Please default includeHttpsPages to true More... almost 10 years ago
Issue 289: Parsing a binary content shouldn't throw a general parsing error More... almost 10 years ago
Issue 285: WebURL.java causes IndexOutOfBoundException Issue 206: StringIndexOutOfBoundsException in WebURL More... almost 10 years ago
Issue 288: Upgrade Unit Tests to v4 More... almost 10 years ago
Issue 282: Add CHANGES.TXT with the changelog to the root More... almost 10 years ago
Issue 284: Cathing any exception and hidding the log. More... almost 10 years ago
Issue 251: Fix a typo More... almost 10 years ago
Issue 279: TikaException is thrown while crawling several PDFs in a row More... almost 10 years ago
Issue 278: Add hooks in the webcrawler for better error handling Issue 239: Add an option to tweak the URL before processing the page More... almost 10 years ago
Issue 276: Don't let a crawled URL to be dropped without proper logging. More... almost 10 years ago
Fixed Issue #139 More... almost 10 years ago
Fixed some github-googlecode changes. More... almost 10 years ago
Fixes #28 - Added binary content parsing More... almost 10 years ago
Memory leakage in crawler4j caused by database environment #15 More... almost 10 years ago
Merge branch 'master' of https://github.com/Chaiavi/Crawler4j More... almost 10 years ago
Upgrade all logging statements to use {} of slf4j #25 More... almost 10 years ago
Upgraded the lists More... almost 10 years ago
Added "How to use" & "Code examples" sections More... almost 10 years ago
Added initial guidelines to the work of this prject More... almost 10 years ago