openhub.net
Black Duck Software, Inc.
Black Duck Open Hub
Follow @
OH
Sign In
Join Now
Projects
People
Organizations
Tools
Blog
BDSA
Projects
People
Projects
Organizations
Forums
C
crawler4j
Settings
|
Report Duplicate
4
I Use This!
×
Login Required
Log in to Open Hub
Remember Me
Inactive
Commits
: Listings
Analyzed
3 days
ago. based on code collected
3 days
ago.
May 30, 2023 — May 30, 2024
Showing page 15 of 18
Search / Filter on:
Commit Message
Contributor
Files Modified
Lines Added
Lines Removed
Code Location
Date
Issue 225: Meta refresh does not work correctly ?
Avi Hayun
More...
almost 10 years ago
Issue 297: Add tag name to WebUrl
Avi Hayun
More...
almost 10 years ago
Issue 295: Add meta tags into the parsed html object
Avi Hayun
More...
almost 10 years ago
Issues 293 & 294
Avi Hayun
More...
almost 10 years ago
Issue 205: Removed eclipse generated files from the repository & updated gitignore
Avi Hayun
More...
almost 10 years ago
Issue 133 & Issue 160
Avi Hayun
More...
almost 10 years ago
Fixed bad case of lowercasing the URL, which is wrong as URLs are case sensitive
Avi Hayun
More...
almost 10 years ago
Removed unneeded code
Avi Hayun
More...
almost 10 years ago
Issue 291: HtmlParseData should hold a unique list of URLs
Avi Hayun
More...
almost 10 years ago
Issue 290: We should support all redirect status codes
Avi Hayun
More...
almost 10 years ago
Issue 273: Tabbing looks messed up in several places
Avi Hayun
More...
almost 10 years ago
Issue 236: Please default includeHttpsPages to true
Avi Hayun
More...
almost 10 years ago
Issue 289: Parsing a binary content shouldn't throw a general parsing error
Avi Hayun
More...
almost 10 years ago
Issue 285: WebURL.java causes IndexOutOfBoundException Issue 206: StringIndexOutOfBoundsException in WebURL
Avi Hayun
More...
almost 10 years ago
Issue 288: Upgrade Unit Tests to v4
Avi Hayun
More...
almost 10 years ago
Issue 282: Add CHANGES.TXT with the changelog to the root
Avi Hayun
More...
almost 10 years ago
Issue 284: Cathing any exception and hidding the log.
Avi Hayun
More...
almost 10 years ago
Issue 251: Fix a typo
Avi Hayun
More...
almost 10 years ago
Issue 279: TikaException is thrown while crawling several PDFs in a row
Avi Hayun
More...
almost 10 years ago
Issue 278: Add hooks in the webcrawler for better error handling Issue 239: Add an option to tweak the URL before processing the page
Avi Hayun
More...
almost 10 years ago
Issue 276: Don't let a crawled URL to be dropped without proper logging.
Avi Hayun
More...
almost 10 years ago
Fixed Issue #139
Avi Hayun
More...
almost 10 years ago
Fixed some github-googlecode changes.
Avi Hayun
More...
almost 10 years ago
Fixes #28 - Added binary content parsing
Avi Hayun
More...
almost 10 years ago
Memory leakage in crawler4j caused by database environment #15
Avi Hayun
More...
almost 10 years ago
Merge branch 'master' of https://github.com/Chaiavi/Crawler4j
Avi Hayun
More...
almost 10 years ago
Upgrade all logging statements to use {} of slf4j #25
Avi Hayun
More...
almost 10 years ago
Upgraded the lists
Avi Hayun
More...
almost 10 years ago
Added "How to use" & "Code examples" sections
Avi Hayun
More...
almost 10 years ago
Added initial guidelines to the work of this prject
Avi Hayun
More...
almost 10 years ago
←
1
2
…
10
11
12
13
14
15
16
17
18
→
This site uses cookies to give you the best possible experience. By using the site, you consent to our use of cookies. For more information, please see our
Privacy Policy
Agree