fixed illegal codepoint cleanup |
|
almost 10 years ago
|
fixed the most annoying bug in TTrNormalizer |
|
almost 10 years ago
|
fixed two stupid loop counter bugs in TrUtf8Filter |
|
almost 10 years ago
|
minimal improvement in TrTextAssessmentMulti |
|
almost 10 years ago
|
sync |
|
almost 10 years ago
|
host and TLD extraction plus some final touches |
|
almost 10 years ago
|
sync |
|
almost 10 years ago
|
finished TODO for behindthecow |
|
almost 10 years ago
|
almost done with Unicode/UTF-8 cleanups |
|
almost 10 years ago
|
finished normalizer/UTF-8 cleaner |
|
almost 10 years ago
|
bug fixes |
|
almost 10 years ago
|
adding ISO boilerplate MLP |
|
almost 10 years ago
|
NFC normalizer working |
|
almost 10 years ago
|
adding minimal ICU normalization wrapper |
|
almost 10 years ago
|
fixed a major bug in arc position recording (int overflow) |
|
almost 10 years ago
|
cleanup |
|
almost 10 years ago
|
refactoring TTrArcReader done; starting TTrWarcReader |
|
almost 10 years ago
|
more towards the end of refactoring TTrArcReader for better TTrReader reusability |
|
almost 10 years ago
|
in the middle of refactoring TTrArcReader for better TTrReader reusability |
|
almost 10 years ago
|
modified reader architecture for alternative reader classes (WARC) |
|
almost 10 years ago
|
polishing text assessment |
|
almost 10 years ago
|
TTrTextAssessmentMulti complete, but untested |
|
almost 10 years ago
|
begin multi-language changes for CommonCrawl data |
|
almost 10 years ago
|
sync |
|
almost 11 years ago
|
fixed Makefile |
|
almost 11 years ago
|
fixed Makefile |
|
almost 11 years ago
|
fixed Makefile |
|
almost 11 years ago
|
completely changed cowinterleave |
|
almost 11 years ago
|
rule 1: never commit without making first |
|
almost 11 years ago
|
minor fixes to cowinterleave |
|
almost 11 years ago
|