2
I Use This!
Very Low Activity
Analyzed 1 day ago. based on code collected 1 day ago.

Project Summary

Duke is a fast record linkage and deduplication engine written in Java. It provides both an API and a command-line interface, and supports incremental processing. There is also a genetic algorithm for automatically tuning configurations. Duke is based on Lucene.

Tags

dedup deduplication java recordlinkage recordlinking

Badges

In a Nutshell, Duke (Dupe Killer)...

Apache License 2.0
Permitted

Commercial Use

Modify

Distribute

Place Warranty

Sub-License

Private Use

Use Patent Claims

Forbidden

Hold Liable

Use Trademarks

Required

Include Copyright

State Changes

Include License

Include Notice

These details are provided for information only. No information here is legal advice and should not be used as such.

This Project has No vulnerabilities Reported Against it

Did You Know...

  • ...
    there are over 3,000 projects on the Open Hub with security vulnerabilities reported against them
  • ...
    you can embed statistics from Open Hub on your site
  • ...
    use of OSS increased in 65% of companies in 2016
  • ...
    learn about Open Hub updates and features on the Open Hub blog

Languages

Java
94%
XML
6%
HTML
<1%

30 Day Summary

Aug 12 2025 — Sep 11 2025

12 Month Summary

Sep 11 2024 — Sep 11 2025
  • 0 Commits
    Down -2 (100%) from previous 12 months
  • 0 Contributors
    Down -2 (100%) from previous 12 months