openhub.net
Black Duck Software, Inc.
Black Duck Open Hub
Follow @
OH
Sign In
Join Now
Projects
People
Organizations
Tools
Blog
BDSA
Projects
People
Projects
Organizations
Forums
P
pauldix's basset
Settings
|
Report Duplicate
0
I Use This!
×
Login Required
Log in to Open Hub
Remember Me
Inactive
Commits
: Listings
Analyzed
about 8 hours
ago. based on code collected
about 12 hours
ago.
Apr 19, 2023 — Apr 19, 2024
Showing page 1 of 1
Search / Filter on:
Commit Message
Contributor
Files Modified
Lines Added
Lines Removed
Code Location
Date
fixed tf and tf-idf to be correct (was normalizing the tf when I shouldn't have been. also added boolean and sublinear-tf-idf scoring
Paul Dix
More...
over 14 years ago
added ability to return sparse feature vectors with tf-idf scores
Paul Dix
More...
over 14 years ago
added ability to return tf scores instead of straight counts
Paul Dix
More...
over 14 years ago
changed the global_frequency to be document_frequency to prep for retunring features as tf-idf scores
Paul Dix
More...
over 14 years ago
added method to purge features occuring less than a specified number of times
Paul Dix
More...
over 14 years ago
changed the sparse vector representation to store arrays. the user can convert to whatever format the want
Paul Dix
More...
over 14 years ago
added methods to pull out a raw hash map for serializing the feature map
Paul Dix
More...
over 14 years ago
added functionality to serialize and marshal from json. removed nested feature class and opted for an array (it's hidden behind the feature_collection class anyway)
Paul Dix
More...
over 14 years ago
bumped version for build
Paul Dix
More...
over 14 years ago
made the sparse feature vector calculation not suck so hard
Paul Dix
More...
over 14 years ago
made the output of normalized vectors file include feature and row count as the first line
Paul Dix
More...
over 14 years ago
added row and feature counts to feature_collection
Paul Dix
More...
over 14 years ago
fixed load paths and updated the gemspec with the proper version
Paul Dix
More...
over 14 years ago
added some ugly untested code to do entropy normalization on a set of vectors
Paul Dix
More...
over 14 years ago
made spec a little cleaner
Paul Dix
More...
over 14 years ago
removed normalization from feature collection since it doesn't make sense there
Paul Dix
More...
over 14 years ago
added the feature_collection. wired up basic functionality
Paul Dix
More...
over 14 years ago
changed version to 2.0.0. Makes more sense since this isn't a point release of the previous version
Paul Dix
More...
over 14 years ago
wired up the text parser class
Paul Dix
More...
over 14 years ago
added the skeleton
Paul Dix
More...
over 14 years ago
starting fresh. so deleting all this crap
Paul Dix
More...
over 14 years ago
udpated a comment on the example
Paul Dix
More...
about 15 years ago
wrote a quick example to show usage
Paul Dix
More...
about 15 years ago
* adding in again
Paul Dix
More...
over 15 years ago
removing to readd without trunk dir
Paul Dix
More...
over 15 years ago
first commit
Paul Dix
More...
over 15 years ago
This site uses cookies to give you the best possible experience. By using the site, you consent to our use of cookies. For more information, please see our
Privacy Policy
Agree