Overview

Request 686071 accepted

- update to 0.4.0:
* Bump model version and fix typo
* Add HashingAnnoy model
* Add hashing\_nn benchmark in doc string
* Add HashingApproximateNeighbors model
* Implement iterator interface for file-like objects
* Refactor TokenizerTests
* Provide a bit more info about timings of the training
* Remove support for bag-of-words\_lshf
* Don't store duplicate data in model
* Fix heat\_uuid regexp formatting
* Relax digits\_re again a bit
* Vectorizer optimisation: don't do word analysing
* debug\_lineprocess: Handle more than one input file
* debug\_lineprocess: Format output slightly nicer and remove duplicates
* Tighten heat\_uuid regexp
* Tighten length-based regexp matches properly
* debug\_lineprocess add some simple word / token statistics
* Blacklist .xml extension
* Use for loop instead of handcrafted while construct
* tests: use free tcp port for gearman server
* Add --model-type argument to top-level command
* tokenizer: remove sshd warnings
* Make debugging scripts callable again
* Reduce code duplication a bit
* Micro-optimize the tokenization
* ci: enable gate jobs
* Make systemd service file SCL independent
* Transition webui related files to the log-classify name
* Match uuid\_re before heat\_re

Request History
Dirk Mueller's avatar

dirkmueller created request

- update to 0.4.0:
* Bump model version and fix typo
* Add HashingAnnoy model
* Add hashing\_nn benchmark in doc string
* Add HashingApproximateNeighbors model
* Implement iterator interface for file-like objects
* Refactor TokenizerTests
* Provide a bit more info about timings of the training
* Remove support for bag-of-words\_lshf
* Don't store duplicate data in model
* Fix heat\_uuid regexp formatting
* Relax digits\_re again a bit
* Vectorizer optimisation: don't do word analysing
* debug\_lineprocess: Handle more than one input file
* debug\_lineprocess: Format output slightly nicer and remove duplicates
* Tighten heat\_uuid regexp
* Tighten length-based regexp matches properly
* debug\_lineprocess add some simple word / token statistics
* Blacklist .xml extension
* Use for loop instead of handcrafted while construct
* tests: use free tcp port for gearman server
* Add --model-type argument to top-level command
* tokenizer: remove sshd warnings
* Make debugging scripts callable again
* Reduce code duplication a bit
* Micro-optimize the tokenization
* ci: enable gate jobs
* Make systemd service file SCL independent
* Transition webui related files to the log-classify name
* Match uuid\_re before heat\_re


Saul Goodman's avatar

licensedigger accepted review

ok


Factory Auto's avatar

factory-auto added opensuse-review-team as a reviewer

Please review sources


Factory Auto's avatar

factory-auto accepted review

Check script succeeded


Jan Engelhardt's avatar

jengelh accepted review


Staging Bot's avatar

staging-bot added openSUSE:Factory:Staging:adi:37 as a reviewer

Being evaluated by staging project "openSUSE:Factory:Staging:adi:37"


Staging Bot's avatar

staging-bot accepted review

Picked openSUSE:Factory:Staging:adi:37


Staging Bot's avatar

staging-bot accepted review

ready to accept


Staging Bot's avatar

staging-bot approved review

ready to accept


Dominique Leuenberger's avatar

dimstar_suse accepted request

Accept to openSUSE:Factory

openSUSE Build Service is sponsored by