Module for splitting text into sentences.
The 'Lingua::EN::Sentence' module contains the function get_sentences,
which splits text into its constituent sentences, based on a regular
expression and a list of abbreviations (built in and given).
Certain well know exceptions, such as abreviations, may cause incorrect
segmentations. But some of them are already integrated into this code and
are being taken care of. Still, if you see that there are words causing the
get_sentences() to fail, you can add those to the module, so it notices
them.
- Developed at devel:languages:perl
- Sources inherited from project openSUSE:Factory
-
3
derived packages
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout openSUSE:Factory:PowerPC/perl-Lingua-EN-Sentence && cd $_
- Create Badge
Refresh
Refresh
Source Files
Filename | Size | Changed |
---|---|---|
Lingua-EN-Sentence-0.27.tar.gz | 0000008942 8.73 KB | |
perl-Lingua-EN-Sentence.changes | 0000001012 1012 Bytes | |
perl-Lingua-EN-Sentence.spec | 0000002410 2.35 KB |
Revision 12 (latest revision is 17)
Dominique Leuenberger (dimstar_suse)
accepted
request 296654
from
Stephan Kulow (coolo)
(revision 12)
1
Comments 0