Skip to main content
Warning: You are using the test version of PyPI. This is a pre-production deployment of Warehouse. Changes made here affect the production instance of TestPyPI (testpypi.python.org).
Help us improve Python packaging - Donate today!

Pure-Python implementations of the Snowball stemmers

Project Description

The traditional way of using the Snowball stemmers in Python is via the pystemmer package, which provides a Python wrapper around the Snowball C library. However, Python C extensions are problematic in some environments. Therefore, this package provides pure-Python implementations of the Snowball stemming algorithms.

The implementations of the stemming algorithms is translated from the Snowball language to Python via sbl2py.

Usage

Usually, you’ll prefer to use the pystemmer module whenever that is possible, because it’s much faster than purestemmer:

try:
    import Stemmer
except ImportError:
    # pystemmer is not available, use purestemmer instead
    import purestemmer as Stemmer

Since purestemmer has the same public API and provides the same algorithms as pystemmer, there should be no need to change any code when switching between pystemmer and purestemmer like this.

Please see the pystemmer documentation for details on how to use the stemming algorithms.

Differences between purestemmer and pystemmer

  • purestemmer has only been tested on Python 2.7
  • purestemmer.Stemmer instances are thread-safe
  • purestemmer is on average about 100x slower than pystemmer

License

purestemmer itself is covered by the MIT License. The underlying Snowball algorithms are covered by the BSD-3 License. Please see the LICENSE file for details.

Release History

Release History

This version
History Node

0.2.0

History Node

0.1.0

Download Files

Download Files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

File Name & Checksum SHA256 Checksum Help Version File Type Upload Date
purestemmer-0.2.0.tar.gz (77.9 kB) Copy SHA256 Checksum SHA256 Source Aug 5, 2014

Supported By

WebFaction WebFaction Technical Writing Elastic Elastic Search Pingdom Pingdom Monitoring Dyn Dyn DNS Sentry Sentry Error Logging CloudAMQP CloudAMQP RabbitMQ Heroku Heroku PaaS Kabu Creative Kabu Creative UX & Design Fastly Fastly CDN DigiCert DigiCert EV Certificate Rackspace Rackspace Cloud Servers DreamHost DreamHost Log Hosting