Notice!
PyPM is being replaced with the ActiveState Platform, which enhances PyPM’s build and deploy capabilities.
Create your free Platform account
to download ActivePython or customize Python with the packages you require and get automatic updates.
Download
ActivePython
INSTALL>
pypm install mwtextextractor
How to install mwtextextractor
- Download and install ActivePython
- Open Command Prompt
- Type
pypm install mwtextextractor
| Python 2.7 | Python 3.2 | Python 3.3 |
---|
Windows (32-bit) | | | |
---|
Windows (64-bit) | | | |
---|
Mac OS X (10.5+) | | | |
---|
Linux (32-bit) | | | |
---|
Linux (64-bit) | | | |
---|
Lastest release
version 0.1 on May 23rd, 2013
mwtextextractor extracts simple body text from MediaWiki wikitext by stripping off templates, html tags, tables, headers, etc.
The extracted text can be used for word counting.
Example:
System Message: ERROR/3 (<string>, line 15)
Unknown directive type "code-block".
.. code-block:: python
from mwtextextractor import get_body_text
print get_body_text('Lorem {{ipsum}} dolor')