How to install nsi.metadataextractor
- Download and install ActivePython
- Open Command Prompt
- Type: pypm install nsi.metadataextractor
| Platform | Python 2.7 | Python 3.2 | Python 3.3 |
|---|---|---|---|
| Windows (32-bit) | | | |
| Windows (64-bit) | | | |
| Mac OS X (10.5+) | | | |
| Linux (32-bit) | 1.0.2 | | |
| Linux (64-bit) | 1.0.2 | | |
Latest release
version 1.2 on May 9th, 2013
Introduction
nsi.metadataextractor is a metadata extractor for academic (Portuguese-BR) documents like:
- Course Conclusion (ABNT format)
- Event Article
- Periodic Article
Supported extension: .pdf
Setup
pip install nsi.metadataextractor
Example
Python
from nsi.metadataextractor.extractors import tcc, event, periodic
path = "/home/stuff/tccdocument.pdf"
tccextractor = tcc.TccExtractor(path)
eventextractor = event.EventExtractor(path)
periodicextractor = periodic.PeriodicExtractor(path)
tccextractor.all_metadata()
eventextractor.all_metadata()
periodicextractor.all_metadata()
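The calls above handle one file at a time. To process a whole folder, a small helper along these lines may be useful. This is only a sketch: `collect_metadata` and `make_extractor` are hypothetical names, not part of nsi.metadataextractor, and the only assumption about the library is that an extractor is built from a file path and exposes `all_metadata()`, as in the example above. The shape of the value returned by `all_metadata()` is not documented here, so the helper stores it untouched.

```python
import os


def collect_metadata(directory, make_extractor):
    """Run one extractor over every .pdf file found in `directory`.

    `make_extractor` is any callable that takes a file path and returns an
    object with an all_metadata() method (assumed interface, see above).
    Returns a dict mapping each file path to whatever all_metadata() returned.
    """
    results = {}
    for name in sorted(os.listdir(directory)):
        # .pdf is the only supported extension, so skip everything else.
        if not name.lower().endswith(".pdf"):
            continue
        path = os.path.join(directory, name)
        results[path] = make_extractor(path).all_metadata()
    return results
```

With the package installed, it could be called as collect_metadata("/home/stuff", tcc.TccExtractor), passing event.EventExtractor or periodic.PeriodicExtractor instead for the other document types.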
Bash
$ extract_metadata /home/stuff/tccdocument.pdf -t tcc
$ extract_metadata /home/stuff/eventdocument.pdf -t event
$ extract_metadata /home/stuff/periodicdocument.pdf -t periodic