-
User
-
Authentication
-
Registration
-
New password
-
Admin
-
Confirm registrations
-
Set a user's document conversion rate
-
Delete user
-
Edit User
-
System log viewer
-
Manager
-
Receives conversion jobs
-
Job list
-
Handles job distribution to queues
-
PathFinder
-
Determines which conversion path to take
-
DocxConversion
-
Converts documents to DocX format
-
WpPdfConversion
-
Converts word processing documents directly to PDF
-
NlmxmlConversion
-
Uses meTypeset to perform part of the NLM XML conversion
-
Cermine
-
Uses CERMINE to perform part of the NLM XML conversion
-
ReferenceConversion
-
Parses references from DocX document into a seperate XML file
-
BibtexConversion
-
Converts references from the previous step into Bibtex
-
BibtexreferenceConversion
-
Converts Bibtex references into NLMXML and merges the converted references into the NLMXML document
-
HtmlConversion
-
Converts the NLMXML document into HTML
-
CitationStyleConversion
-
Formats the citations in the HTML document according to the citationstyle requested by the user
-
PdfConversion
-
Converts the HTML document into PDF
-
XmpConversion
-
Adds an XMP sidecar with metadata from the NLMXML to the PDF document
-
EpubConversion
-
Converts the NLMXML document into Epub format
-
MergeXMLOutputs
-
Combines the CERMINE and meTypeset outputs (as modified by BibTeX processing, if appropriate) into a better NLM XML document
-
ZipConversion
-
Zips all documents
-
API
-
Simple REST API to submit and retrieve jobs and to provide functionality for the frontends AJAX callbacks.
-
Apache mod_headers needs to be installed and enabled
-
Java VM needs to be installed
-
citation parsing has a variety of requirements, as listed in the ParsCit documentation
-
xml2bib needs to be installed
-
Pandoc & libghc-citeproc-hs-data needs to be installed
-
The XMP conversion needs Exiftool to be installed
-
The docX conversion needs LibreOffice with unoconv installed. The server is tested to work with LibreOffice 4.2.4.
wget http://download.documentfoundation.org/libreoffice/stable/4.2.4/deb/x86_64/LibreOffice_4.2.4_Linux_x86-64_deb.tar.gz tar -xzf LibreOffice_4.2.4_Linux_x86-64_deb.tar.gz rm -f LibreOffice_4.2.4_Linux_x86-64_deb.tar.gz sudo dpkg -i LibreOffice_4.2.4.2_Linux_x86-64_deb/DEBS/*.deb rm -rf LibreOffice_4.2.4.2_Linux_x86-64_deb
-
The PHP5 XSL module must be installed.
-
Copy the source
# git clone https://github.com/pkp/xmlps.git # cd xmlps
-
Install the dependencies
# php composer.phar self-update # php composer.phar install
-
Ensure there is a MySQL database called
xmlps
, and a MySQL user with access to it. -
Configure the environment:
-
Copy the
local.php
template, then edit it:# cp config/autoload/local.php.dist config/autoload/local.php
-
Provide the MySQL user and password.
-
If you want debug messages to be displayed to the user (i.e., if developing locally), set these options:
'view_manager' => array( 'display_not_found_reason' => true, 'display_exceptions' => true, ),
-
If using a version of
unoconv
installed via package manager (such as apt), you may need to override the command, e.g.:'conversion' => array( 'docx' => array( 'unoconv' => array( 'command' => 'unoconv', ), ), ),
-
-
Make sure the cache directories are writable by your Web server process, and empty the cache directory, e.g:
# chmod -R go+w var # rm var/cache/zfcache-ea/*
-
Initialize the database (you may need to manually create an 'xmlps' database first)
# vendor/doctrine/doctrine-module/bin/doctrine-module orm:schema-tool:update --force
-
Run the shell script that starts the conversion queues
# ./start_queues.sh
After a successful installation the unit tests should complete without errors:
# ./unittest.sh
You will need to re-empty the cache directories afterward, if your Web server runs as a different user than you.
-
SASS compilation, CSS and Javascript compression & unification is done using Guard (http://guardgem.org)
-
After making changes to Javascript (javascript/) or style files (style/scss/) recompile/recompress the style and Javascript files by running
# guard
There is a simple REST API available to submit, view and retrieve jobs from/to the server.
Submit
Submit a job to the server. The citationStyleHash
is an internal identifier
for the requested citaton style. A list of hashes can be retrieved through the
citationStyleList API. The API will return the job id which can be used to
retrieve the completed job later or to query the server for the job status.
- URL:
api/job/submit
- Request type:
POST
- Parameters:
email
password
fileName
fileContent
citationStyleHash
E.g.:
http://example.com/api/job/submit
POST parameters:
'email' => 'user@example.com'
'password' => 'passowrd'
'fileName' => 'document.docx'
'citationStyleHash' => 'c6de5efe3294b26391ea343053c19a84',
'fileContent' => '...'
Example response:
{"status":"success","id":123}
Status
Returns the current status for a job. Only completed jobs can be retrieved from the server. A full list of statuses can be found here.
- URL:
api/job/status
- Request type:
GET
- Parameters:
email
password
id
E.g.:
http://example.com/api/job/status?email=user@example.com&password=password&id=123
Example response:
{"status":"success","jobStatus":0,"jobStatusDescription":"Pending"}
Citation Style List
Returns a list of available citation styles and their internal ids. We support all citation styles from citationstyles.org.
- URL:
api/job/citationStyleList
- Request type:
GET
E.g.:
http://example.com/api/job/citationStyleList
Example response:
{"status":"success","citationStyles":{"c6de5efe3294b26391ea343053c19a84":"ACM SIG Proceedings (\u0022et al.\u0022 for 15+ authors)"...
Retrieve
Retrieve a converted document. The jobConversionStage parameter specifys which type of conversion you want to get retrned. A full list of conversion stages can be found here.
- URL:
api/job/retrieve
- Request type:
GET
- Parameters:
email
password
id
conversionStage
E.g.:
http://example.com/api/job/retrieve?email=user@example.com&password=password&id=123&conversionStage=10
Example response:
The requested document or a JSON string with an error message.