Get millions of archive pages accessible online at a reasonable cost thanks to the AureXus solution.
Your organization has a wealth of important information that is
difficult to access. AureXus has created a complete solution allowing
the transformation of your valuable documents into a rich content
accessible online.
1- SCAN
AureXus scans the documents at the customer’s location or at its
processing centers. All type of documents, paper, books, microfilms,
index cards etc…can be processed.
2- OCR OR MANUAL ENTRY
The content is then capture by OCR (Optical character recognition) or manual data entry.
3- FORMATING - Original-like
The information on the pages or index cards is then structured for
encoding: record number, titles, dates, physical description, etc
Quality control
Strict controls are then carried out: checking against the original
document, line by line, character by character. Specialized software
and databases are used to eradicate all errors.
4- TAGGING ENCODING EAD/XML
Our powerful encoding engine is then applied. The automated process of
the specialized software guarantees a high quality of the encoded
end-product. The tagging and encoding that will be done according to
the latest version of the EAD standard and by taking into account the
particular customer’s specifications for each book or finding aid
if necessary.
The encoding is done in two phases:
Phase one consists in structuring the
document (standard EAD), and in checking its contents by elements of
description (title, dates, physical description, etc).
The hierarchy of the levels of description is established. This
hierarchy is open and can comprise an unlimited number of sub levels.
Phase two is the encoding. It is
performed entirely by computer, using the proprietary system that
exports the content of the data base in an XML/EAD file. Not one tag of
the documents is encoded manually, the encoding is entirely automatic.
The
absence of any human intervention in the tagging guarantees a result
without errors and a very high consistence in the style of encoding.
To take into account the style of encoding (beacons, attributes, etc)
specific to each category of documents, the encoding system is
parameterized, adapted, fine tuned and tested at the beginning of each
project. The result is a rich file, with digital documents easily
accessible and ready to be viewed online.
We have developed a strong knowledge in complex project management of digitalization and encoding.
Old documents, manuscripts (Birth certificate, index, etc…)
Documents including one or more type of characters (for example Chinese, Arabic, Japanese and Greek characters), with or without phonetic transcription.
Foreign languages and ancient handwriting
Rare, damaged documents, stained, torn, sometimes requiring electronic cleaning
One of the
reasons of our success in this type of work is our capacity to manage
difficult projects thanks to our experience and know-how, the use of
specialized databases of references and linguistic specialists.
Guaranteed Quality
We are able to guarantee a quality of indexing and perfect capture.
This is possible thanks to redundant controls that we set up. We have
delivered several important projects that required a rate of quality
close to 100%.
Our methodology for the encoding phase brings strong advantages in term
of quality: the encoding is done quickly regardless the size of the
documents and the document xml/ead is generated without error.
Our method avoids any manual intervention during the encoding, using
our software of automated processing; this insures that the validated
rules of encoding at the time of the test phase are rigorously
maintained for the series of the encoded documents belonging to the
same category.
Project Management
For each of our projects we set up a project management process that
will give our customers feedback, sometimes in real time. The customer
can follow the work progress daily. For the remote assistance and
urgency, we set up an Internet hot-line, opened 24h a day necessary,
allowing contact of the project managers as fast as possible.
Volumes
The automation of the processes enables us to ensure large volumes quickly and in a consistent way.
Reliability
The proprietary software ensures a perfect consistency of the
meta-data, allowing efficient and relevant future multi-criteria
research.
Schedule
We guarantee the delivery time. Our centers are equipped with redundant
equipment to avoid any production interruption (Internet, telephone,
electricity) and can run 24 hours a day, 7 days a week.
Costs
Because it is based mainly on automated processes based on reliable
software, the AureXus solution guarantees the best quality at very
competitive prices.