Solutions  >  Toolkits / O.E.M.  >  Toolkits  >  iDRS SDK
 Overview
 Toolkits
     iDRS SDK
     IDR SDK
     IRISFingerprint SDK
     Cardiris SDK
 Contact us

 

iDRS 14 SDK

Recognition, Image preprocessing, Document formatting and more

iDRS™ 14.0 is a compilation of more than 20 years of I.R.I.S. technology and expertise in data recognition and output. All proprietary I.R.I.S. software is based on iDRS™ - it is what we use!

iDRS™ 14.0 includes all our latest advances in preprocessing (deskew, despeckle, binarization…), recognition (OCR, ICR…), formatting (PDF, RTF, PDF/A-1b, XPS…) and hyper-compression (iHQC™, an I.R.I.S. proprietary technology) and will allow developers to create their own specific solutions for the recognition of any kind of document.
The toolkit is compatible with the most commonly used Operating Systems (Windows, Linux, UNIX, Mac OS) using a simple and complete C++ interface.

 

iDRS™ 14.0 is also available for .NET environments ! allowing J#, C# and VB.NET developers to use iDRS™ 14.0 in Microsoft Windows.

iDRS™ 14.0 is a modular toolkit that will fit your project requirements perfectly. You just use what you need.

iDRS™ Global Benefits

iDRS™ 14.0 is the most competitive and fastest professional solution on the market to enable your application with the ability to convert all images into indexed and editable files, and/or provide multiple users with document recognition services.

iDRS™ is…

Fast

The recognition speed of the OCR engine is the fastest technology available on the market. iDRS™ technology embedded in I.R.I.S. products has been used to process hundreds of thousands of documents per day on a single machine!

Easy to install and use

An integrator can create a complete application in a few hours. Just have a look at the many included sample codes and you will be able to integrate all the iDRS™ technologies into your C++, C#, J# or VB.NET applications..
Demo applications are also available as examples showing how the features can be used and integrated.

Versatile - Create your own solution, from entry-level OCR software to a complete document management platform !

Did you see something that caught your attention in one of our products and you would like to develop your own version? Or have you already taken care of the scanning and archiving parts but would like to throw OCR and indexing into the mix? In any case, iDRS™ is all you need.

Feature-rich

iDRS™ is packed with features which will be described later on in this datasheet, but here is a glimpse of what it can bring to your product line:
  • Improve scanned images with powerful image cleaning tools
  •  Deskew text, detect its orientation… and much more
  •  Rename and separate documents automatically by detecting barcode values and blank pages
  •  Powerful recognition engines for languages, barcodes, banking fonts, patch codes…
  •  New OCR engine with 137 recognized languages - unlike our competitors, every language we recognize is fully supported, including a full dictionary.
  •  Index documents, and store the extracted data into a descriptive XML index file
  •  iHQC™ (hyper-compression technology by I.R.I.S.) provides a much higher compression rate than the best available TIFF G4 compression for black and white images.
    However iHQC™ is much more than an image compression, it is a document compression technology that combines off-the-chart rates with perfectly preserved readability. (more details on page XXX)
  •  Convert document images into many formats, among which PDF, XPS (with possible hyper-compression), etc. (see full list on page XXX)
  •  Think about the future : give your customers the ability to convert documents to PDF/A-1b for long-term archiving

A Closer Look at the Modules

iDRS™ is composed of different modules you can pick from to implement into your own application.

Imaging

The imaging module offers support for the compression and decompression of images in various formats. Decompression is to open various image file formats while compression is for embedding images inside your documents.

  • Standard
    This standard Imaging module allows you to open and save the TIFF (CCITT G4)
  • JPEG extension
    Allows you to open and save JPEG.
  • Snowbound, External SDKs
    iDRS™ can use external imaging toolkits to extend its imaging capabilities. They integrate seamlessly into iDRS™ in a “plug and play” fashion. Other external SDKs can also be used. For a full list of formats supported by those SDKs, please refer to their respective documentation.
  • Input file types
    HTML, ODT, DOCX, RTF, PDF, PDF + iHQC, WordML, XPS, XPS + iHQC


Pre-processing

Provides image pre-processing capabilities allowing image enhancement to increase the accuracy and interpretability of the image resulting in optimal recognition results.

  • Standard pre-processing 
    The standard pre-processing module offers several image enhancement techniques to improve the OCR result:, …
    • Deskewing : remove any skew of the text in the image up to an angle of 10.5°
    • Orientation detection: Automatic rotation and straightening of the text orientation contained in your documents, by 90°, 180° or 270°.
    • Despeckling : remove noisy pixels with customizable cleaning tools
    • Inversed Despeckling: remove white noise from a black background
    • Smoothening : flatten out the colors of the image, which can for example remove the JPEG compression artifacts and help recognition
    • Adaptive and threshold based binarization
    • Pixel inversion, etc.
  • Advanced pre-processing 
    The advanced pre-processing module offers more advanced pre-processing capabilities: color dropout after binarization, advanced despeckling, dark border removal, line removal and interpolation.

Text Recognition

iDRS™ enjoys the most productive optical recognition technology, and includes various recognition engines for machine print (OCR, MICR), handprint (ICR) and barcode recognition.

Recognition engine and full page OCR
iDRS™ includes the latest I.R.I.S. recognition engine and formatting technologies. Page analysis is carried out on the color image, resulting in better zone segmentation. This new engine also recognizes the color of the text and keeps it in the output file.

  • OCR 
    The IRIS OCR engine currently supports 137 languages under which all Latin languages, Greek, Cyrillic, etc. 
    Add-ons offer recognition of Asian languages, Arabic, Hebrew and Farsi. (see below)
    Next to pure recognition the OCR module also offers page analysis, access to the recognized data and formatting information, possibility to load user lexicons and many other recognition related functions such as: 
    • Page-layout analysis: automatic detection of text, graphic, table and barcode zones.
    • Detailed recognition information: access to the character confidence level, alternative solutions, the coordinates of each character…
    • User Lexicon support: a user lexicon containing specific terms can be loaded into the engine. This is very useful for the recognition of very specific information like for example pharmaceutical names.
    • Formatting information: paragraph attributes, typefaces, font size, typestyle, color, drop capital.
    • Character recognition: recognition of ligatures ("joined" characters), broken and degraded characters (6 to 72 point size)
    • Multilingual recognition: combine up to 5 languages on the same document (selective combinations)
    • Dot Matrix
    • 137 recognized languages:
      American English, British English, Afrikaans, Albanian, Aymara, Balinese, Basque, Bemba, Bikol, Bislama, Brazilian, Breton, Bulgarian, Byelorussian, Catalan, Cebuano, Chamorro, Corsican, Croatian, Czech, Danish, Dutch, Esperanto, Estonian, Faroese, Fijian, Finnish, French, Frisian, Friulian, Galician, Ganda, German, Greek, Greenlandic, Haitian (Creole), Hani, Hiligaynon, Hungarian, Icelandic, Ido, Ilocano, Indonesian, Interlingua, Irish (Gaelic), Italian, Javanese, Kapampangan, Kicongo, Kinyarwanda, Kurdish, Latin, Latvian, Lithuanian, Luxemburg, Macedonian, Madurese, Malagasy, Malay, Maltese, Manx (Gaelic), Maori, Mayan, Minangkabau, Nahuatl, Norwegian, Numeric, Nyanja, Nynorsk, Occitan, Pidgin English, Polish, Portuguese, Quechua, Rhaeto-Roman, Romanian, Rundi, Russian, Samoan, Sardinian, Scottish (Gaelic), Serbian, Serbian (Latin), Shona, Slovak, Slovenian, Somali, Sotho, Spanish, Sudanese, Swahili, Swedish, Tagalog, Tahitian, Tok Pisin, Tonga, Tswana, Turkish, Ukrainian, Waray, Wolof, Xhosa, Zapotec, Zulu, Bulgarian - English, Byelorussian - English, Greek - English, Macedonian - English, Russian - English, Serbian - English, Ukrainian - English, Hebrew - Simplified Chinese (optional), Traditional Chinese (optional), Japanese (optional), Korean (optional), Arabic and Farsi (optional) + Moldovan, Bosnian (Cyrillic and Latin), Tetum, Swiss-German and Kazak.
  • Asian add-on
    The Asian add-on extends the OCR languages with 4 additional languages: Traditional Chinese, Simplified Chinese, Japanese and Korean.
  • Hebrew add-on
    The Hebrew add-on extends the OCR languages with the Hebrew language.
  • Arabic & Farsi add-on
    Offers recognition of Arabic and Farsi.
  • ICR
    I.R.I.S. has developed its own Intelligent Character Recognition (ICR) engine for all Latin based languages. Supports spaced and even touching handprint.
  • Banking Fonts - MICR
    The following banking fonts can be recognized: OCR-A, OCR-B, E13B and CMC-7.
  • Define a User Lexicon
  • Fine-tune the Recognition Vs. Speed ratio
  • Choose to apply recognition to selected pages only
  • Indexing
    • Generate a full-page text index for every image
    • Recognize the value of barcodes
    • Export that information as an index file in XML format
    • Use the indexed information for file renaming

Barcode Recognition
  • Barcodes
    • 1D - The barcode module will identify any barcode located anywhere on a page. 24 different 1-dimensional barcodes are supported covering the most popular ones as code 39, EAN, Code 128, etc.
    • 2D - The IRIS barcode engine also supports the recognition of PDF 417 and QR barcodes. 2-dimensional barcodes containing much more information than the 1-dimensional barcodes.
    • Full list : Codabar, Code 128, Code 39, Code 39 extended, Code 39 HIBC, Code 93, Datalogic 2 of 5, Discrete 2 of 5, EAN-13, EAN-2, EAN-5, EAN-8, IATA 2 of 5, Interleaved 2 of 5, MSI pharmaceutical, MSI-Plessey, PDF-417, PostNet, PostNet 32, PostNet 52, PostNet 62, QR Code, Reverse 2 of 5, UCC-128, UPC-A, UPC-E and UPC-E extended.
  • Fine-tune the Recognition Vs. Speed ratio
  • Choose to apply recognition to selected pages only
  • Indexing
    • Generate a full-page text index for every image
    • Recognize the value of barcodes
    • Export that information as an index file in XML format
    • Use the indexed information for file renaming

Formatting

The formatting module allows the developer to export the recognized data to various output formats. Several formatting options are available: retain formatting information or not, retain text color, set page size and many more:

  • Standard 
    The following output formats are available in this module: PDF (text, image-text, text-image and image), PDF/A-1b, HTML, XML, RTF, TXT, UNICODE, UTF-8, ODT, WordML, SpreadsheetML and CSV. 
  • Advanced 
    This module offers additional compressed output formats like for example PDF hyper-compressed with iHQC™. (see page XXX)
  • Full list of output formats
    • PDF (4 different file modes: Image, Image-Text, Text, Text-Image, with many options: PDF/A-1b compliance for long-term archiving, certification, encryption)
    • PDF with iHQC™ hyper-compression (3 different compression levels, many options: PDF/A-1b compliance, certification, encryption)
    • XPS (4 different file modes: Image, Image-Text, Text, Text-Image)
    • XPS with iHQC™ hyper-compression (3 different compression levels, many options)
    • Common file formats such as TXT, RTF, DOC, OpenDocument text, HTML, XML, WordML, SpreadsheetML
    • Image file formats such as TIFF (single or multi-page), BMP, JPEG, JPEG2000, or PDF image (also available with iHQC™ hyper-compression)
  • The iDRS™ engine offers you the possibility to activate multiple output formats simultaneously. This allows you for instance to convert an image into PDF/A-1b for long-term archiving, and at the same time to PDF with iHQC™ hyper-compression for sharing by e-mail within your organization.
  • Various settings specific to the output format are also available, for instance layout retention, document protection by password, electronic signature with or without embedded fonts, etc.

iHQC™: Intelligent High Quality Compression Technology

iHQC™ is the revolutionary hyper-compression technology from I.R.I.S., which will dramatically reduce the size of your PDF, PDF/a and XPS files (color or black and white) while maintaining perfect text legibility. iHQC™ generates better quality than JPEG 2000 with much smaller files.

Learn more about iHQC

A multi-platform toolkit

iDRS™ has been developed on a multitude of platforms (Windows, Mac and Linux). All have the same API, which means you could write your code once and simply recompile on the different platforms.

I.R.I.S. is working on extending the number of supported platforms. As their number is continuously growing, feel free to contact us for more details.

System Requirements and Platforms

Development Environments:

  • Windows - Microsoft Visual Studio
  • Mac – xcode
  • Linux - gcc

OS supported:
iDRS is built in such a way as to reduce OS dependencies to an absolute minimum. It works on all current desktop or server declinations of Windows, Mac and Linux OS. This does not however cover embedded versions such as Windows CE.

  • Windows 32 bit – Windows 98, ME, NT, 2000, XP, 2003, 2008…
  • Windows 64 bit – (note the SDK is not native 64 bit)
  • Mac 32 bit –
  • Mac 64 bit (note the SDK is native 64 bit) – Snow Leopard
  • Linux 32 bit –
  • Linux 64 bit –

Miscellaneous

The size of the images the iDRS™ OCR engine is able to recognize are 4961 x 7016 or 5100 x 6600 pixels at all resolutions, which is equivalent to:
    • A4 – 600 dpi
    • A2 – 300 dpi
    • A0 – 150 dpi

Download datasheet
Trial version