Class AbstractExtractionService

  • Direct Known Subclasses:
    ImageExtractionService, PdfExtractionService

    public abstract class AbstractExtractionService
    extends java.lang.Object
    This service is responsible for the manipulation of Comparable documents
    • Constructor Detail

      • AbstractExtractionService

        public AbstractExtractionService()
    • Method Detail

      • getPageDimensionInPt

        public abstract Dimension getPageDimensionInPt​(java.io.File pdfFile,
                                                       int pageId)
                                                throws java.lang.Exception
        Throws:
        java.lang.Exception
      • getPageCount

        public abstract int getPageCount​(java.io.File pdfFile)
                                  throws java.lang.Exception
        Throws:
        java.lang.Exception
      • extractRegions

        public abstract java.util.List<RegionExtractionResult> extractRegions​(java.io.File pdfFile,
                                                                              java.util.List<Region> regions)
                                                                       throws java.io.IOException,
                                                                              net.sourceforge.tess4j.TesseractException
        Throws:
        java.io.IOException
        net.sourceforge.tess4j.TesseractException
      • setLanguage

        public abstract void setLanguage​(java.lang.String lang)