Convert PDF to HTML, Attach Large Files & Text Extraction from PDF

What’s New in this Release?

The latest version of Aspose.Pdf for .NET (6.4.0) has been released. This release adds conversion of PDF files into HTML format. Prior to this release, Aspose.Pdf for .NET supported converting images, XML, HTML, SVG, PCL and XSL-FO files into PDF format, but not the other way around. With this new feature, a single product can convert HTML files into PDF and transform PDF files into HTML. It also allows users to extract non-English text such as Arabic and Hebrew, and users can now specify custom text and image files when adding signatures to a PDF document. The product is faster and more stable than earlier releases. This release also improves the handling of large files: users can now add files larger than 1GB as attachments to PDF files and concatenate large PDF documents with confidence. This release includes plenty of new and improved features, as listed below.

  • Convert PDF to HTML
  • Add Custom Image/Text to PDF Signature
  • Implement support of ActualText in marked contents inside Contents stream for text extraction
  • Get PDF security settings
  • Enhanced Unified document open and save method
  • PDF to JPEG conversion is improved
  • Optimized image loading
  • Enhanced Text extraction from PDF
  • Implement a quick way to append new operators to contents
  • Control the space between text underline and overline
  • Custom data set via the indexer property not being applied is resolved
  • Exception generated when filling a form field is resolved
  • InvalidOperationException occurring during license setting is resolved
  • Font and text size issue while filling the PDF form is resolved
  • Output view issue is resolved with latest version of Reader
  • Missing text issue when using PdfExtractor to extract text is resolved
  • Improved DecodePage method
  • Missing characters in the decoded image are resolved
  • A background image can now be set in a scanned document
  • EncryptFile now applies security to individual PDFs in portfolio PDF
  • Password is now applied successfully on the output PDF
  • Text stamp forecolor is corrected in the output file
  • Inverted Hebrew text and numbers are now corrected in extracted text
  • Lost watermark issue on the output TIFF image is resolved
  • Extracted Arabic text is now readable
  • Thicker lines issue in the printed output is resolved
  • Missing non-English characters in the converted images are resolved
  • PdfViewer can now print PDF to XPS in evaluation mode
  • Gray squares issue is resolved on the output image
  • Table column missing issue is resolved during HTML to PDF conversion
  • Enhanced Image extraction
  • Empty PDF document produced after using PDF File editor is fixed

Other recent bug fixes are also included in this release.

Newly added documentation pages and articles

New tips and articles have been added to the Aspose.Pdf for .NET documentation to briefly guide you in using Aspose.Pdf for various tasks.

Overview: Aspose.Pdf for .NET

Aspose.Pdf is a .NET PDF component for creating and manipulating PDF documents without using Adobe Acrobat. It creates PDFs through its API, XML templates and XSL-FO files. It supports form field creation, PDF compression options, table creation and manipulation, graph objects, extensive hyperlink functionality, extended security controls, custom font handling, adding or removing bookmarks, TOC, attachments and annotations, importing or exporting PDF form data, and much more. It also converts HTML, XSL-FO and MS Word documents to PDF.

More about Aspose.Pdf for .NET

Contact Information

Aspose Pty Ltd, Suite 163,
79 Longueville Road
Lane Cove, NSW, 2066
Australia
Aspose – Your File Format Experts
[email protected]
Phone: 888.277.6734
Fax: 866.810.9465

About the author

By sherazam