PDF Output

PDF output is a standard function of XSL Formatter V3.4. PDF can be outputted as PDF version 1.3, 1.4 or 1.5. PDF1.3 can be read by Adobe Acrobat 4.0 or later, PDF1.4 can be read by Adobe Acrobat 5.0 or later, PDF 1.5 can be read by Adobe Acrobat 6.0 or later. (Including each Acrobat Reader)

Bookmarks and Links

By setting extended properties inside the original document, bookmarks and links can be created.

Multi separate volume

Possible to output PDF in multi separate volume from one FO.

Document Information

By setting extended properties inside the original document, 'Title','Subject','Author',and 'Keyword' information can be emdeded.

Security Setting

When setting the security, 40-bit RC4(Acrobat3.x,4.x) is applied to the encryption. A higher level encryption, 128-bit RC4(Acrobat5.x) is available with PDF 1.4 or later. It is possible to set user password, master password and security settings for permissions to access the PDF.

PDF Embedding

Possible to embed PDF document in the other PDF.

It is performed by using <fo:external-graphic>.

<fo:external-graphic src="embedded.pdf#page=3"/>

As described above, specify the page number which you want to embed to the URI. When there is no page number specified, the first page will be embedded. When there is no size specified, it's embedded in the page size of the PDF where it's embedded. However if you want to scale the size, content-width or content-height can be specified as follows:

<fo:external-graphic src="embedded.pdf#page=3" content-width="50%"/>

When specifying PDF by utilizing the data scheme (RFC2397), the page number can be specified as the parameter of the media type as follows.

<fo:external-graphic src="data:application/pdf;page=3;base64,JVBERi0xLjQKJeLjz9M..."/>

Moreover, it's also possible to embed PDF as a background. This can be used when making a list form as a background and formatting only the content data on it. When specifying PDF as a background, please specify background-repeat="no-repeat." as follows. background-repeat="repeat" cannot be specified.

<fo:simple-page-master axf:background-image="background.pdf"
                       background-repeat="no-repeat" ...>

The version of embedded PDF must be less than or equal to the version of PDF to output.

A dummy image will be displayed in GUI.

Fonts

Adobe Type 1 fonts (including Adobe Standard 14 fonts), and TrueType fonts (including OpenType fonts with TrueType Outlines), and OpenType fonts (PostScript Outline) are supported for PDF output. Other font formats are not supported. For more details, refer to the "Font Setting".

XSL Formatter V3.4 requires that the fonts, which are specified in documents, are installed on your system in order to use them correctly. Please refer to Windows help or follow the installation instructions attached to the fonts for the method of installing the font in the Windows version. Be sure to select "Copy fonts to Font folder" when you install fonts.

Note for Adobe Type 1 fonts on Windows

Adobe Type 1 fonts are supported only with Windows2000/XP.

These 14 Adobe Type 1 fonts are called Standard 14 Fonts in PDF.

  • Courier
  • Courier-Bold
  • Courier-Oblique
  • Courier-BoldOblique
  • Helvetica
  • Helvetica-Bold
  • Helvetica-Oblique
  • Helvetica-BoldOblique
  • Times-Roman
  • Times-Bold
  • Times-Italic
  • Times-BoldItalic
  • Symbol
  • ZapfDingbats

Even when using an Adobe Type 1 font except for these Standard 14 Fonts, it is not necessary to prepare an AFM (Adobe Font Metrics) file.

The glyph names of Adobe Type 1 fonts maps to character codes (Unicode) of formatting data according to the AGL (Adobe Glyph List) specification. The glyph with a name that is not defined in AGL is not output.

Character Sets, Encoding

The following character sets are supported:

  • Adobe Standard Latin character set
  • Symbol character set
  • ZapfDingbats character set
  • Japanese character set (Adobe-Japan1-Supplement2)
  • Simplified Chinese character set (Adobe-GB1-Supplement2)
  • Traditional Chinese character set (Adobe-CNS1-Supplement0)
  • Korean character set (Adobe-Korea1-Supplement1)

Encoding of all characters is processed as Unicode within XSL Formatter V3.4. In the case of Chinese, Japanese, Korean, (CJK), XSL Formatter V3.4 maps the Unicode to glyph in each CJK character sets by using the following CMap.

  • Japanese : UniJIS-UCS2-H(V) UniJIS-UCS2-HW-H(V)
  • Simplified Chinese : UniGB-UCS2-H(V)
  • Traditional Chinese : UniCNS-UCS2-H(V)
  • Korean : UniKS-UCS2-H(V)

The characters that do not belong to the above character sets are embedded in the PDF by getting the glyphs from the font files. This process is done only for TrueType fonts.

XSL Formatter V3.4 Lite does not support CJK.

Font Embedding

By embedding fonts in PDF, it becomes possible to display PDF files even in the environment where there are no fonts. In XSL Formatter V3.4, this function can be done only for TrueType fonts, Adobe Type 1 font, including OpenType fonts with TrueType Outline.

In the default setting of TrueType font processing, only the outline of glyphs that are not defined by CMap is embedded. In the case where TrueType fonts that are prohibited from embed by font vendor appears, error occurs and processing stops. You can avoid this error by replacing it with a white space and output PDF. You can also specify the option that all glyphs of a font are to be embedded whether the character is defined by CMap or not.

In the default setting of Adobe Type 1 font processing, only the outline of a font that has font specific encoding is embedded. You can also specify the option that all glyphs of a font are to be embedded whether the font has standard encoding or font specific encoding.

XSL Formatter V3.4 does not support treating fonts as external fonts that are not allowed to be embedded.

For more information about font embedding, refer to PDF Setting.

Thai Output

The following True Type fonts are available for Thai output:

  1. Microsoft Sans Serif
  2. Tahoma
  3. Arial Unicode MS (ver0.86 or later)
  4. Angsana New
  5. Angsana UPC
  6. Browallia New
  7. Browallia UPC
  8. Cordia New
  9. Cordia UPC
  10. Dillenia UPC
  11. Eucrosia UPC
  12. Freesia UPC
  13. Iris UPC
  14. Jasmine UPC
  15. Kodchiang UPC
  16. Lily UPC

In Windows 2000, the above fonts from 4 to 12 are added when you select 'Language settings for the system' in the 'Regional Options' in the Control Panel and check Thai. In Windows XP, the above fonts from 4 to 12 are added when you select Supplemental language support in language in the 'Regional and Language Options' in Control Panel and check 'Install files for complex script and right-to-left language [including Thai]'.

Fonts 1 and 3 are available only for Regular type, Font 2 is available for Regular and Bold type. The other fonts are available for Regular, Bold, Italic, BoldItalic type.

In processing Thai fonts, when the string that needs the glyph substitution is detected by checking output script, the glyph substitution is executed.

XSL Formatter V3.4 Lite does not support Thai.

Arabic Output

The following True Type fonts are available for Arabic output:

  1. Arial
  2. Courier New
  3. Microsoft Sans Serif
  4. Tahoma
  5. Times New Roman
  6. Arial Unicode MS

In Windows 2000, the fonts are added when you check Arabic in Language settings for the system in the 'Regional and Language Options' in Control Panel are not supported. Also in Windows XP, the Arabic fonts added when you select Supplemental language support in language in the 'Regional and Language Options' in Control Panel and check 'Install files for complex script and right-to-left language [including Arabic]' are not supported.

Font 3 and 6 are available only for Regular type, the other fonts are available for Regular and Bold type.

In processing Arabic, Arabic script is shaped depending on its position, final, initial, medial or isolated by using Open Type features defined in the above fonts. In addition, the glyph substitution of Ligature, marks is defined, it's also shaped. Supported features are 'init', 'medi', 'fina', 'isol', 'liga', 'mset' for each.

Also, the expansion processing by inserting Kashida is supported. The percentage of Kashida can be controlled by the extension property axf:text-kashida-space.

XSL Formatter V3.4 Lite does not support Arabic.

Hebrew Output

The following True Type fonts are available for Hebrew output:

  1. Arial
  2. Courier New
  3. Lucida Sans Unicode
  4. Microsoft Sans Serif
  5. Tahoma
  6. Times New Roman
  7. Arial Unicode MS
  8. Aharoni
  9. David
  10. David Transparent
  11. FrankRuehl
  12. Levenim MT
  13. Miriam
  14. Miriam Fixed
  15. Fixed Miriam Transparent
  16. Miriam Transparent
  17. Narkisim
  18. Rod
  19. Rod Transparent

In Windows 2000, the above fonts from 8 to 19 are added when you select 'Language settings for the system' in the 'Regional Options' in the Control Panel and check Hebrew. In Windows XP, the above fonts from 8 to 19 are added when you select Supplemental language support in language in the 'Regional and Language Options' in Control Panel and check 'Install files for complex script and right-to-left language [including Hebrew]'.

Fonts 3, 4, 7, 10, 11, 13, 14, 15, 16, 17, 18, 19 are only available for Regular type, Font 8 is only available for Bold type, and the others are available for Regular, Bold, Italic, BoldItalic type.

In processing Hebrew, by checking output script, when the string that needs to be changed to the string defined in Unicode type Hebrew Presentation Forms is detected it is detected and changed.

XSL Formatter V3.4 Lite does not support Hebrew.

Restrictions

  • Thai fonts are always embedded in PDF.
  • OpenType fonts have a functionality called Advanced Tyopographic Extensions, such as Glyph Substitution. XSL Formatter V3.4 supports Glyph substitution ('vert') in vertical writing and 'init', 'medi', 'fina', 'isol', 'liga' 'mset' for Arabic. The fonts that uses this feature except for these might not be processed correctly.
  • In some cases, the position of a vowel is out of position in Arabic and Hebrew.

Image Output

For more information about supported graphic images, refer to the "Graphics".

Vector Images

The following vector images are outputted to PDF as vector primitives are replaced with PDF operators.

  • SVG
  • MathML
  • EMF
  • WMF

In Windows version, vector images other than the above are transformed into the raster images and outputted to PDF. The resolution of the raster can be set in the result PDF with the value of dpi. Please refer to rasterize-resolution in PDF Setting. In non-Windows versions, the vector image which cannot be outputted to PDF is disregarded.

CAUTION: MathML is converted into raster images when XSL Formatter MathML Option is not installed.

Raster Images

Generally the raster graphic data is compressed using a compression algorithms in the graphic file. If the compression method and the original (uncompressed) image format are both compatible with the PDF file format, the compressed raster graphic data will be directly embedded into the PDF file. If the compression method or original image format are not compatible with the PDF format, the graphic data will be uncompressed and converted to a PDF compatible bitmap format for output. If graphic data cannot be uncompressed, its data cannot be processed. Bitmap graphic will be compressed using JPEG or ZLIB compression and embedded into the PDF file.
Refer to image-compression, jpeg-quality attributes in PDF Setting of the option setting file. If raster image data is directly embedded into the PDF file, these attributes will not be applied.

The raster image which can be embedded directly in PDF is as follows.

  • JPEG
  • PNG
  • GIF
  • TIFF
  • JPEG2000

However, there are the following restrictions.

  • Progressive JPEG, Interlaced GIF are transformed into regular JPEG or GIF images.
  • 16-bit color in PNG or TIFF is reduced to a 8-bit color.
  • When alpha channel is attached to PNG or TIFF, it is divided.
  • There are some unsupported TIFF formats.
  • JPEG2000 is embedded into PDF only when it is PDF1.5 or later. For other version, it is embedded after being converted to JPEG etc.

Down sampling

In XSL Formatter V3.4, the down sampling of the raster images embedded in PDF can be down sampled.

How it's down sampled can be specified in Compression in the PDF Option Setting dialog or in Option Seting File.



Copyright © 1999-2005 Antenna House, Inc. All rights reserved.
Antenna House is a trademark of Antenna House, Inc.