The PDF file format (2024)

Skip to content

Prepressure

Prepress, printing, PDF, PostScript, fonts and stuff…

The average user never needs to look ‘under the hood’ of PDF files. For curious people, this page takes a closer look at the way information is stored in a PDF file.

General conventions

Here is some useful information in case you intend to open PDF-files to edit them straight away:

  • PDF files are either 8-bit binary files or 7-bit ASCII text files (using ASCII-85 encoding).
  • Every line in a PDF can contain up to 255 characters.
  • Every line ends with a carriage return, a line feed or a carriage return followed by a line feed (depending upon the application or platform used to create the PDF file).
  • PDF is case sensitive.
  • The file format is completely independent of the platform that it is viewed or created on. Files can be moved back and forth between Macs, Windows system, Linux systems,… When FTP-ing a PDF file, it does make sense to compress it, to avoid data corruption by some outdated web system that the file needs to go through.

PDF file structure

PDF files use a fixed structure, they always contain 4 sections:

  • A header, which contains information on the PDF-specifications the file adheres to. This line looks like this: ‘%PDF-1.2’.
  • The body area which contains a description of the various elements that are placed on the pages.
  • A cross-reference table which refers to all the elements from the body that are used on the pages of the PDF-file.
  • A trailer that tells applications or RIPs where to find the cross-reference table and always ends with ‘%%EOF’. If this line is missing, the PDF-file is not complete and can probably not be processed by any RIP or application. This is not the case with PostScript files. If the last few lines of a PostScript file are missing (because of a lost connection while transferring the file or a computer crash) you can often still print most of the pages. With a PDF-file, you’ll lose everything.

PDF imaging model

Objects that are placed on PDF pages are called ‘marks’. The page surface is called the ‘canvas’.

  • A coordinate system is used to define where each mark is placed. By default, coordinates are defined in points (72 units per inch) but this measurement system can be redefined within a PDF. The origin is in the bottom left but this 0,0 coordinate can also be redefined. This flexible coordinate system is called ‘User Space’. Afterward when a PDF is sent to an output device such as a printer, the RIP needs to recalculate everything to ‘Device Space’, the coordinate system of the output device.
  • Marks can have a number of characteristics:
    • a fill
    • a stroke
    • a color, which can be defined in one of the color spaces that PDF supports (11 in the most recent versions)
    • a certain level of transparency (from PDF 1.4 onwards)
  • All graphic objects are either
    • paths – shapes made out of lines, curves and or rectangles
    • text
    • bitmap images
    • Form XObjects – which are reusable elements
    • PostScript language fragments – worst case
  • Some of the content of a PDF can be optional content, marks that can be selectively viewed or hidden. In Acrobat optional content is somewhat misleadingly referred to as ‘layers’. Some practical examples of this:
    • Maps that contain a coordinate grid that can be activated/deactivated
    • Brochures with text in multiple languages
    • Packaging files with the die-cut and embossing information as separate optional content.

Modifying data in a PDF

If data are appended to a PDF-file (for instance because the user edited text in Adobe Acrobat and saved the file again or if you merge PDF files), another body area, cross-reference table, and trailer are added to the end of the file. This bloats the PDF file size. By opening a file in Acrobat and using ‘Save as’, you force the application to clean up the PDF file so there are no more multiple data areas. The same is true when deleting pages in a PDF: only when you force the application to regenerate a new file will it clean up unused data.

  1. Thank you for a very good summary!
    PDF 2.0 (ISO 32000-2 / 2017) introduced UTF-8 encoded strings as an additional format for text strings.

    Reply

Leave a Reply

The PDF file format (2024)

FAQs

How do I get answers from a PDF? ›

Try our PDF AI Chat tool for free

Ask questions on our latest PDF AI summariser tool. Upload a document, sign in, and type a question to get a quick response. With Acrobat AI Assistant, you can understand key information and analyse data within seconds.

What does PDF stand for answer document format? ›

Answer. PDF stands for "portable document format". Essentially, the format is used when you need to save files that cannot be modified but still need to be easily shared and printed.

How do I fix my PDF file Format? ›

Quick list: How to Repair a Damaged PDF.
  1. Update Adobe Acrobat Reader. One of the reasons a PDF might not display properly is because your Adobe Acrobat Reader isn't up to date. ...
  2. Repair installation. ...
  3. Restore previous version. ...
  4. Convert the PDF.

How can I answer a PDF file? ›

Click the “Fill & Sign” tool in the right pane. Fill out your form: Complete form filling by clicking a text field and typing or adding a text box. You can add checkmarks and fill in radio buttons too. Sign your form: Click “Sign” in the toolbar at the top of the page.

Can ChatGPT answer questions from a PDF? ›

Copy the PDF file URL and enter the prompt to let ChatGPT summarize or answer questions based on your PDF.

How do I make a PDF read to me? ›

Open an adobe (pdf) file. Toggle to the “view” screen and scroll down to “Read Out Loud.” Select “Activate Read Out Loud.” ” Then select how you want the document to be read “Read This Page Only” or “Read To End of Document.”

What is the difference between PDF and PDF format? ›

The main difference between the two format lies in the components: while standard PDFs can include elements such as external fonts, external references, multimedia and other objects that might change with time, PDF/As don't, making this format the ideal choice for documents that are destined to remain unchanged for ...

How do I make a PDF file? ›

Create PDF files in Adobe Acrobat:
  1. Open Acrobat and choose “Tools” > “Create PDF”.
  2. Select the file type you want to create a PDF from: single file, multiple files, scan or other option.
  3. Click “Create” or “Next” depending on the file type.
  4. Follow the prompts to convert to PDF and save to your desired location.

How to convert PDF to Word? ›

How to convert PDF files into Word documents:
  1. Open a PDF file in Acrobat.
  2. Click on the “Export PDF” tool in the right pane.
  3. Choose Microsoft Word as your export format, and then choose “Word Document.”
  4. Click “Export.” ...
  5. Save your new Word file:

How do I correct a PDF format? ›

Edit PDF files in Adobe Acrobat:
  1. Open a file in Acrobat.
  2. Click the “Edit PDF” tool in the right pane.
  3. Use Acrobat editing tools: Add new text, edit text or update fonts using selections from the Format list. ...
  4. Save your edited PDF: Name your file and click the “Save” button. That's it.

Can you change the format of a PDF? ›

You can select text in a PDF file and save it in one of the supported formats: DOCX, DOC, XLSX, RTF, XML, HTML, or CSV. Use the Select tool and mark the content to save. Right-click the selected text and choose Export Selection As. Select a format from the Save As Type list and select Save.

How to reduce PDF format? ›

Reduce File Size of PDF File in Acrobat Pro
  1. With Acrobat Pro open, go to File > Open.
  2. After your file is open, Go to File > Save as Other > Reduced Size PDF....
  3. Select the Acrobat version that is compatible with the features in your PDF file. ...
  4. Re-name your file and Click Save.
Aug 29, 2024

How do I get a document in PDF format? ›

1. Create a PDF from a Word document
  1. Open the document you want to save as a PDF.
  2. Click on the File tab.
  3. In Word, click Save As | PDF from the drop-down menu. In Google Docs, click File | Download | PDF. ...
  4. In the file name box, . pdf will automatically appear at the end of your file name.
Feb 7, 2023

How do I pass a PDF file? ›

Open the Acrobat app. Navigate to the PDF you wish to send. Tap the send icon on the top right portion of the screen. In the new dialog box, you have the option to share via email, or you can send a copy via Messages or other third-party apps such as WhatsApp.

How do I open something in a PDF? ›

Find the PDF you want to open in your Files and double click to open. Select Adobe Acrobat (or whichever reader you downloaded) from the list of available options. If no list appears or the page opens in another application, you can right-click the file and select Open With to choose your PDF reader. Click Open.

How do I retrieve text from a PDF? ›

Method 1: Copy and Paste the Text

One of the most widely used options to extract text from PDF documents is to simply copy and paste the text. Many people prefer this method because copying and pasting text is a familiar process — something that you do nearly every day.

How do I convert a PDF to detect text? ›

Open Scan and OCR Tool

From the menu, choose Recognize Text. This will open the Scan and OCR tool. You can also right click the main document and choose Recognize Text from the menu. If either of these do not work, open the Scan and OCR tool from the Tools menu.

How do I extract search results from a PDF? ›

You need a copy of Adobe® Acrobat® Pro along with the AutoSplit Pro™ plug-in installed on your computer in order to use this tutorial. Both are available as trial versions. With the file to be processed open in Acrobat, select "Plug-Ins > Split Documents > Extract Pages By Text Search" from the main menu.

How do I extract text from a PDF form? ›

Open the PDF document using a PDF reader like Adobe Acrobat Reader. Select the text you want to extract by dragging your mouse cursor over the desired area. Right-click on the selected text and choose the "Copy" option. Open a text editor or word processing software (e.g., Microsoft Word, Google Docs).

Top Articles
How to Be a Successful Gambler: 13 Steps (with Pictures) - wikiHow
4 Reasons Why You Shouldn’t Use an Online Travel Agency or Booking Site - Valley International Airport
Fernald Gun And Knife Show
Northern Counties Soccer Association Nj
Jesus Calling December 1 2022
Coffman Memorial Union | U of M Bookstores
15 Types of Pancake Recipes from Across the Globe | EUROSPAR NI
MADRID BALANZA, MªJ., y VIZCAÍNO SÁNCHEZ, J., 2008, "Collares de época bizantina procedentes de la necrópolis oriental de Carthago Spartaria", Verdolay, nº10, p.173-196.
2013 Chevy Cruze Coolant Hose Diagram
South Ms Farm Trader
Dityship
Brenna Percy Reddit
Capitulo 2B Answers Page 40
Cvs Learnet Modules
Mlb Ballpark Pal
Best Fare Finder Avanti
Letter F Logos - 178+ Best Letter F Logo Ideas. Free Letter F Logo Maker. | 99designs
25Cc To Tbsp
Parent Resources - Padua Franciscan High School
E22 Ultipro Desktop Version
U Arizona Phonebook
Zack Fairhurst Snapchat
Walmart Car Department Phone Number
Finalize Teams Yahoo Fantasy Football
Phoebus uses last-second touchdown to stun Salem for Class 4 football title
Homeaccess.stopandshop
The Old Way Showtimes Near Regency Theatres Granada Hills
St Clair County Mi Mugshots
A Person That Creates Movie Basis Figgerits
Jermiyah Pryear
Does Hunter Schafer Have A Dick
The Collective - Upscale Downtown Milwaukee Hair Salon
Striffler-Hamby Mortuary - Phenix City Obituaries
Vadoc Gtlvisitme App
Earthy Fuel Crossword
Street Fighter 6 Nexus
Used 2 Seater Go Karts
Star News Mugshots
Current Time In Maryland
Nacogdoches, Texas: Step Back in Time in Texas' Oldest Town
Naya Padkar Newspaper Today
Chs.mywork
South Bend Tribune Online
Davis Fire Friday live updates: Community meeting set for 7 p.m. with Lombardo
F9 2385
Sas Majors
Craigslist/Nashville
Flappy Bird Cool Math Games
Mauston O'reilly's
RubberDucks Front Office
Best Restaurant In Glendale Az
El Patron Menu Bardstown Ky
Latest Posts
Article information

Author: Laurine Ryan

Last Updated:

Views: 5500

Rating: 4.7 / 5 (77 voted)

Reviews: 92% of readers found this page helpful

Author information

Name: Laurine Ryan

Birthday: 1994-12-23

Address: Suite 751 871 Lissette Throughway, West Kittie, NH 41603

Phone: +2366831109631

Job: Sales Producer

Hobby: Creative writing, Motor sports, Do it yourself, Skateboarding, Coffee roasting, Calligraphy, Stand-up comedy

Introduction: My name is Laurine Ryan, I am a adorable, fair, graceful, spotless, gorgeous, homely, cooperative person who loves writing and wants to share my knowledge and understanding with you.