Channel: How to extract text from a PDF file? - Stack Overflow

How to extract text from a PDF file?

February 10, 2024, 12:56 pm

≪ Previous: Answer by Eugene for How to extract text from a PDF file?

I'm trying to extract the text included in this PDF file using Python.

I'm using the PyPDF2 package (version 1.27.2), and have the following script:

import PyPDF2with open("sample.pdf", "rb") as pdf_file:    read_pdf = PyPDF2.PdfFileReader(pdf_file)    number_of_pages = read_pdf.getNumPages()    page = read_pdf.pages[0]    page_content = page.extractText()print(page_content)

When I run the code, I get the following output which is different from that included in the PDF document:

 ! " # $ % # $ % &% $ &' ( ) * % + , - % . / 0 1 ' * 2 3% 45' % 1 $ # 2 6 % 3/ % 7 / ) ) / 8 % &) / 2 6 % 8 # 3" % 3" * % 31 3/ 9 # &)%

How can I extract the text as is in the PDF document?

↧

↧

Latest Images

Eco Data 4/26/24

Eco Data 4/26/24

April 25, 2024, 5:00 pm

‘Pay day every day’ may become Shangri-La Group, BPOs’ secret to happy employees

April 25, 2024, 5:51 am

Nonprofit donates custom home in this East Bay city for Marine injured in...

Nonprofit donates custom home in this East Bay city for Marine injured in...

April 23, 2024, 7:00 am

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

April 22, 2024, 6:00 am

Ukraine bans military from online gambling amid addiction concerns

Ukraine bans military from online gambling amid addiction concerns

April 22, 2024, 5:17 am

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

April 20, 2024, 8:08 pm

OCBC Bank Singapore Offers Up to 2.8% p.a. Fixed Deposit Promotion from 21...

April 20, 2024, 12:38 pm

National Poetry Month 2024: Maxine Starr

National Poetry Month 2024: Maxine Starr

April 19, 2024, 9:56 am

Vegan Chicken Pot Pie

Vegan Chicken Pot Pie

April 19, 2024, 9:18 am

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

April 19, 2024, 7:03 am

Trending Articles

A Wall Street guide to watches

August 5, 2015, 7:32 am

Who Is Junior Pope?| Biography| Profile| History Of Nollywood Actor “Pope...

July 26, 2017, 8:45 am

AUDIO | Diamond Platnumz ft Mugabe - LawaMa | Download

July 25, 2014, 8:00 am

Gangland murders in Dublin (1990-94)

April 17, 2020, 1:54 am

Tuck Mill sells for £1.4 million

April 15, 2013, 5:22 am

Practice Sheet of Right form of verbs for HSC Students

September 22, 2019, 11:40 pm

100+ Short Whatsapp Status in English | Short Status Quotes Words

March 22, 2017, 12:27 am

Romantic And Impressive Birthday Wishes For Girlfriend - Best Birthday Wishes...

January 30, 2020, 8:41 am

Pengalaman Rawatan di Klinik Dr. Ko

October 15, 2021, 7:41 am

Windows 11 Highly Compressed ISO - 10 MB

July 1, 2021, 2:00 pm

Who Is Jennifer Hines? Bryan Olesen Wife Is Mother Of 3 Kids

March 5, 2024, 2:19 am

Consuelo Ortiga y Rey: The "Crush ng Bayan" in Rizal's Time

August 4, 2013, 11:32 pm

NAT, NCAE, LAPG, SREYA, ELNA and PHIL-IR Materials and Reviewers

February 27, 2017, 6:16 pm

Bar Rescue - The Prime Bar (WildeFire Bistro) Update

September 15, 2019, 6:50 am

Guntur District Police Officers Mobile Numbers

April 17, 2017, 2:10 am

Nellore Potti Sriramulu District Police Officers Mobile Numbers

April 17, 2017, 1:40 am

Read GOS (Generic Object Service) Picture Attachments and Display it into...

February 14, 2014, 1:08 pm

A List of Glasses Wholesale Markets in Guangzhou–World of Spectacles

August 22, 2017, 9:42 am

Housefull 4 (2019) Hindi 1080p WEB-DL 1.4GB ESubs Download Khatrimaza

December 19, 2019, 9:12 pm

Murder In The Family: Ronald Luyster was killed by Cody Steich, who was...

September 22, 2016, 8:32 pm

Latest Images

Eco Data 4/26/24

Eco Data 4/26/24

April 25, 2024, 5:00 pm

‘Pay day every day’ may become Shangri-La Group, BPOs’ secret to happy employees

April 25, 2024, 5:51 am

Nonprofit donates custom home in this East Bay city for Marine injured in...

Nonprofit donates custom home in this East Bay city for Marine injured in...

April 23, 2024, 7:00 am

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

April 22, 2024, 6:00 am

Ukraine bans military from online gambling amid addiction concerns

Ukraine bans military from online gambling amid addiction concerns

April 22, 2024, 5:17 am

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

April 20, 2024, 8:08 pm

OCBC Bank Singapore Offers Up to 2.8% p.a. Fixed Deposit Promotion from 21...

April 20, 2024, 12:38 pm

National Poetry Month 2024: Maxine Starr

National Poetry Month 2024: Maxine Starr

April 19, 2024, 9:56 am

Vegan Chicken Pot Pie

Vegan Chicken Pot Pie

April 19, 2024, 9:18 am

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

April 19, 2024, 7:03 am

© 2024 //www.rssing.com