Skip to content
This repository has been archived by the owner on May 30, 2023. It is now read-only.

Running the pdf example produces a pdf with no selectable text #10373

Closed
ghost opened this issue Feb 7, 2012 · 51 comments
Closed

Running the pdf example produces a pdf with no selectable text #10373

ghost opened this issue Feb 7, 2012 · 51 comments

Comments

@ghost
Copy link

ghost commented Feb 7, 2012

[email protected] commented:

Which version of PhantomJS are you using? Tip: run 'phantomjs --version'.
1.4.1

What steps will reproduce the problem?

  1. phantomjs rasterize.js 'http://en.wikipedia.org/w/index.php?title=Jakarta&printable=yes' jakarta.pdf
  2. Look at result and try to select text
  3. Impossible to select text

What is the expected output? What do you see instead?
output is a pdf. i expect to see pdf and ability to select the text.

Which operating system are you using?
osx

Did you use binary PhantomJS or did you compile it from source?
binary

Please provide any additional information below.
Are you supposed to be able to select the text for the pdf? If not then what is the purpose? Thanks!

Disclaimer:
This issue was migrated on 2013-03-15 from the project's former issue tracker on Google Code, Issue #373.
🌟   14 people had starred this issue at the time of migration.

@thomasbachem
Copy link

[email protected] commented:

Same on Debian with the 1.5.0 binary. Tried it with several websites. That's a pretty big issue :/

@thomasbachem
Copy link

[email protected] commented:

This issue seems very similar to http://code.google.com/p/wkhtmltopdf/issues/detail?id=886, where rendering with the newest static build of wkhtmltopdf leads to unselectable text, but using an older binary works well. May be that helps, since wkhtmltopdf is also relying on QtWebKit.

@veeneck
Copy link

veeneck commented Aug 16, 2012

[email protected] commented:

Using 1.6 now, this issue is still here, and doesn't appear to be one of the milestones for 1.7. Wanted to check in and see if anyone found a workaround?

@trainiac
Copy link

[email protected] commented:

It turns out that phantomjs 1.6 running on Linux does output PDFs with selectable text. I can confirm that phantomjs 1.6 on Mac OS X 10.7.4 does not output pdfs with selectable text.

@wachunga
Copy link

[email protected] commented:

Also an issue on phantomjs 1.7.0 on OSX 10.6.8 using the same jakarta example.

@andineck
Copy link

[email protected] commented:

I have got the same problem on phantomjs 1.8.1 on OSX 10.8.2

@andineck
Copy link

[email protected] commented:

I can confirm that text is selectable inside the PDF with phantomjs 1.8.1 on Ubuntu 12.04. It looks like it really is a Mac OSX problem.

@ragekit
Copy link

ragekit commented Apr 29, 2013

using an old version of wkhtmltopdf seems to work (like this one) http://wkhtmltopdf.googlecode.com/files/wkhtmltopdf-OSX-0.10.0_rc2-static.tar.bz2

@ragulka
Copy link

ragulka commented Sep 23, 2013

I can also confirm that running phantomjs 1.8.2 on OSX 10.8.5 results in text not being selectable and links not clickable.

@bruno-c
Copy link

bruno-c commented Oct 4, 2013

OSX 10.7.5 here, same issue.

@ianpetzer
Copy link

There is a pull request for a fix here: #11509

@TooBug
Copy link

TooBug commented Aug 25, 2014

any progress on this?

@marcbachmann
Copy link

I guess this will be fixed in version phantomjs v2.
It's using QT5 which shouldn't have the problem.

@nickcmaynard
Copy link

Backing up @andi-neck - definitely an OSX issue. Identical NPM/phantomjs versions show this working on Linux (Ubuntu 14.04), but not on OSX (Yosemite).

@aidos
Copy link

aidos commented Dec 2, 2014

I'm running 2.0.0 (development) on linux and I'm seeing the issue. While trying to render an svg containing embedded fonts all the text is rasterized (it looks correct, but it isn't text).

@AnthonyClink
Copy link

I experience the same issue same code runs great on windows 8, except, the viewport seems to be better respected on my windows box and the text just is not selectable.

@josebalius
Copy link

What is the resolution for this? Is it confirmed that it works on Linux but not Mac?

@aidos
Copy link

aidos commented Jan 5, 2015

It's not working on Linux for me (2.0.0).

I discovered that as long as you don't embed the fonts in the SVG they render as text. I guess you need to have them registered and available within the system but that gets into some Linux font madness that I couldn't quite figure out.

I was trying to take my embedded fonts, strip them from the SVG, register them temporarily as system fonts, render the PDF, then deregister the fonts again. I think there's a work around there - hopefully someone who knows a little more about font management in Linux knows how to make this pattern work.

@coud
Copy link

coud commented Jan 31, 2015

I confirm this bug too using phantomjs 2 in Ubuntu 14.04

@jordanandree
Copy link

confirmed that this issue is still present with 2.0 on Mac OS 10.10.2. Rendering a PDF results in text being rasterized and not selectable.

@anantshri
Copy link

Hey Folks, is there any version of phantomjs where text seems to be selectable on MacOSX.

I have 1.9.0 on linux box and the resultant pdf has non selectable text.
on MacOSX i have tried with version 2.0.0, 1.9.0, 1.9.8, 1.9.6

This is turning out to be a deal breaker for me.

astefanutti added a commit to astefanutti/phantomjs that referenced this issue Aug 6, 2015
…n QCoreTextFontEngine

Fix bugs generating PDF on Cocoa

Provide real implementations of:
	properties(), faceId() and getUnscaledGlyph

Task-number: QTBUG-10094

ariya#10373
astefanutti added a commit to astefanutti/phantomjs that referenced this issue Aug 6, 2015
…n QCoreTextFontEngine

Fix bugs generating PDF on Cocoa

Provide real implementations of:
	properties(), faceId() and getUnscaledGlyph

Task-number: QTBUG-10094

ariya#10373
@abecks
Copy link

abecks commented Aug 26, 2015

I can confirm this is happening when building PhantomJS 2.0.0 from source on CentOS 6

@XadillaX
Copy link

+1 same issue

@ghost
Copy link

ghost commented Sep 29, 2015

+1 with PhantomJS 2.0.0 on Mac. Any workarounds?

@ali-habibzadeh
Copy link

+1 Same issue

@astefanutti
Copy link
Contributor

Given the externalisation of Qt Base as a submodule, PR #13243 is superseded by vitallium/qtbase#2.

@anantshri
Copy link

@astefanutti do you have instructions on how this pull request could be tested. I am interested in doing the test just need proper instructions.

@astefanutti
Copy link
Contributor

@anantshri just rebuild PhantomJS with vitallium/qtbase#2 on Mac OS and run the usual rasterize.js example with output PDF.

@ghost
Copy link

ghost commented Dec 3, 2015

@astefanutti does that also include hyperlink support (#10196)? Given the warning in the compilation instructions (http://phantomjs.org/build.html), any chance you would mind posting a compiled binary for Mac OS? Thanks a ton for your fixes!

@astefanutti
Copy link
Contributor

@travis5555 you can get a Mac OS binary from here: https://github.com/astefanutti/decktape#install. It contains hyperlink support as well.

@ghost
Copy link

ghost commented Dec 3, 2015

thanks for the quick reply @astefanutti! I hadn't seen decktape before - it looks pretty awesome. since it has all of the awesomeness of embedded text, hyperlink, support, etc, is there any reason it can't be used for generic HTML > PDF conversion as opposed to slide decks specifically?

@astefanutti
Copy link
Contributor

@travis5555 you're right. DeckTape depends on the improved version of PhantomJS that I maintain here https://github.com/astefanutti/phantomjs and that can be used for general purpose HTML to PDF conversion. Ideally, all the improvements will be integrated in PhantomJS upstream so that DeckTape value remains for slide decks specifically.

@kilianc
Copy link

kilianc commented Dec 13, 2015

@astefanutti how do I npm i your fork ?

@astefanutti
Copy link
Contributor

Some progress related to that issue can be found at #13997 (mostly Linux).

@Tin-Nguyen
Copy link

Tin-Nguyen commented Apr 20, 2016

+1 same issue. Any workaround to fix this?

@robinfhu
Copy link

robinfhu commented May 9, 2017

Wanted to post my solution to this problem. It turns out that loading a web font from a remote URL will cause PhantomJS to rasterize the font in the PDF. This creates a PDF where the text cannot be highlighted, since it is an image. This causes the PDF file size to grow 10 times.

We were using Proxima Nova, and our CSS file looked like this:

@font-face
    font-family ProximaNovaReg
    font-style normal
    font-weight 100
    src url("/assets/fonts/ProximaNova-ThinWeb.woff") format("woff")

body
   font-family ProximaNovaReg

To fix the issue, we installed the Proxima Nova TTF files directly onto our Ubuntu box. This means copying the TTF files to /usr/share/fonts/truetype, and running fc-cache -fv.

Now we can change our CSS to just the following:

body
   font-family "Proxima Nova"

PhantomJS now treats Proxima Nova as a natively installed font, and renders a smaller sized PDF with selectable text. This is the right solution.

Note: I only encountered this problem on Linux. Mac OS worked fine.

@jpbochi
Copy link

jpbochi commented May 10, 2017

@robinfhu I was going to try your solution and realised that upgrading my dependencies was enough. I think there was a bug, and it got fixed at some point. Regardless of that, the case you found is a very good one to know.

FWIW, I'm using phantomjs indirectly via https://www.npmjs.com/package/markdown-pdf.

@SWGFL
Copy link

SWGFL commented May 24, 2017

1.9.8 works for me on Centos7. 2.1.1 rasterizes the output

@liwonder
Copy link

liwonder commented Aug 31, 2017

@robinfhu your solution works for me.
environment:
phantomjs: 2.1.1
os: Ubuntu 16.04 LTS

I installed the font used in my pdf to the machine. Now the pdf output is as normal as the output in my Mac.

But the thing is I need to run some pdf render test in the CI with Ubuntu OS. I don't want to install font before run the test, and that is not a good solution.

Maybe with some fix the linux version of Phantomjs can fix this issue. I am still watching this, if there are any updates.

@ariya
Copy link
Owner

ariya commented Dec 31, 2019

Due to our very limited maintenance capacity (see #14541 for more details), we need to prioritize our development focus on other tasks. Therefore, this issue will be closed. In the future, if we see the need to attend to this issue again, then it will be reopened.
Thank you for your contribution!

@ariya ariya closed this as completed Dec 31, 2019
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests