Eliot Jones
23c7e44fc8
handle stream length being an object stream value
2019-12-24 15:22:47 +00:00
Eliot Jones
3084a9aab6
support streams containing only carriage returns. handle comments in arrays and dictionaries
...
* while the pdf specification says stream data should follow a newline following a stream operator some files have only a carriage return following the stream operator.
* since comment tokens may appear inside an array or dictionary we ignore them if they occur here since they will break interpretation of the dictionary or array contents.
2019-12-20 14:04:58 +00:00
Eliot Jones
68bcaf3901
#55 move support for images to page and add inline images
...
support both xobject and inline images. adds unsupported filters so that exceptions are only thrown when accessing lazily evaluated image.bytes property rather than when opening the page.
treat all warnings as errors.
2019-10-08 14:04:36 +01:00
Eliot Jones
bbe5409f94
#62 use length value of stream directly to read the full stream once
2019-08-20 21:08:06 +01:00
Eliot Jones
caf1a0c233
use invariant culture for parsing all numbers #37
2019-06-18 19:12:51 +01:00
Eliot Jones
98424b32aa
special case handling for faulty offsets in xref with missing whitespace between eof and object number
2019-06-14 20:40:24 +01:00
Eliot Jones
2b486dccab
prevent infinite loops where a stream token's length entry references itself. perform brute force scans in case of a faulty xref table #33
2019-06-08 16:45:02 +01:00
Eliot Jones
03af28ed6d
fix bug with compact font format font matrix reading and where endstream token is missed if immediately following 'e'
2019-05-10 20:02:29 +01:00
Eliot Jones
bad57763a1
finish initial support for rc4 encryption with blank user password
2019-05-06 15:41:29 +01:00
Eliot Jones
be394f5bba
start adding support for reading encrypted documents
2019-05-04 15:36:13 +01:00
Eliot Jones
2fa781b8e9
#10 make all token classes public and expose via a public structure member on pdf document
2018-11-24 19:02:06 +00:00
Eliot Jones
7d90f4858a
continue migrating code to tokenizer
2018-01-20 18:42:29 +00:00
Eliot Jones
3d2a66cbf9
fix bug with endstream appearing without line break
2018-01-20 11:53:24 +00:00
Eliot Jones
615ee88a46
start passing the pdf scanner in to read the type 1 files
2018-01-14 15:33:22 +00:00
Eliot Jones
b19b96604d
make the pdf object scanner work with streams
2018-01-14 10:53:01 +00:00
Eliot Jones
8dcea9b37f
create a pdf object scanner which sits on top of the core token scanner to provide complete object parsing
2018-01-13 22:30:15 +00:00