Eliot Jones
e01d77b93a
add negative test case and make tests non-lenient
2019-12-05 13:56:12 +00:00
Eliot Jones
2e5c995322
make external nodes different to document nodes and finish reimplementation
2019-12-05 13:21:19 +00:00
Eliot Jones
ecf0b8743b
make bookmarknode immutable and use scanner when retrieving bookmarks
2019-12-05 12:03:30 +00:00
Eliot Jones
a967e0898a
handle missing width and height correctly for compact font format fonts #75
2019-12-04 14:19:06 +00:00
Eliot Jones
80f024dbed
make form access public
2019-11-27 16:36:25 +00:00
Eliot Jones
ed53773c7b
handle checked state of radio buttons and checkboxes
2019-11-27 15:34:28 +00:00
Eliot Jones
910e22a4e9
wrap checkboxes and radiobuttons in their own form field types with access to the child collections
2019-11-26 16:33:24 +00:00
Eliot Jones
a2147902a0
merge pull request #72 from uglytoad/fix-export-formatting
...
fix export formatting
2019-10-17 11:28:06 +01:00
Eliot Jones
57dfee3211
move alto xml exporter to root export namespace
2019-10-17 10:46:43 +01:00
Eliot Jones
3f1321141a
#73 process xobject form content when extracting text and images
2019-10-16 14:59:16 +01:00
Eliot Jones
f14c52a05a
fix tests for renaming and validating generate alto xml
2019-10-15 13:59:09 +01:00
Eliot Jones
d68bd88824
format and tidy up alto export autogenerated code. tidy up docstrum
2019-10-14 18:30:18 +01:00
Eliot Jones
dec4c31a33
fix bug where cross reference stream subsections were skipped
...
a single cross-reference stream may contain multiple disjoint runs of object numbers, previously we only took the first now we load all objects.
adds indexer to array token for ease-of-use.
adds page number and bounds information to all form fields.
2019-10-10 16:05:21 +01:00
Eliot Jones
2ef45f71d5
make missing acroform types public and start improving data
...
also changes pages to use a proper tree structure since this will be required for resource inheritance and for acroform widget dictionaries.
2019-10-09 14:28:37 +01:00
Eliot Jones
68bcaf3901
#55 move support for images to page and add inline images
...
support both xobject and inline images. adds unsupported filters so that exceptions are only thrown when accessing lazily evaluated image.bytes property rather than when opening the page.
treat all warnings as errors.
2019-10-08 14:04:36 +01:00
Eliot Jones
0cd7795bff
add method to get all pages from document
2019-08-23 19:09:33 +01:00
Eliot Jones
f5e025aa70
merge pull request #58 from uglytoad/colors
...
adds colors to letters and prepares code to add colors to paths.
2019-08-13 20:50:06 +01:00
Eliot Jones
f55091f3d2
make color types public and add stream based tests to prevent future breaking as observed in #52
2019-08-13 20:48:22 +01:00
Eliot Jones
0349bedd3e
#57 add access to document metadata and expose wrapper type
2019-08-11 12:42:30 +01:00
Eliot Jones
4dde4ca0c1
add colors to letters based on current font and graphics state
2019-08-05 19:26:10 +01:00
Eliot Jones
50bf1784bd
merge pull request #43 from Numpsy/test_document_information
...
Add a unit test for reading document information
2019-07-07 17:01:42 +01:00
Eliot Jones
557d8bc948
map missing character codes directly #44
...
previously if no matching unicode was found for a character code we would return a null letter. instead we now map from the character code directly to a character. this seems to work for most documents, except where there are ligatures, e.g. fi or ff, but is still better than not returning anything.
2019-07-07 13:53:25 +01:00
Richard Webb
10dcbd0cc4
Add a unit test for reading document information
2019-07-06 22:18:18 +01:00
Eliot Jones
0dfe742770
continue searching for xref tokens even if an %%eof is encountered #38
2019-07-06 14:26:38 +01:00
Eliot Jones
caf1a0c233
use invariant culture for parsing all numbers #37
2019-06-18 19:12:51 +01:00
Eliot Jones
98424b32aa
special case handling for faulty offsets in xref with missing whitespace between eof and object number
2019-06-14 20:40:24 +01:00
Eliot Jones
4c716fcbd6
finish support for revision 5 encryption using aes 256 #34
2019-06-13 19:46:08 +01:00
Eliot Jones
f3c8220ec4
add test coverage for invalid document from #33
2019-06-08 16:58:20 +01:00
Eliot Jones
90e9c46373
correctly determine compact font format encoding where supplements are used and pass the font encoding for type 1 fonts to the encoding used to read the pdf
2019-05-11 11:56:01 +01:00
Eliot Jones
03af28ed6d
fix bug with compact font format font matrix reading and where endstream token is missed if immediately following 'e'
2019-05-10 20:02:29 +01:00
Eliot Jones
bad57763a1
finish initial support for rc4 encryption with blank user password
2019-05-06 15:41:29 +01:00
Eliot Jones
be394f5bba
start adding support for reading encrypted documents
2019-05-04 15:36:13 +01:00
Eliot Jones
a4305cc86c
test letter positions with truetype standard14 mutant. fix truetype table overflow. read compact font format encodings
2019-01-12 18:07:53 +00:00
Eliot Jones
d86c79da45
improve caching for system font finder and fix issue with composite glyphs in truetype fonts
2019-01-09 20:42:49 +00:00
Eliot Jones
b6a528a97d
#9 add support for cff cid fonts, currently bounding box and width are calculated incorrectly.
2019-01-07 20:52:42 +00:00
Eliot Jones
f1621b3924
#24 make some fields public
2019-01-05 15:13:32 +00:00
Eliot Jones
20e843f5ae
#24 start adding classes for the acroform api
2019-01-01 17:44:46 +00:00
Eliot Jones
4d5518a599
move annotations to experimental access, support changing color state for document creation and update readme
2018-12-30 14:12:04 +00:00
Eliot Jones
55a0e6b646
move large strings from code files to avoid wrong language detected on github
2018-12-29 11:55:19 +00:00
Eliot Jones
79c555e967
#9 add some more code for reading glyph ids from cid fonts in compact font format
2018-12-29 11:41:46 +00:00
Eliot Jones
47e49c4044
#9 fix bug with truetype fonts and start adding support for cid fonts using compact font format
2018-12-28 22:34:47 +00:00
Eliot Jones
d572af8a52
finish first pass of annotation api
2018-12-22 15:54:32 +00:00
Eliot Jones
a5349dd77a
start adding retrieval of annotations
2018-12-20 18:18:32 +00:00
Eliot Jones
3a4b7b79d1
#21 change dictionarytoken to use explicit key type, finish os/2 table for truetype, first file creation using embedded truetype font
2018-12-08 14:38:27 +00:00
Eliot Jones
a5ce43774b
revert change to public api of letter. update readme
2018-11-26 20:18:00 +00:00
Eliot Jones
997979cc92
#11 early access to the raw xobjects for images.
...
temporary 'safe' untested implementation of seac for type 1 charstrings.
make structure public
bump version of package and project to 0.0.3 (it had accidentally increased to 0.0.5)
2018-11-26 19:46:41 +00:00
Eliot Jones
48fa4a4f15
add position test for truetype using system fonts. add images for documentation
2018-11-25 15:00:52 +00:00
Eliot Jones
243f3dc099
#13 handle special case cff file and reduce duplication in integration tests
2018-11-25 12:36:38 +00:00
Eliot Jones
87fccbbadc
#15 add word boxes to the visual verification tests
2018-11-25 10:17:01 +00:00
Eliot Jones
fdd48b25d8
#15 change default word extraction for latex test
2018-11-25 10:10:28 +00:00