Eliot Jones
4042649b46
update the readme for 0.0.10 release
2019-12-09 13:37:22 +00:00
Eliot Jones
6ee7c09788
merge pull request #93 from BobLd/master
...
improving clustering algorithm
2019-12-09 13:10:17 +00:00
Eliot Jones
d37149a8d7
support custom encodings for type 1 standard 14 fonts without metrics #95
2019-12-09 13:02:52 +00:00
Eliot Jones
f2ead37134
handle missing whitespaces before the start of the object #88
2019-12-09 12:24:20 +00:00
BobLd
b69c004548
Changing functions description to reflect changes
2019-12-07 22:49:10 +00:00
Eliot Jones
75a6260501
make cropbox public
2019-12-06 17:34:51 +00:00
BobLd
fd9efb5b5d
making FindIndexNearest() internal
2019-12-06 17:29:33 +00:00
Eliot Jones
e38da0a403
add support for alternative colorspace in separation colorspaces #89
2019-12-06 17:23:15 +00:00
BobLd
b5a0faaa3b
Improving clustering algorithm
2019-12-06 16:02:30 +00:00
BobLd
e0138c7ae1
Merge branch 'master' of https://github.com/UglyToad/PdfPig
2019-12-06 15:39:03 +00:00
Eliot Jones
c57d8446e4
merge pull request #92 from uglytoad/outlines-refactoring
...
outlines refactoring
2019-12-06 13:46:37 +00:00
Eliot Jones
2ff6a4bf11
handle rotations of 360 degrees or more #90
2019-12-06 13:36:45 +00:00
BobLd
439bd9bb53
Merge branch 'master' of https://github.com/UglyToad/PdfPig
2019-12-06 11:41:33 +00:00
Eliot Jones
a9b61d81fa
skip explicit destinations where the page cannot be found
2019-12-05 16:40:21 +00:00
Eliot Jones
d6e1dccc01
add support for standardencoding in type 1 fonts #78
2019-12-05 16:32:10 +00:00
Eliot Jones
e01d77b93a
add negative test case and make tests non-lenient
2019-12-05 13:56:12 +00:00
Eliot Jones
8ca947542f
skip unrelated entries in document name tree
2019-12-05 13:47:42 +00:00
Eliot Jones
2e5c995322
make external nodes different to document nodes and finish reimplementation
2019-12-05 13:21:19 +00:00
Eliot Jones
2ea71ce3bb
fix off-by-one error in format 4 cmap subtable for truetype #91
2019-12-05 12:21:58 +00:00
Eliot Jones
ecf0b8743b
make bookmarknode immutable and use scanner when retrieving bookmarks
2019-12-05 12:03:30 +00:00
Eliot Jones
928347bcce
merge pull request #84 from BobLd/master
...
add basic bookmarks extraction capabilities.
2019-12-04 14:24:10 +00:00
Eliot Jones
a967e0898a
handle missing width and height correctly for compact font format fonts #75
2019-12-04 14:19:06 +00:00
Eliot Jones
8a51795e99
update codecov version for azure pipeline
2019-11-27 16:45:05 +00:00
Eliot Jones
80f024dbed
make form access public
2019-11-27 16:36:25 +00:00
Eliot Jones
df3cb43cfc
update coverage libraries
2019-11-27 16:16:11 +00:00
Eliot Jones
ed53773c7b
handle checked state of radio buttons and checkboxes
2019-11-27 15:34:28 +00:00
Eliot Jones
910e22a4e9
wrap checkboxes and radiobuttons in their own form field types with access to the child collections
2019-11-26 16:33:24 +00:00
BobLd
9da0623fab
Merge branch 'master' of https://github.com/UglyToad/PdfPig
2019-11-26 12:16:43 +00:00
Eliot Jones
677d2b5e8f
#82 make resource store state local to the page and operation being processed
...
resources such as fonts are linked to page content operations using name labels, e.g. "/F1", these resource labels can be reassigned on different pages or inside form xobjects. we now clear the entire resource state for each page which is parsed and after form xobject operations which use resource dictionaries.
2019-11-25 14:34:02 +00:00
Eliot Jones
9028f932b2
#83 decrypt dictionary hex values
2019-11-25 12:42:32 +00:00
BobLd
89daa2818e
update PublicApiScannerTests
2019-11-04 15:17:25 +00:00
BobLd
a8559c1167
Add basic bookmarks extraction capabilities.
2019-11-04 15:11:54 +00:00
Eliot Jones
ef6d509f44
Merge pull request #80 from BobLd/master
...
Enhancing NearestNeighbourWordExtractor
2019-11-04 09:56:21 +00:00
BobLd
99f260befb
Enhancing NearestNeighbourWordExtractor
...
- Making the code easier to read
- Using 20% of Width instead of 60%
- Making DefaultWordExtractor public
2019-10-21 20:51:27 +01:00
Eliot Jones
0e39c88008
Merge pull request #77 from BobLd/master
...
AltoDocument: make all `xxxSpecified` setters public to allow `Deserialize`
2019-10-20 12:58:01 +01:00
BobLd
0b2a0f4bc7
AltoDocument: make all xxxSpecified
setter public to allow Deserialize
.
2019-10-20 12:25:34 +01:00
Eliot Jones
80fc404b10
#47 improve performance by caching truetype bounding boxes
...
also uses less reflection when parsing the page content stream
2019-10-18 15:56:28 +01:00
Eliot Jones
84990722ca
#76 add infinite loop protection for brute force search
...
also treats 'm' or 'j' in endstream/endobj as a valid object number start character
2019-10-17 16:50:01 +01:00
Eliot Jones
efe7896824
#75 support vertical writing mode fonts
2019-10-17 15:57:04 +01:00
Eliot Jones
a2147902a0
merge pull request #72 from uglytoad/fix-export-formatting
...
fix export formatting
2019-10-17 11:28:06 +01:00
Eliot Jones
09b26c43e0
#74 add intersectswith method to rectangle
2019-10-17 11:21:49 +01:00
Eliot Jones
57dfee3211
move alto xml exporter to root export namespace
2019-10-17 10:46:43 +01:00
Eliot Jones
3f1321141a
#73 process xobject form content when extracting text and images
2019-10-16 14:59:16 +01:00
Eliot Jones
6174877892
#71 ignore malformed dates in true type header table. fix reading of dates from bytes
2019-10-16 10:51:02 +01:00
Eliot Jones
f14c52a05a
fix tests for renaming and validating generate alto xml
2019-10-15 13:59:09 +01:00
Eliot Jones
d68bd88824
format and tidy up alto export autogenerated code. tidy up docstrum
2019-10-14 18:30:18 +01:00
Eliot Jones
e2c9db8d50
merge pull request #69 from BobLd/master
...
Support for hORC, Atlo Xml and Page Xml output formats
2019-10-14 10:10:47 +01:00
BobLd
e76badaeaf
Update PublicApiScannerTests with new public classes
2019-10-11 08:57:16 +01:00
BobLd
e9b3db7102
Make ITextExporter implementations public
2019-10-11 08:55:03 +01:00
BobLd
f886411e12
Merge https://github.com/UglyToad/PdfPig
2019-10-10 16:52:45 +01:00