PdfPig

lsm/PdfPig

mirror of https://github.com/UglyToad/PdfPig.git synced 2026-03-10 00:23:29 +08:00

Author	SHA1	Message	Date
BobLd	d1e8b42877	Check and handle circular references when processing XObject forms and fix #671	2023-08-05 20:05:47 +01:00
BobLd	94cc9be967	Fix regression introduced in #561 where paths were not clipped correctly for rotated pages due to initial clipping path not being transformed	2023-06-20 23:47:52 +01:00
Eliot Jones	9d2b3f914d	account for skipmissingfonts in positioned text #637	2023-06-04 11:47:30 +01:00
Eliot Jones	fba1cbc13c	skip missing objects if skip fonts is true #298 if skip missing fonts is set we want to read the file as much as possible so we will also skip any missing xobjects like images, forms or postscript code	2023-05-27 10:46:29 +01:00
BobLd	a4284aa5a8	Implement Pattern color space and Shading, seal IColor classes, stop using decimal in colors and use double instead	2023-05-18 20:24:55 +01:00
BobLd	8c08aa2efe	Implement Lab and DeviceN color spaces and fix bug in SetNonStrokingColorspace() for Transparency Group XObjects	2023-04-16 19:34:38 +01:00
BobLd	9eb791ae58	Fix integration tests for #595	2023-04-14 13:16:54 +01:00
BobLD	b8a98fbed2	Properly implement color spaces	2023-04-12 07:25:09 +01:00
BobLD	fc3f27fd18	Update CurrentGraphicsState with AlphaConstantNonStroking and AlphaConstantStroking and implement more named graphics state	2023-03-19 13:23:06 +00:00
mvantzet	9d15930401	Cleaned up usings, log warning when using user space unit other than 1, removed comment	2023-03-15 22:31:31 +01:00
mvantzet	ea77156eb8	Changes for annotation positions: - Pass in the initial matrix to the annotation provider, so that it can return the correct rectangles / quad points. - Made a change / extensions to the Annotation class: - ModifiedDate is now a DateTimeOffset instead of unparsed string. If the string is invalid, ModifiedDate is set to the default value. - Added lookup for the "appearance streams"; all the annotations should have a "N" (normal) appearance, and optionally have a "R" (roll-over/hover) and "D" (down/click) appearance. Did not expose the actual stream objects, but added a flag indicating the existence of "R" / "D". At some point we can consider doing something with the appearances. - Changed signature of GetInitialMatrix / ContentStreamProcessor constructor from PdfRectangle back to what it was earlier, namely MediaBox and CropBox, to prevent accidentally mixing the two up in the caller.	2023-03-13 18:15:24 +01:00
mvantzet	0413f3f1bf	Fix related to page sizes / rotation / coordinate transformations (issue 560): The initial transformation matrix was incorrect, as it translated by the cropbox width/height instead of by the cropbox left/bottom offsets. Also, it did not translate the results back into the 1st quadrant so that (0,0) would (again) be the lower left corner origin for the cropped area. Added unit tests in new file ContentStreamProcessorTests. EFFECTIVE CHANGES: - The coordinates used for letters etc. are different now for rotated and/or cropped pages, but as those were not very consistent anyway this is probably OK. - The Page Size (A4, A3, Custom, etc.), Width and Height are now determined by the CropBox, not by the MediaBox; the CropBox ultimately determines what you see on screen and is printable. If no cropbox is defined in the PDF, it is set to the MediaBox; so in that case it is backwards compatible with the old code. - The Page MediaBox and CropBox properties are no longer rotated according to Page.Rotation. The Page Width and Height do take rotation into account (kept it backward compatible).	2023-03-09 16:42:09 +01:00
mvantzet	06253966e4	Added Letter properties RenderingMode, StrokeColor, FillColor and added those as mandatory constructor arguments. Kept property Color, which contains either StrokeColor (if rendering mode is Stroke) or FillColor (for all other rendering modes). In PdfPageBuilder opted for default text rendering mode "Fill" which seems like a sensible default.	2023-01-13 12:35:25 +01:00
Eliot Jones	c8874c5984	#483 make skip missing fonts even more resilient to nonsense files	2022-12-11 16:18:09 -05:00
Eliot Jones	e2246a88bb	#482 add skip missing fonts option and pass parsing options to content stream processor this doesn't fix the reported issue since the pdf itself is corrupted on page 8 however it will allow recovery in some scenarios where text content isn't important. also adds more informative error when stream unintentionally passed with non zero offset	2022-10-09 13:44:05 -04:00
Eliot Jones	eb0758f050	only combine when it forms part of the same byte sequence	2022-04-14 20:22:49 -04:00
Eliot Jones	b5b15ee593	add handling for combining diacritics	2022-04-14 20:14:09 -04:00
Eliot Jones	9ae0a5ec15	allow stream filters to contain indirect references to name tokens	2021-04-25 16:22:22 -04:00
BobLd	eb85f67b18	Remove CurrentGraphicsState GetCurrentState() from IOperationContext.	2020-11-23 11:11:07 +00:00
BobLd	d07baa97d5	Remove reference from CurrentSubpath and CurrentPath in IOperationContext and add MoveTo(), BezierCurveTo(), LineTo() and Rectangle().	2020-11-23 10:49:50 +00:00
BobLd	cd9ac6ac6c	- fix letter's PointSize computation by applying the transform to a rectangle of height fontSize - add test with rotated letters	2020-10-12 12:59:02 +01:00
BobLd	8f0f7769a6	fix clipping error when trying to fill a single line; add log; set EvenOdd as default in initiate CurrentClippingPath; add tests	2020-09-22 10:47:34 +01:00
BobLd	33f92cd11c	handle page rotation by updating initial TransformationMatrix	2020-06-02 16:12:24 +01:00
BobLd	6e773446df	simplify double cast	2020-06-01 14:55:45 +01:00
BobLd	2d9a4e5adb	fix CurrentTransformationMatrix multiplication order in ProcessFormXObject	2020-06-01 14:00:17 +01:00
Eliot Jones	09b951f667	expose font details on individual letters also fixes a regression for image extraction	2020-04-25 17:15:26 +01:00
Eliot Jones	f18bc0766a	#161 handle zero point size by using rotated matrix	2020-04-19 10:28:11 +01:00
Eliot Jones	25314cc79d	#161 change rotation to fix values and page size this doesn't account for images and pdf paths yet.	2020-04-18 18:04:41 +01:00
Eliot Jones	db442194c3	use a mutable struct	2020-04-18 12:10:17 +01:00
Eliot Jones	e382e581ba	add merge test for document with object stream	2020-04-16 20:57:57 +01:00
BobLd	ec2dcdc9f4	Check if CurrentSubpath is null in CloseSubpath()	2020-04-05 17:58:57 +01:00
BobLd	b923a42f9e	Check if CurrentSubpath null before CloseSubpath	2020-04-05 17:58:57 +01:00
BobLd	20c4b9594b	Rename PdfSubpath.ClosePath() to PdfSubpath.CloseSubpath() to avoid confusion	2020-04-05 17:58:57 +01:00
BobLd	04300eb12c	Add PdfSubpath comment	2020-04-05 17:58:57 +01:00
BobLd	064fa4922a	make Clipping internal do not throw errors when CurrentPath is null modify tests to match	2020-04-05 17:58:57 +01:00
BobLd	51165dc11a	Implement EndPath Make path clipping optional	2020-04-05 17:58:57 +01:00
BobLd	983cfcb2f6	Simplify path construction operators fix 're' operator to reflect documentation Update ContentStreamProcessor with fill, stroke and clip operations Throw errors when currentPosition is null in PdfSubpath	2020-04-05 17:58:57 +01:00
BobLd	3ee9ac7915	Implement FillStrokePath() operator and filling rule.	2020-04-05 17:58:57 +01:00
BobLd	43b40da5d5	Change Subpath to path where necessary	2020-04-05 17:58:57 +01:00
BobLd	6677641b37	Create PdfPath Rename ClippingRule to FillingRule Move FillingRule from Subpath to Path	2020-04-05 17:58:57 +01:00
BobLd	ab6a0f11fc	Change name from PdfPath to PdfSubpath	2020-04-05 17:58:57 +01:00
Eliot Jones	48d166276d	remove islenientparsing from contentstreamprocessor	2020-02-28 11:44:13 +00:00
Eliot Jones	7b09999a3f	remove islenientparsing from the font handlers we're removing islenientparsing to make the code simpler to maintain and use as well as more resilient.	2020-02-28 11:37:18 +00:00
BobLd	0afaa19d15	Handle null CurrentPath	2020-02-24 11:20:56 +00:00
BobLd	1d095af974	Implement Modify Clipping operations	2020-02-24 11:20:56 +00:00
BobLd	588648d30b	Fix #133 Marked content extraction issue	2020-02-10 11:23:19 +00:00
Eliot Jones	29061b1fd2	handle unexpected adobe type 1 format an encoding array in an adobe type 1 font may be missing its declaration ending in 'for', if we encounter 'dup' while looking for the 'for' token we have a special case to go straight into reading the encoding. also handles a case where the page content stream contains a path-closing operator without any path being active.	2020-01-28 16:05:53 +00:00
Eliot Jones	ba09a13d08	more end image recovery logic since inline image data may contain the end image "ei" token inside the data stream there's no reliable way to actually determine if we've read all the data. for this reason if we end up with an invalid state parsing operations after we've read the end image token we try to recover by reading from the previous token to the next end image token if any. we supply log information to let the consumer know this is what we're doing. it's still not bullet-proof but it should be good enough. also support negative page rotation values by adding them to a 360 degree rotation so -90 degrees clockwise is 270 degrees clockwise.	2020-01-25 15:53:08 +00:00
Eliot Jones	b4d917dcdc	merge pull request #122 from uglytoad/marked-content marked content	2020-01-10 17:07:21 +00:00
Eliot Jones	41cc7abd1b	prevent negative point size for fonts	2020-01-10 14:40:28 +00:00

1 2

91 Commits