pandoc - MSM's mirror of Pandoc

Age	Commit message (Collapse)	Author
2023-09-06	Org reader: factor out orgAnchor -> Org.Parsing.	John MacFarlane
	A purely internal change. We will use this both in inline and block parsing.
2023-08-26	Org reader: allow escaping commas in macro arguments	Amneesh Singh
	Signed-off-by: Amneesh Singh <natto@weirdnatto.in>
2023-03-22	Org reader: Allow zero width space as an escape character	Christian Christiansen
	Allow the character U+200B to be used as an escape character as described in the Org-mode documentation https://orgmode.org/manual/Escape-Character.html Closes issue #8716.
2023-01-10	Update copyright years, it's 2023!	Albert Krewinkel

2022-10-29	T.P.Parsing.General: change `characterReference`, `charsInBalanced`.	John MacFarlane
	`characterReference` now returns a Text (as it should, because some named references don't correspond to a single Char), and uses the `lookupEntity` function from commonmark-hs instead of the slow one from tagsoup. `charsInBalanced` now takes a Text parser rather than a Char parser as argument. [API change]
2022-10-10	Org reader: make #+pandoc-emphasis-pre work as expected. (#8360)	Amir Dekel
	So far, `orgStateLastPreCharPos` wasn't updated appropriately after each parsing to native Str (by the parser `str`). In addition to solving this, the guard `notAfterString` in `emphasisStart` is removed to allow emphasis after Str at the first place.
2022-09-19	Org reader: Allow org-ref v2 citations with `&` prefix.	John MacFarlane
	Closes #8302.
2022-01-17	Fix some haddock errors.	John MacFarlane

2022-01-02	Copyright notices: update for 2022	Albert Krewinkel

2021-12-14	Org reader: parse official org-cite citations.	John MacFarlane
	We also support the older org-ref style as a fallback. We no longer support the "markdown-style" citations. See #7329.
2021-12-14	Org reader: remove support for "Berkeley style" citations.	John MacFarlane
	See #7329.
2021-05-13	Implement curly-brace syntax for Markdown citation keys.	John MacFarlane
	The change provides a way to use citation keys that contain special characters not usable with the standard citation key syntax. Example: `@{foo_bar{x}'}` for the key `foo_bar{x}`. Closes #6026. The change requires adding a new parameter to the `citeKey` parser from Text.Pandoc.Parsing [API change]. Markdown reader: recognize @{..} syntax for citatinos. Markdown writer: use @{..} syntax for citations when needed. Update manual with curly-brace syntax for citations. Closes #6026.
2021-05-09	Change reader types, allowing better tracking of source positions.	John MacFarlane
	Previously, when multiple file arguments were provided, pandoc simply concatenated them and passed the contents to the readers, which took a Text argument. As a result, the readers had no way of knowing which file was the source of any particular bit of text. This meant that we couldn't report accurate source positions on errors or include accurate source positions as attributes in the AST. More seriously, it meant that we couldn't resolve resource paths relative to the files containing them (see e.g. #5501, #6632, #6384, #3752). Add Text.Pandoc.Sources (exported module), with a `Sources` type and a `ToSources` class. A `Sources` wraps a list of `(SourcePos, Text)` pairs. [API change] A parsec `Stream` instance is provided for `Sources`. The module also exports versions of parsec's `satisfy` and other Char parsers that track source positions accurately from a `Sources` stream (or any instance of the new `UpdateSourcePos` class). Text.Pandoc.Parsing now exports these modified Char parsers instead of the ones parsec provides. Modified parsers to use a `Sources` as stream [API change]. The readers that previously took a `Text` argument have been modified to take any instance of `ToSources`. So, they may still be used with a `Text`, but they can also be used with a `Sources` object. In Text.Pandoc.Error, modified the constructor PandocParsecError to take a `Sources` rather than a `Text` as first argument, so parse error locations can be accurately reported. T.P.Error: showPos, do not print "-" as source name.
2021-02-18	Org reader: fix bug in org-ref citation parsing.	Albert Krewinkel
	The org-ref syntax allows to list multiple citations separated by comma. This fixes a bug that accepted commas as part of the citation id, so all citation lists were parsed as one single citation. Fixes: #7101
2021-01-08	Update copyright notices for 2021 (#7012)	Albert Krewinkel

2021-01-03	Org reader: mark verbatim code with class "verbatim". (#6998)	Dimitri Sabadie
	* Replace org-mode’s verbatim from code to codeWith. This adds the `"verbatim"` class so that exporters can apply a specific style on it. For instance, it will be possible for HTML to add a CSS rule for code + verbatim class. * Alter test for org-mode’s verbatim change. See previous commit for further detail on the new implementation.
2020-12-05	Org reader: preserve targets of spurious links	Albert Krewinkel
	Links with (internal) targets that the reader doesn't know about are converted into emphasized text. Information on the link target is now preserved by wrapping the text in a Span of class `spurious-link`, with an attribute `target` set to the link's original target. This allows to recover and fix broken or unknown links with filters. See: #6916
2020-06-30	Org reader: respect export setting disabling footnotes	Albert Krewinkel
	Footnotes can be removed from the final document with the `#+OPTION: f:nil` export setting.
2020-06-30	Org reader: respect export setting which disables entities	Albert Krewinkel
	MathML-like entities, e.g., `\alpha`, can be disabled with the `#+OPTION: e:nil` export setting.
2020-06-25	Org reader: honor tex export option	Albert Krewinkel
	The `tex` export option can be set with `#+OPTION: tex:nil` and allows three settings: - `t` causes LaTeX fragments to be parsed as TeX or added as raw TeX, - `nil` removes all LaTeX fragments from the document, and - `verbatim` treats LaTeX as text. The default is `t`. Closes: #4070
2020-04-28	Support new Underline element in readers and writers (#6277)	Vaibhav Sagar
	Deprecate `underlineSpan` in Shared in favor of `Text.Pandoc.Builder.underline`.
2020-03-22	Finer grained imports of Text.Pandoc.Class submodules (#6203)	Albert Krewinkel
	This should speed-up recompilation after changes in `Text.Pandoc.Class`, as the number of modules affected by a change will be smaller in general. It also offers faster insights into the parts of `T.P.Class` used within a module.
2020-03-15	Use implicit Prelude (#6187)	Albert Krewinkel
	* Use implicit Prelude The previous behavior was introduced as a fix for #4464. It seems that this change alone did not fix the issue, and `stack ghci` and `cabal repl` only work with GHC 8.4.1 or newer, as no custom Prelude is loaded for these versions. Given this, it seems cleaner to revert to the implicit Prelude. * PandocMonad: remove outdated check for base version Only base versions 4.9 and later are supported, the check for `MIN_VERSION_base(4,8,0)` is therefore unnecessary. * Always use custom prelude Previously, the custom prelude was used only with older GHC versions, as a workaround for problems with ghci. The ghci problems are resolved by replacing package `base` with `base-noprelude`, allowing for consistent use of the custom prelude across all GHC versions.
2020-03-13	Update copyright year (#6186)	Albert Krewinkel
	* Update copyright year * Copyright: add notes for Lua and Jira modules
2020-02-08	Org reader: simplify parsing of sub- and superscripts	Albert Krewinkel
	Speeds up parsing of single-word, markup-less sub- and superscripts. Fixes: #6127
2019-11-20	Fix typos (#5919)	Brian Wignall

2019-11-12	Switch to new pandoc-types and use Text instead of String [API change].	despresc
	PR #5884. + Use pandoc-types 1.20 and texmath 0.12. + Text is now used instead of String, with a few exceptions. + In the MediaBag module, some of the types using Strings were switched to use FilePath instead (not Text). + In the Parsing module, new parsers `manyChar`, `many1Char`, `manyTillChar`, `many1TillChar`, `many1Till`, `manyUntil`, `mantyUntilChar` have been added: these are like their unsuffixed counterparts but pack some or all of their output. + `glob` in Text.Pandoc.Class still takes String since it seems to be intended as an interface to Glob, which uses strings. It seems to be used only once in the package, in the EPUB writer, so that is not hard to change.
2019-05-05	Org reader: prefer plain symbols over math symbols	Albert Krewinkel
	Symbols like `\alpha` are output plain and unemphasized, not as math. Fixes: #5483
2019-03-01	Remove license boilerplate.	John MacFarlane
	The haddock module header contains essentially the same information, so the boilerplate is redundant and just one more thing to get out of sync.
2019-02-04	Add missing copyright notices and remove license boilerplate (#5112)	Albert Krewinkel
	Quite a few modules were missing copyright notices. This commit adds copyright notices everywhere via haddock module headers. The old license boilerplate comment is redundant with this and has been removed. Update copyright years to 2019. Closes #4592.
2019-01-01	Org reader: fix self-link parsing regression	Albert Krewinkel
	Fixes a regression introduced by the previous commit.
2019-01-01	Org reader: fix treatment of links to images	Albert Krewinkel
	Links with descriptions which are pointing to images are no longer read as inline images, but as proper links. Fixes: #5191
2019-01-01	Org reader: hlint	Albert Krewinkel

2018-10-22	Add enclosedByPair1 and change relevant invocations.	leungbk

2018-09-28	Parse empty argument array in inline src blocks.	leungbk
	`enclosedByPair` alone does not the handle the empty array properly since it uses `many1Till`.
2018-09-26	Force inline code blocks to honor export options.	leungbk
	`exportsCode` is moved from `Blocks.hs` to `Shared.hs` and exported accordingly.
2018-07-02	Spellcheck comments	Alexander Krotov

2018-03-18	Removed unnecessary import.	John MacFarlane

2018-03-18	Use NoImplicitPrelude and explicitly import Prelude.	John MacFarlane
	This seems to be necessary if we are to use our custom Prelude with ghci. Closes #4464.
2018-03-16	Monoid/Semiground cleanup relying on custom Prelude.	John MacFarlane

2018-03-13	Require pandoc-types 1.17.4.	John MacFarlane
	And a few tweaks related to the Semigroups/Monoid change. Closes #4448.
2018-02-21	Org reader: allow changing emphasis syntax	Albert Krewinkel
	The characters allowed before and after emphasis can be configured via `#+pandoc-emphasis-pre` and `#+pandoc-emphasis-post`, respectively. This allows to change which strings are recognized as emphasized text on a per-document or even per-paragraph basis. The allowed characters must be given as (Haskell) string. #+pandoc-emphasis-pre: "-\t ('\"{" #+pandoc-emphasis-post: "-\t\n .,:!?;'\")}[" If the argument cannot be read as a string, the default value is restored. Closes: #4378
2018-01-05	Update copyright notices to include 2018	Albert Krewinkel

2017-10-27	Consistent underline for Readers (#2270)	hftf
	* Added underlineSpan builder function. This can be easily updated if needed. The purpose is for Readers to transform underlines consistently. * Docx Reader: Use underlineSpan and update test * Org Reader: Use underlineSpan and add test * Textile Reader: Use underlineSpan and add test case * Txt2Tags Reader: Use underlineSpan and update test * HTML Reader: Use underlineSpan and add test case
2017-10-02	Org reader: support `\n` export option	Albert Krewinkel
	The `\n` export option turns all newlines in the text into hard linebreaks. Closes #3950
2017-09-25	Org reader: update emphasis border chars	Albert Krewinkel
	The org reader was updated to match current org-mode behavior: the set of characters which are acceptable to occur as the first or last character in an org emphasis have been changed and now allows all non-whitespace chars at the inner border of emphasized text (see `org-emphasis-regexp-components`). Fixes: #3933
2017-07-07	Rewrote LaTeX reader with proper tokenization.	John MacFarlane
	This rewrite is primarily motivated by the need to get macros working properly. A side benefit is that the reader is significantly faster (27s -> 19s in one benchmark, and there is a lot of room for further optimization). We now tokenize the input text, then parse the token stream. Macros modify the token stream, so they should now be effective in any context, including math. Thus, we no longer need the clunky macro processing capacities of texmath. A custom state LaTeXState is used instead of ParserState. This, plus the tokenization, will require some rewriting of the exported functions rawLaTeXInline, inlineCommand, rawLaTeXBlock. * Added Text.Pandoc.Readers.LaTeX.Types (new exported module). Exports Macro, Tok, TokType, Line, Column. [API change] * Text.Pandoc.Parsing: adjusted type of `insertIncludedFile` so it can be used with token parser. * Removed old texmath macro stuff from Parsing. Use Macro from Text.Pandoc.Readers.LaTeX.Types instead. * Removed texmath macro material from Markdown reader. * Changed types for Text.Pandoc.Readers.LaTeX's rawLaTeXInline and rawLaTeXBlock. (Both now return a String, and they are polymorphic in state.) * Added orgMacros field to OrgState. [API change] * Removed readerApplyMacros from ReaderOptions. Now we just check the `latex_macros` reader extension. * Allow `\newcommand\foo{blah}` without braces. Fixes #1390. Fixes #2118. Fixes #3236. Fixes #3779. Fixes #934. Fixes #982.
2017-06-03	Improve code style in lua and org modules	Albert Krewinkel

2017-06-03	Org reader: apply hlint suggestions	Albert Krewinkel

2017-05-31	Org reader: fix module names in haddock comments	Albert Krewinkel
	Copy-pasting had lead to haddock module descriptions containing the wrong module names.