path: root/src/Text/Pandoc/Readers/HTML/Parsing.hs
Age | Commit message | Author
2023-10-19 | HTML reader: allow th to close td and vice versa. | John MacFarlane
Closes #9090.
2023-01-10 | Update copyright years, it's 2023! | Albert Krewinkel
2022-01-02 | Copyright notices: update for 2022 | Albert Krewinkel
2021-11-24 | HTML reader: parse attributes on links and images. | John MacFarlane
Closes #6970.
2021-08-10 | HTML reader: treat comments as blank when parsing. | John MacFarlane
This modifies pBlank. Previously comments could sometimes flummox the parser. Closes #7482.
2021-07-06 | HTML reader: add col, colgroup to 'closes' definitions | John MacFarlane
2021-01-08 | Update copyright notices for 2021 (#7012) | Albert Krewinkel
2020-12-10 | HTML reader: retain attribute prefixes and avoid duplicates. | John MacFarlane
Previously we stripped attribute prefixes, reading `xml:lang` as `lang` for example. This resulted in two duplicate `lang` attributes when `xml:lang` and `lang` were both used. This commit causes the prefixes to be retained, and also avoids invalid duplicate attributes. Closes #6938.
2020-11-26 | HTML reader: improve support for table headers, footer, attributes | Albert Krewinkel
- `<tfoot>` elements are no longer added to the table body but used as table footer.
- Separate `<tbody>` elements are no longer combined into one.
- Attributes on `<thead>`, `<tbody>`, `<th>`/`<td>`, and `<tfoot>` elements are preserved.
2020-11-26 | HTML reader: allow finer grained options for tag omission | Albert Krewinkel
2020-11-24 | HTML reader: extract table parsing into separate module | Albert Krewinkel
2020-11-23 | HTML reader: extract submodules | Albert Krewinkel
Reducing module size should reduce memory use during compilation. This is preparatory work to tackle support for more table features.