#512 rst2html incorrectly converts tables with Combining Diacritical Marks into HTML

Status: open-fixed
Owner: nobody
Labels: rst parser (3)
Priority: 5
Updated: 2025-09-22
Created: 2025-09-16
Creator: CHÉN Zhé
Private: No

Content of reST file:

+----+
| ç̌ |
+----+

Expected results:

<table border="1" class="docutils">
  <colgroup>
    <col width="100%" />
  </colgroup>
  <tbody valign="top">
    <tr><td>ç̌</td>
    </tr>
  </tbody>
</table>

Actual results:

<table border="1" class="docutils">
  <colgroup>
    <col width="100%" />
  </colgroup>
  <tbody valign="top">
    <tr><td>ç̌ |</td>
    </tr>
  </tbody>
</table>

3 Attachments

Related

Bugs: #128

Discussion

  • CHÉN Zhé

    CHÉN Zhé - 2025-09-16

    “ç̌” is composed of two Unicode code points: a c-cedilla followed by a combining caron.

    U+00E7 : LATIN SMALL LETTER C WITH CEDILLA
    U+030C : COMBINING CARON {hacek, V above}
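
The decomposition above can be checked with Python's standard `unicodedata` module. This is only an illustrative sketch; the helper name `describe` is hypothetical and not part of Docutils.

```python
import unicodedata

# c-cedilla followed by a combining caron, as in the report
text = "\u00e7\u030c"

def describe(s):
    """Return (code point, Unicode name, combining class) for each character in s.

    Hypothetical helper for illustration; a non-zero combining class
    marks a combining character.
    """
    return [
        (f"U+{ord(ch):04X}", unicodedata.name(ch), unicodedata.combining(ch))
        for ch in s
    ]

for row in describe(text):
    print(row)
```

The string renders as one glyph but has a length of 2, and only the second code point has a non-zero combining class.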
    
     
  • Günter Milde

    Günter Milde - 2025-09-17

    Thank you for the report.
    Combining characters in grid tables are known to be problematic; see [bugs:#128].
    Note that the intended behaviour is to produce the "expected" result from the following input, where the border matches the visible width of the row:

    +---+
    | ç̌ |
    +---+
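
A minimal sketch of why the two border widths differ, assuming a simplified notion of display width in which combining characters occupy no column. The function `display_width` is a hypothetical illustration, not the Docutils implementation, and it ignores other cases such as East Asian wide characters.

```python
import unicodedata

def display_width(s):
    """Count the characters that occupy a column, skipping combining marks.

    Simplified assumption: every non-combining character is one column wide.
    """
    return sum(1 for ch in s if unicodedata.combining(ch) == 0)

row = "| \u00e7\u030c |"       # the table row from the report
narrow_border = "+---+"        # border from the maintainer's example
wide_border = "+----+"         # border from the original report

print(len(row), display_width(row))
print(display_width(row) == len(narrow_border))
print(len(row) == len(wide_border))
```

Counted in code points, the row lines up with the four-dash border of the original report; counted in columns, it lines up with the three-dash border shown above.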
    

    For simple tables, you can use the simple table syntax, where this problem is fixed:

    = ==
    ç̌ OK
    = ==
    
     

  • CHÉN Zhé

    CHÉN Zhé - 2025-09-17

    Thanks, I’ll give it a try. 👍🏻

     
  • Günter Milde

    Günter Milde - 2025-09-19
    • labels: rest, tables --> rst parser
     
  • Günter Milde

    Günter Milde - 2025-09-22
    • status: open --> open-fixed
     
