I followed the question in another post: Regex to capture LaTeX comments
The provided answer is awesome. However, it seems like it can only be used in the .net engine, but I want to use the PCRE engine. That is because I'm using sublime text and it seems like this engine is used by default. I tried many times, but without success.
The latex is
\usepackage{test}%COMMENT1
TEXT
%COMMENT2
TEXT
Value is 10\%, this should not be removed. %COMMENT3
begin{tikz}[
important 1,
%COMMENT4
important 2, %COMMENT5
]
TEXT
%COMMENT 6
TEXT
Table: value1&value2\\%COMMENT7
Table: value1&value2\\ %COMMENT8
Table: value1&value2 \\%COMMENT 9
Table: value1&value2\\% COMMENT 10
Table: value1&value2\\%COMMENT11
I tried (?m)(?<=(?<!\\)(?:\\{0}))%.*(?:\r?\n(?!\r?$))?
. Only works on comment 1-6,8.
The online results can be found https://regex101.com/r/zSIBMu/3
How should I modify it?
CodePudding user response:
With a capture group:
((?<!\\)(?:\\\\)*)%.*
and with $1
as replacement.
Or perhaps, if \K
is supported:
(?<!\\)(?:\\\\)*\K%.*
with an empty string as replacement.
Notice: I don't see any reason to remove the newline sequence even if the comment takes all the line.
CodePudding user response:
You might also make use of a SKIP FAIL approach:
\\(?<!\\.)(?>\\\\)*%(*SKIP)(*F)|%.*
The pattern matches:
\\(?<!\\.)
Match\
not preceded with\
(?>\\\\)*
Match optional pairs of\\
%(*SKIP)(*FAIL)
Match%
and skip the match|
Or%.*
Match%
and the rest of the line
Edit
A non lookaround solution suggested by Casimir et Hippolyte to skip the match for \\
or \%
\\[\\%](*SKIP)(*F)|%.*
\\[\\%]
Match either\\
or\%
using a character class(*SKIP)(*FAIL)
Skip the match|
Or%.*
Match%
and the rest of the line