linux - Lowercase all text except xml tags -
i've got large number of tagged strings:
watch <team>philly's</team> game what's on <time>wednesday night 8 o'clock</time>
i lowercase text except xml tags. i.e.
watch <team>philly's</team> game what's on <time>wednesday night 8 o'clock</time>
i can lower case text using awk:
awk '{print tolower($0)}' file.txt
but have no idea how avoid xml tags. languages/tools welcome.
this sed (gnu) one-liner may help:
sed -r 's/([^<>]*)($|<)/\l\1\e\2/g'
with example:
kent$ echo "watch <team>philly's</team> game what's on <time>wednesday night 8 o'clock</time>"|sed -r 's/([^<>]*)($|<)/\l\1\e\2/g' watch <team>philly's</team> game what's on <time>wednesday night 8 o'clock</time>
Comments
Post a Comment