unoffice

Reclaim text from office documents
git clone https://logand.com/git/unoffice.git/
Log | Files | Refs | README

unabw (237B)


      1 #!/usr/bin/env bash
      2 set -euo pipefail
      3 sed 's/<p[^<\/]*>//g' "$1" \
      4     | sed 's/<[^<]*>//g' \
      5     | sed 's/&lt;/</g' \
      6     | sed 's/&gt;/>/g' \
      7     | sed "s/&apos;/'/g" \
      8     | sed 's/&quot;/"/g' \
      9     | sed 's/&amp;/&/g' \
     10     | cat -s