Quantcast
Channel: Active questions tagged r - Stack Overflow
Viewing all articles
Browse latest Browse all 211971

Converting from NCR to Unicode in R

$
0
0

I have some html files (I'm working with them as plain texts) that utilize decimal NCR to encode special characters. Is there a way to convert them conveniently to Unicode using R?
NCR codes does not always have one-on-one match with unicode and it becomes quite confusing, since ѣ is equal not to \u1123, but to \u0463:

> stri_unescape_unicode("\u1123")
[1] "ᄣ"

and

> stri_unescape_unicode("\u0463")
[1] "ѣ"

Viewing all articles
Browse latest Browse all 211971

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>