[OAI-implementers] implementation of non-English characters
w/UTF-8?
Charles Blair
chas at jump.lib.uchicago.edu
Tue Sep 13 17:27:47 EDT 2005
> Meaning, we have non-English characters that are "choking" when we
> test our Data Provider. [Think "e" with the accent over it
> http://lib-app1.usc.edu:8085/oaidp?verb=GetRecord&identifier=oai:usc:digitalarchive:bh
> e/ bhe-m27&metadataPrefix=oai_dc (surname after first name of
> "Elmo").]
this seems to be encoded in iso-8859-1. a unix command like this will
convert it to utf-8:
recode -v ISO-8859-1/..UTF-8 < oaidp.xml > r-oaidp.xml
(recode is GNU recode.)
More information about the OAI-implementers
mailing list