Support #705

How to UTF-8 characters in FDSN StationXML

Added by Ellen Yu about 9 years ago. Updated almost 9 years ago.

Start date:
Due date:
% Done:


Estimated time:
(Total: 0.00 h)


In SIS we allow unicode characters. We understand that FDSN StationXML uses ASCII as a standard.
If in our system the string is "Puerto PeƱasco" and we want to write it out in FDSN StationXML that could be converted to dataless
can we write it in the FDSN StationXML out as:

Puerto Peñasco

And the converter write it as whatever is appropriate in dataless?
I believe dataless does not accept & -- but at least users of FDSNStationXML would be able to get back to unicode format.

Please advise.



Bug #407: 8bits ascii char in datalessClosedYazan Suleiman


#1 Updated by Yazan Suleiman almost 9 years ago

Can you please provide an example xml file

#2 Updated by Chad Trabant almost 9 years ago

  • Assignee set to Yazan Suleiman

Hi Ellen,

As you state, the non-ASCII characters in StationXML description fields are not allowed. That being said, I would expect this to be a common issue for conversion.

The safest thing for SIS would be to ASCII-ify the strings before giving them to the converter so you do it the way you want.

We will investigate how to map the UTF-8 strings to ASCII when creating dataless SEED, in order to try and avoid making invalid SEED and warn the user of any conversion.

I do not see how we can ever reverse that process without making the ASCII version of the string very ugly by including escaped Unicode in the strings.

Also available in: Atom PDF