File URI I18N

From Offset
Revision as of 21:52, 28 May 2013 by Mjb (talk | contribs) (+category)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

File URI Internationalization (and character mapping)

Sometimes, it is desirable to take a web of material and move it to the local file system. This works OK sometimes, but there are problems when the original web (whether from HTTP or a file system) contains characters that are not allowed or supported in the local file system.

There is a two-way mapping between file: URIs and OS paths.

For URI -> OS path, the mapping should attempt to map URI characters into OS path characters, if they are allowed. If not, then a file system encoding may be necessary.

(If the file system has limited capabilities, then the translation may fail; I don't know any way around that.)

For OS path -> URI, it is important to reverse the transformation.