Skip to main content
added 189 characters in body; edited body; added 1 characters in body
Source Link
user425727
user425727

I came across two special characters which seem not to be covered by the ISO-8859-1 character set i.e. they don't make it through to my program.

The German ß and the Norwegian ø

i'm reading the files as follows:

FileInputStream inputFile = new FileInputStream(corpus[i]); InputStreamReader ir = new InputStreamReader(inputFile, "ISO-8859-1") ; 

Is there a way for me to read these characters without having to apply manual replacement as a workaround?

[EDIT]

this is how it looks on screen. Note that i have no problems with other accents e.g. è and the lot...

enter image description here

I came across two special characters which seem not to be covered by the ISO-8859-1 character set i.e. they don't make it through to my program.

The German ß and the Norwegian ø

i'm reading the files as follows:

FileInputStream inputFile = new FileInputStream(corpus[i]); InputStreamReader ir = new InputStreamReader(inputFile, "ISO-8859-1") ; 

Is there a way for me to read these characters without having to apply manual replacement as a workaround?

I came across two special characters which seem not to be covered by the ISO-8859-1 character set i.e. they don't make it through to my program.

The German ß and the Norwegian ø

i'm reading the files as follows:

FileInputStream inputFile = new FileInputStream(corpus[i]); InputStreamReader ir = new InputStreamReader(inputFile, "ISO-8859-1") ; 

Is there a way for me to read these characters without having to apply manual replacement as a workaround?

[EDIT]

this is how it looks on screen. Note that i have no problems with other accents e.g. è and the lot...

enter image description here

Source Link
user425727
user425727

reading file with accented characters in Java

I came across two special characters which seem not to be covered by the ISO-8859-1 character set i.e. they don't make it through to my program.

The German ß and the Norwegian ø

i'm reading the files as follows:

FileInputStream inputFile = new FileInputStream(corpus[i]); InputStreamReader ir = new InputStreamReader(inputFile, "ISO-8859-1") ; 

Is there a way for me to read these characters without having to apply manual replacement as a workaround?