
44 votes
4 answers
1k views

Java Unicode String length

I am trying hard to get the length of a Unicode string and have tried various options. It looks like a small problem, but I am stuck in a big way. Here I am trying to get the length of the string str1. I am getting ...
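One common source of confusion behind questions like this one is that `String.length()` counts UTF-16 code units, not characters as a reader sees them. The following is a minimal sketch of that difference, not an answer taken from the thread: the class name `UnicodeLengthSketch`, both helper methods, and the sample value assigned to `str1` are assumptions made for illustration.

```java
public class UnicodeLengthSketch {
    // Number of UTF-16 code units, which is what String.length() reports.
    static int codeUnits(String s) {
        return s.length();
    }

    // Number of Unicode code points; a surrogate pair counts once.
    static int codePoints(String s) {
        return s.codePointCount(0, s.length());
    }

    public static void main(String[] args) {
        // Hypothetical sample: 'a' followed by U+1D11E (MUSICAL SYMBOL G CLEF),
        // written as a surrogate pair so the source file encoding does not matter.
        String str1 = "a\uD834\uDD1E";
        System.out.println(codeUnits(str1));  // prints 3
        System.out.println(codePoints(str1)); // prints 2
    }
}
```

Neither count matches user-perceived characters when combining marks are involved; for that case `java.text.BreakIterator.getCharacterInstance()` is the usual starting point.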
22 votes
2 answers
5k views

Convert between string, u16string & u32string

I've been looking for a way to convert between the Unicode string types and came across this method. Not only do I not completely understand the method (there are no comments) but also the article ...
18 votes
8 answers
10k views

String To Lower/Upper in C++

What is the best way people have found to do String to Lower case / Upper case in C++? The issue is complicated by the fact that C++ isn't an English only programming language. Is there a good ...
11 votes
7 answers
3k views

Is there an STL string class that properly handles Unicode?

I know all about std::string and std::wstring but they don't seem to fully pay attention to extended character encoding of UTF-8 and UTF-16 (On windows at least). There is also no support for UTF-32. ...
11 votes
6 answers
9k views

How to work with unicode in Python

I am trying to clean all of the HTML out of a string so the final output is a text file. I have done some research on the various 'converters' and am starting to lean towards creating my own ...
11 votes
3 answers
9k views

How I can print the wchar_t values to console?

Example: #include <iostream> using namespace std; int main() { wchar_t en[] = L"Hello"; wchar_t ru[] = L"Привет"; //Russian language cout << ru << endl ...
9 votes
3 answers
1k views

UTF-16 string terminator

What is the string terminator sequence for a UTF-16 string? EDIT: Let me rephrase the question in an attempt to clarify. How does the call to wcslen() work?
9 votes
5 answers
509 views

Is it possible to reliably auto-decode user files to Unicode? [C#]

I have a web application that allows users to upload their content for processing. The processing engine expects UTF8 (and I'm composing XML from multiple users' files), so I need to ensure that I ...
8 votes
3 answers
508 views

Where can I get started with Unicode-friendly programming in C?

So, I’m working on a plain-C (ANSI 9899:1999) project, and am trying to figure out where to get started re: Unicode, UTF-8, and all that jazz. Specifically, it’s a language interpreter project, and I ...
7 votes
4 answers
4k views

What is the range of Unicode Printable Characters?

Can anybody please tell me the range of Unicode (UTF8) printable characters? [e.g. the ASCII printable character range is \u0020 - \u007f]
7 votes
1 answer
776 views

How to get new Unicode characters or emoji in order to use them in an iOS app?

After searching the web for a couple of hours, I didn't find any solution to my requirement. I am creating a new iOS app which shows some emoji, but these should be my own custom emoji; there are ...
6 votes
8 answers
4k views

Why the Excess Memory for Strings in Delphi?

I'm reading in a large text file with 1.4 million lines that is 24 MB in size (average 17 characters a line). I'm using Delphi 2009 and the file is ANSI but gets converted to Unicode upon reading, so ...
6 votes
3 answers
3k views

Java: How to create unicode from string “\u00C3” etc

I have a file that has strings hand-typed as \u00C3. I want to create the Unicode character that this escape represents in Java. I tried but could not find out how. Help. Edit: When I read ...
5 votes
2 answers
6k views

Is it advisable to use strcmp or _tcscmp for comparing strings in Unicode versions?

Is it advisable to use strcmp or _tcscmp for comparing strings in Unicode versions?
5 votes
3 answers
1k views

Java Unicode strings sorting

In Java, how do Unicode strings get compared? What I mean is, if I have a few, say, Japanese strings, when I do the following: java.util.Arrays.sort(arrayOfJapaneseStrings); how do those ...
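For context, `Arrays.sort` on a `String[]` uses `String.compareTo`, which orders strings by the numeric values of their UTF-16 code units rather than by any language's alphabet. The sketch below contrasts that with a locale-aware `java.text.Collator`. It is an illustration under assumptions, not the thread's answer: the class name `UnicodeSortSketch`, the helper methods, and the French sample data are hypothetical, chosen because the result is easy to check by eye. For Japanese text, `Locale.JAPANESE` would be the analogous choice.

```java
import java.text.Collator;
import java.util.Arrays;
import java.util.Locale;

public class UnicodeSortSketch {
    // Natural ordering: String.compareTo, i.e. UTF-16 code unit values.
    static String[] sortByCodeUnit(String[] input) {
        String[] copy = input.clone();
        Arrays.sort(copy);
        return copy;
    }

    // Locale-aware ordering using the collation rules for the given locale.
    static String[] sortWithCollator(String[] input, Locale locale) {
        String[] copy = input.clone();
        Arrays.sort(copy, Collator.getInstance(locale));
        return copy;
    }

    public static void main(String[] args) {
        // Hypothetical sample: "z", "e with acute accent" (U+00E9), and "a".
        String[] words = {"z", "\u00E9", "a"};

        // Code unit order puts U+00E9 after 'z' because 0xE9 > 0x7A.
        System.out.println(Arrays.toString(sortByCodeUnit(words)));

        // French collation places the accented e before 'z'.
        System.out.println(Arrays.toString(sortWithCollator(words, Locale.FRENCH)));
    }
}
```

A `Collator` is also a `Comparator`, so the same instance can be passed to `Collections.sort` or `List.sort` when the strings live in a collection instead of an array.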
