Moving Blog - GB18030 and GB2312

Nov 7, 2010
I was in the US when Microsoft announced its plan to move MSN blogs to WordPress, and I finished the move with a few mouse clicks. But when I got back to Beijing, I found that I could not open the woody1234.wordpress.com web pages, so I decided to move my MSN blog manually to my blog on the Palmmicro website.
After one month of casual work, I have moved 37 old blog posts so far, and reduced the Palmmicro.com links to aredfox.spaces.live.com from 130 to 10.
When I checked the result tonight, I found that 1/5 of the Chinese blog pages had small display errors in my English IE8 running on 64-bit English Windows Vista, while Firefox and Chrome on the same laptop displayed them well. Further testing showed that if I changed the Encoding to GB2312 in the IE8 menu, the page displayed correctly, and it kept displaying correctly even after I changed the Encoding back to GB18030.
After I changed the meta part of all my Chinese pages from charset=gb18030 to charset=gb2312, all three browsers worked well with all my Chinese pages.
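
The meta change described above can be sketched as a small batch edit. This is only an illustration in Python, not the tool actually used for the site: the function name relabel_charset and the sample meta tag are assumptions. It also assumes the page bytes are already valid GB2312 (GB18030 is a superset of GB2312), so only the label in the meta tag changes and the bytes are left alone.

```python
import re

# Matches "charset=gb18030" only when it appears inside a <meta ...> tag.
# Group 1 keeps everything up to (and including) an optional opening quote.
META_CHARSET = re.compile(
    r'(<meta\b[^>]*\bcharset\s*=\s*["\']?)gb18030',
    re.IGNORECASE,
)


def relabel_charset(html: str, new_charset: str = "gb2312") -> str:
    """Replace the gb18030 label in meta tags; leave all other text untouched."""
    return META_CHARSET.sub(lambda m: m.group(1) + new_charset, html)


# Hypothetical sample tag, in the style common for pages of that era.
page = '<meta http-equiv="Content-Type" content="text/html; charset=gb18030">'
print(relabel_charset(page))
```

Text outside a meta tag that happens to mention charset=gb18030 is not touched, which matters for a page like this one that discusses the charset in its body.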

Moving Blog - Translation

Nov 10, 2010
After warming up over the past month, I focused on moving the blog for the past 3 whole days. However, the progress was not good. Although I have eliminated the last 10 Palmmicro.com links to aredfox.spaces.live.com, the total number of blog posts I moved only increased from 37 to 56. As I estimate that about another 50 remain to be moved, I will need an extra 8 full days to finish the job at the current pace.
Why so slow? A major delay is translation. Most posts before 2009 were either not translated into Chinese by myself or not translated at all. And now I am spending a lot of time translating them! I even made a new web page to keep track of common phrases.
Some time ago I was asked which language version I wrote first. I answered English, of course. If I wrote in Chinese first, I would not be able to translate it into English!
The sad truth is, translating from English to Chinese is also difficult. Many Chinese say they can not understand what I wrote in Chinese either.

Moving Blog - Summary

Nov 14, 2010
After 1 month of warming up and 8 days of focused hard work, at last I finished moving the 98 blog posts from MSN space to the Palmmicro website.
Fewer than 10 old posts were discarded, mostly because the AR1688 technical details they discussed were no longer correct. Besides adding many new Chinese translations, I have also added many links between different pages and corrected obvious errors. It is quite difficult to keep things as they were. Now I fully understand why Jin Yong kept modifying his 15 novels again and again over the past 30 years.
All remarks on the original posts were discarded. It is a pity, but I can not move other people's remarks.
Why not stay on wordpress.com? Here are the answers:

  1. Actually I had been planning to move my blog from MSN space to the company website since May. I built my blog page at that time and began to use MSN space as a copy. But the huge amount of moving work made me hesitate. I am glad that Microsoft finally helped me make the choice.
  2. Access to wordpress.com is often slow or totally blocked in China. And I can not see any reason why it will not be blocked by the GFW in the near future.
  3. The automatic move to wordpress.com is not as good as promised. I have noticed that many ' ', '\0' and ''' characters were lost in the moved text. And when displaying in Chinese, WordPress stupidly converts many punctuation marks into their Chinese versions and makes the whole page look silly.

Tang Li is also moving her MSN space to the Palmmicro website. With so many new pages added, the visitor statistics of our website are expected to explode next. The image below shows 885 visits from 230 cities around the world in the past 30 days, with 6,649 total pageviews.
Google Analytics report of Palmmicro.com visitor location information, Oct 2010.

Browsers Used by Palmmicro Web Visitors

March 28, 2011
With the arrival of IE9 and Firefox 4, news about web browsers is booming once again. The most disturbing item is that 360 is working with the father of the GFW to provide a secure web browser for Chinese users.
360 also said that, according to Baidu, 18% of users were using the so-called 360 web browsers. I am using Google Analytics to track Palmmicro.com traffic, so let us see the browsers used by Palmmicro web visitors according to Google in the image below.
During the past 30 days, there were 1,072 visits from 294 cities in 69 countries/territories, and pages were viewed a total of 6,619 times, almost the same as 4 months ago.
Google Analytics report of Palmmicro.com visitor web browser usage, Mar 2011.

Replacing GB2312 with UTF-8

March 8, 2012
Growing up with GB2312, and puzzled by Microsoft's 2-byte Unicode in the past 2 years, I ignored UTF-8 most of the time. It is funny that I was still debugging GB18030 and GB2312 in late 2010. However, as I received more and more UTF-8 encoded Chinese emails sent from iPads during the past year, I began to think it must be important, since Apple always boasts of how easy its products are to use.
More investigation into the usage of UTF-8 shocked me. Among the tens of websites I usually visit, only SMTH and the forum part of TianYa are NOT using UTF-8 now. Better late than never: I started converting the Palmmicro website from GB2312 to UTF-8 last weekend.
As always, the work took longer than expected. I spent some time modifying Woody's Web Tool, and more time learning the VC2008 editor settings for editing UTF-8 files. The stupid editor always needs to be told to save as UTF-8 without signature.
And again I had to discard many blog comments previously saved in GB2312 encoding. Still a new PHP/MySQL programmer, I can not figure out an easy way to convert them in the current MySQL database.
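
For readers wondering what "UTF-8 without signature" means: the signature is the 3-byte byte order mark (EF BB BF) that some editors write at the start of a UTF-8 file. A minimal sketch in Python of checking for it and stripping it follows. The function name strip_utf8_signature is an assumption for illustration, and this is not part of Woody's Web Tool.

```python
import codecs


def strip_utf8_signature(data: bytes) -> bytes:
    """Return data without a leading UTF-8 byte order mark, if one is present."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


# A file body as an editor might save it "with signature".
with_signature = codecs.BOM_UTF8 + "中文".encode("utf-8")
without_signature = strip_utf8_signature(with_signature)

print(with_signature[:3].hex())     # the signature bytes
print(len(with_signature) - len(without_signature))  # bytes removed
```

Stripping is safe to repeat: a file that has no signature comes back unchanged.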

Converting GB2312 Encoded String to UTF-8 Using PHP

June 9, 2016
I have been adding more features to the SZ162411 net value tool recently. After working on web pages now and then for so many years, my originally planned web management for PA6488 and PA3288 products is still unavailable, and palmmicro.com is becoming an amateur stock website.
As more and more stocks are involved, I plan to use the stock information in the Sina stock data directly instead of inputting it by hand. Now the problem of 4 years ago comes back: the data from Sina is still GB2312 encoded, and I still can not convert it from GB2312 to UTF-8 with native PHP functions like mb_detect_encoding and iconv.
But I am much more experienced in PHP now. First I downloaded the GB2312 to Unicode conversion table from the internet and saved it to the file unicode_gb2312.txt. Then I wrote a conversion tool in /php/gb2312.php, which generated the array $arGB2312 keyed and sorted by GB2312 code, and put it in the file /php/gb2312/gb2312_unicode.php. Finally, the function FromGB2312ToUTF8 looks up the Unicode value in the $arGB2312 table and calls a small function unicode_to_utf8 to convert it to UTF-8. The whole process was done in one night. It really feels good!
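
The table lookup approach described above can be sketched as follows. This is a Python illustration of the idea, not the PHP code from /php/gb2312.php. The function names only mirror the ones mentioned in the post, their bodies are assumptions, and AR_GB2312 here is a hypothetical 2-entry stand-in for the full table generated from unicode_gb2312.txt.

```python
# Hypothetical tiny stand-in for the full GB2312 -> Unicode table.
# Key: 2-byte GB2312 code as an integer; value: Unicode code point.
AR_GB2312 = {
    0xD6D0: 0x4E2D,  # 中
    0xCEC4: 0x6587,  # 文
}


def unicode_to_utf8(cp: int) -> bytes:
    """Encode one Unicode code point as UTF-8 bytes."""
    if cp < 0x80:
        return bytes([cp])
    if cp < 0x800:
        return bytes([0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)])
    if cp < 0x10000:
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    return bytes([0xF0 | (cp >> 18),
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


def from_gb2312_to_utf8(data: bytes, table=AR_GB2312) -> bytes:
    """Convert GB2312 bytes to UTF-8 via table lookup.

    Bytes below 0x80 are ASCII and pass through; any other byte starts a
    2-byte GB2312 code. A code missing from the table raises KeyError.
    """
    out = bytearray()
    i = 0
    while i < len(data):
        b = data[i]
        if b < 0x80:
            out += unicode_to_utf8(b)
            i += 1
        else:
            if i + 1 >= len(data):
                raise ValueError("truncated 2-byte GB2312 sequence")
            key = (b << 8) | data[i + 1]
            out += unicode_to_utf8(table[key])
            i += 2
    return bytes(out)


print(from_gb2312_to_utf8(b"A\xd6\xd0\xce\xc4").decode("utf-8"))
```

Using the 2-byte code as a single integer key keeps each lookup to one dictionary access, which is roughly what an array keyed by GB2312 code gives in PHP.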
