26

I'm Vietnamese and i want to upload a utf-8 filename like

Tên Tệp Tiếng Việt.JPG

Here is my code

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml"> <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> <title>utf-8</title> </head> <body> <?php if(isset($_POST["submit"])) { if($_FILES["upload"]["error"] > 0 ) echo "FILE ERROR!"; else { $base_dir = "D:/"; $fn = $_FILES["upload"]["name"]; $fn2 = $base_dir.$fn; move_uploaded_file($_FILES["upload"]["tmp_name"],$fn2); } } ?> <form action="" method="post" enctype="multipart/form-data" name="form1" id="form1"> <input type="file" name="upload" id="upload" /> <input type="submit" name="submit" id="submit" value="Send" /> </form> </body> </html> 

but when i upload that i see on my computer D:\ has a file like

Tên Tệp Tiếng Việt.JPG

How to fix that thanks

7
  • 2
    On Windows you may have to use iconv() to convert it into ucs2 Commented Aug 13, 2013 at 8:40
  • @Jack I try ` $fn2 = iconv("UTF-8","ucs2", $base_dir.$fn) ;` but get error iconv(): Wrong charset, conversion from UTF-8' to ucs2' is not allowed ? Commented Aug 13, 2013 at 8:44
  • 1
    It's UCS-2 actually :) Commented Aug 13, 2013 at 8:45
  • i edit that to $fn2 = iconv("UTF-8","UCS-2", $base_dir.$fn); but i get new error move_uploaded_file(): Filename cannot be empty .. and move_uploaded_file(): Unable to move 'E:\PHP\Xampp\tmp\php6B2C.tmp' ... ? what is wrong Commented Aug 13, 2013 at 8:49
  • 1
    First things first. Can you echo the filename you receive on the server? Test whether the filename is uploaded encoded in UTF-8? Because that part is likely working. The problem is how PHP interacts with the file system when writing files, which basically depends on whatever the underlying filesystem does, which may be pretty ugly and complex in case of Windows. If the filename is uploaded fine in UTF-8 the question becomes "how to write a filename in UTF-8", the answer to which may not be pretty. Commented Aug 28, 2013 at 7:22

9 Answers 9

16
+50

I'm on Windows 8 chinese version, and I deal with similar problem with this:

$filename = iconv("utf-8", "cp936", $filename); 

cp stands for Code page and cp936 stands for Code page 936, which is the default code page of simplified chinese version of Windows.

So I think maybe your problem could be solved in a similar way:

$fn2 = iconv("UTF-8","cp1258", $base_dir.$fn); 

I'm not quite sure whether the default code page of your OS is 1258 or not, you should check it yourself by opening command prompt and type in command chcp. Then change 1258 to whatever the command give you.

UPDATE

It seems that PHP filesystem functions can only handle characters that are in system codepage, according to this answer. So you have 2 choices here:

  1. Limit the characters in the filename to system codepage - in your case, it's 437. But I'm pretty sure that code page 437 does not include all the vietnamese characters.

  2. Change your system codepage to the vietnamese one: 1258 and convert the filename to cp1258. Then the filesystem functions should work.

Both choices are deficient:

Choice 1: You can't use vietnamese characters anymore, which is not what you want.

Choice 2: You have to change system code page, and filename characters are limited to code page 1258.

UPDATE

How to change system code page:

Go to Control Panel > Region > Administrative > Change system locale and select Vietnamese(Vietnam) in the drop down menu.

Sign up to request clarification or add additional context in comments.

7 Comments

+1 because I think this answer is going in the right direction. Since you'll need to choose the right codepage for the set of characters you want to use, this is of course hard to generalize.
i try and give: Active code page: 437, and i change to $fn2 = iconv("UTF-8","cp437", $base_dir.$fn); and give error like Notice: iconv(): Detected an illegal character in input string in and some warning like Warning: move_uploaded_file(): Filename cannot be empty ...?
@trungkien Now I have no idea either. Try cp1258, maybe?
that's running look better than before :( but still not utf-8
i follow superuser.com/questions/269818/… to change codepage to 1258. The first I just change in HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\Nls\CodePage\OEMCP. But the second, i can't find Autorun in HKEY_LOCAL_MACHINE\Software\Microsoft\Command Processor\Autorun ?
|
5

This meta has no effect:

<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />

because the web server has already sent Content-Type header, and thus decided what the encoding will be. Web browsers send forms in the same encoding. The meta is useful when user is off-line.

So you have to sned http header Content-Type by yourself:

<?php header("Content-Type: text/html; charset=utf-8"); ?> 

ensure that you put this before any html, content or whatever is sent.


Alternatively, accept-charset tag on form should work as weel:

 <form accept-charset="utf-8"> 

10 Comments

i added <?php header("Content-Type: text/html; charset=utf-8"); ?> at first of file (file is my code in my question) but nothing work :(
@trungkien What browser are you using ? Try to watch network traffic and especially when you send form, check the sent (from browser) content-type header. Is there a charset ? You can also try adding accept-charset="utf-8" to <form>
I'm using firefox. How to watch network traffic?. I using firebug and i go to: Net>All>Headers>Response Headers and i see Content-Type text/html; charset=utf-8 on Response Headers. I try to add accept-charset="utf-8" but not working. Here is my upload.php file jsfiddle.net/nnBwV
@trungkien Okay, I meant content-type in Request Hedears, take a look there.
in option Request Headers From Upload Stream has Content-Type multipart/form-data; boundary=---------------------------18199138701329 ??
|
3

I am Persian and I have same problem with utf-8 character in my language. I could solve my problem with this code:

$fn = $_FILES["upload"]["name"]; // name of file have some utf-8 characters $name=iconv('utf-8','windows-1256', str_replace('ی', 'ي', $fn)); move_uploaded_file($_FILES["upload"]["tmp_name"],$name ); 

I am not sure about vientam language but maybe you can use the same code as above with a few changes:

$fn = $_FILES["upload"]["name"]; // name of file have some utf-8 characters $name=iconv('utf-8','cp936', $fn); move_uploaded_file($_FILES["upload"]["tmp_name"],$name ); 

Comments

1

The only solution I have found so far.. (2014 year):

1) I dont store files on my FTP in UTF-8 string. Instead, i use this function, to rename the uploaded files:

<?php // use your custom function.. for example, utf8_encode $newname = utf8_encode($_FILES["uploadFile"]["name"]); move_uploaded_file($_FILES["uploadFile"]["tmp_name"], $newname); ?> 

2) Now you can rename (or etc) $newname;

1 Comment

It Didn't Work On Windows
0

For a start get detecting filename encoding (before uploading).

print_r($_FILES["upload"]); 

Insert filename to decoder and check encoding.

Comments

0

Sorry! your question is about file name.

You must save your file name with iconv but read without this.

for saving:

<?php $name = $_FILES['photo']['name']; $unicode = iconv('windows-1256', 'utf-8', $name); move_uploaded_file($_FILES['photo']['tmp_name'], 'photos/' . $name); mysql_query("INSERT INTO `photos` (`filename`) VALUES ('{$unicode}')"); ?> 

for reading:

<?php $images = mysql_query('SELECT * FROM `photos`'); if($images && mysql_num_rows($images) > 0) { while($image = mysql_fetch_assoc($images)) { $name = iconv('utf-8', 'windows-1256', $image['filename']); echo '<img src="photos/' . $name . '"/>'; } mysql_free_result($images); }?> 

Comments

-1
function convToUtf8($str) { if( mb_detect_encoding($str,"UTF-8, ISO-8859-1, GBK")!="UTF-8" ) { return iconv("gbk","utf-8",$str); } else { return $str; } } $filename= convToUtf8($filename) ; 

Comments

-2

try this

$imgname = $_FILES['img'] ['name'] ; $imgsize = $_FILES['img'] ['size'] ; $imgtmpname = $_FILES['img'] ['tmp_name'] ; $imgtype = $_FILES['img'] ['type'] ; $size = 1024; $imgtypes = array('image/jpeg','image/gif','image/png'); $folder = "up"; if(empty($imgname)){ echo "Shose ur photo"; }else if(!in_array($imgtype,$imgtypes)){ echo "this photo type is not avalable"; }else if($imgsize > $size){ echo "this photo is dig than 6 MB"; }else if($imgwidth > 5000){ echo "the file is to big"; } else{ move_uploaded_file($imgtmpname, $folder, $filename); } 

1 Comment

sorry but, my question is about store utf-8 name of file on my computer?
-3

You can use this

$fn2 = basename($_FILES['upload']['name']); 

1 Comment

It won't solve ant problem, be careful about what you are saying!

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.