darcs

Issue 2524 regression: handling of unicode filenames

Title regression: handling of unicode filenames
Priority bug Status needs-diagnosis/design
Milestone Resolved in
Superseder Nosy List attila.lendvai, ganesh
Assigned To ganesh
Topics

Created on 2017-02-12.17:04:46 by attila.lendvai, last changed 2017-02-12.18:38:06 by ganesh.

Messages
msg19348 (view) Author: attila.lendvai Date: 2017-02-12.17:04:44
it broke somewhere after v2.5

it's broken with both 2.10.1 and 2.12.4

$ mkdir -p /tmp/delme
$ cd /tmp/delme/
$ darcs init
Repository initialized.
$ touch zorká
$ darcs add zorká 
File zorká does not exist!

darcs failed:  No files were added
$ locale
LANG=en_US.utf8
LANGUAGE=
LC_CTYPE="en_US.utf8"
LC_NUMERIC="en_US.utf8"
LC_TIME="en_US.utf8"
LC_COLLATE="en_US.utf8"
LC_MONETARY="en_US.utf8"
LC_MESSAGES="en_US.utf8"
LC_PAPER="en_US.utf8"
LC_NAME="en_US.utf8"
LC_ADDRESS="en_US.utf8"
LC_TELEPHONE="en_US.utf8"
LC_MEASUREMENT="en_US.utf8"
LC_IDENTIFICATION="en_US.utf8"
LC_ALL=
$ ls -l
total 4
drwxr-xr-x 6 alendvai alendvai 4096 Feb 12 19:38 _darcs
-rw-r--r-- 1 alendvai alendvai    0 Feb 12 19:38 zorká 
$ darcs --exact-version 
darcs compiled on Aug 12 2015, at 14:56:00

Context:

[TAG 2.10.1
Guillaume Hoffmann <guillaumh@gmail.com>**20150709164849
 Ignore-this: 91f5dd97c5899801364cdfc0a8266faa
] 

Compiled with:

array-0.5.0.0
attoparsec-0.12.1.6
base-4.7.0.2
base16-bytestring-0.1.1.6
binary-0.7.1.0
bytestring-0.10.4.0
containers-0.5.5.1
cryptohash-0.11.6
data-ordlist-0.4.7.0
dataenc-0.14.0.7
directory-1.2.1.0
filepath-1.3.0.2
hashable-1.2.3.3
haskeline-0.7.1.2
html-1.0.1.2
mmap-0.5.9
mtl-2.1.3.1
old-locale-1.0.0.6
old-time-1.1.0.2
parsec-3.1.9
process-1.2.0.0
random-1.1
regex-applicative-0.3.2.1
regex-compat-tdfa-0.95.1.4
tar-0.4.1.0
terminfo-0.4.0.0
text-1.2.0.6
time-1.4.2
transformers-0.3.0.0
transformers-compat-0.4.0.4
unix-2.7.0.1
unix-compat-0.4.1.4
utf8-string-1
vector-0.10.12.3
zip-archive-0.2.3.7
zlib-0.5.4.2
$ 
$ darcs --exact-version 
darcs compiled on Nov  8 2016, at 15:11:43

Context:

[TAG 2.12.4
Guillaume Hoffmann <guillaumh@gmail.com>**20160914143914
 Ignore-this: fee5b93a45e3a175e63e6fdb837ec4dc
] 

Compiled with:

HTTP-4000.3.3
array-0.5.1.1
async-2.1.0
attoparsec-0.13.1.0
base-4.9.0.0
base16-bytestring-0.1.1.6
binary-0.8.3.0
bytestring-0.10.8.1
containers-0.5.7.1
cryptohash-0.11.9
data-ordlist-0.4.7.0
directory-1.2.6.2
fgl-5.5.3.0
filepath-1.4.1.0
graphviz-2999.18.1.2
hashable-1.2.4.0
haskeline-0.7.2.3
html-1.0.1.2
mmap-0.5.9
mtl-2.2.1
network-2.6.3.1
network-uri-2.6.1.0
old-time-1.1.0.3
parsec-3.1.11
process-1.4.2.0
random-1.1
regex-applicative-0.3.3
regex-compat-tdfa-0.95.1.4
sandi-0.4.0
tar-0.5.0.3
terminfo-0.4.0.2
text-1.2.2.1
time-1.6.0.1
transformers-0.5.2.0
transformers-compat-0.5.1.4
unix-2.7.2.0
unix-compat-0.4.2.0
utf8-string-1.0.1.1
vector-0.11.0.0
zip-archive-0.3.0.5
zlib-0.6.1.2
$ uname -a
Linux lelap 4.8.0-0.bpo.2-amd64 #1 SMP Debian 4.8.15-2~bpo8+2 (2017-01-17) 
x86_64 GNU/Linux
msg19349 (view) Author: ganesh Date: 2017-02-12.18:37:44
A few notes:

This only seems to affect filenames passed on the command-line, so you 
can work around with 'darcs rec -l' or similar to get the files added.

issue2095 is quite related - I suspect the way I fixed that is related 
to this problem. One thing to look at (I'll do it when I have a chance) 
is whether the GHC version is actually the trigger here, rather than 
darcs - GHC 7.0 versus 7.4 would be the relevant thing to check.
History
Date User Action Args
2017-02-12 17:04:46attila.lendvaicreate
2017-02-12 18:37:46ganeshsetmessages: + msg19349
2017-02-12 18:38:06ganeshsetstatus: unknown -> needs-diagnosis/design
assignedto: ganesh
nosy: + ganesh