Created on 2008-05-23.12:33:54 by kgardas, last changed 2008-08-25.09:31:42 by kowey.
| msg5657 (view) |
Author: kgardas |
Date: 2008-08-22.18:23:38 |
|
The only improvement is that instead of live-lock I get dead-lock. So darcs does
not consume 100% CPU, but just waits silent forever. I've reported this to
darcs-users@ and Dmitry has kindly asked me to duplicate with --debug
--debug-http, but I've not had any free time to do this yet. Sorry, for that.
For more information see Solaris' buildbot and its failing test_network test. It
locks all the time now.
Well, but if you care I may open different bug for this if you like to have this
one closed. Thanks!
|
| msg5655 (view) |
Author: kowey |
Date: 2008-08-22.17:39:22 |
|
Dmitry says that issue977 has been resolved. Any improvements with the latest
darcs, Karel?
|
| msg5444 (view) |
Author: mornfall |
Date: 2008-08-12.22:17:39 |
|
I suspect this might be related to curl versus threaded runtime. See also
issue977 -- Dmitry is looking into that, I believe.
|
| msg5050 (view) |
Author: kowey |
Date: 2008-06-16.10:02:08 |
|
Just to check, is it just darcs get that's causing problems?
If we echo changes.sh > tests/network/tests_to_run (disabling the get test), do
we still have a hang?
|
| msg5004 (view) |
Author: kowey |
Date: 2008-06-11.12:23:02 |
|
Argh. Nevemind. That was noise. I was (a) using a silly URL and (b) had not
set my proxy server
|
| msg5003 (view) |
Author: kowey |
Date: 2008-06-11.10:55:34 |
|
At the risk of throwing in some noise, I notice that under Windows (darcs
2.0.1rc1 + 15) with curl pipelining enabled, darcs will sit there forever trying
to identify the repository format.
The debugging info says that it has successfully retrieved the format file, but
if you ls it, the tmpfile it's using is 0 byte.
Attached
|
| msg4925 (view) |
Author: kgardas |
Date: 2008-06-02.19:24:42 |
|
It seems that this issue also might cause strange dead-locks in darcs testsuite
as shown on the darcs buildbots. Sometimes either make test or make test_network
hangs forever. Following little scripts duplicates this issue on Solaris:
[code]
while (true); do gmake test; gmake test_network; done
[/code]
|
| msg4855 (view) |
Author: tux_rocker |
Date: 2008-05-24.10:02:30 |
|
I don't think this is caused by ntpd. So many boxes are running that, that if it
ever caused trouble, it would have been fixed already. Can you reproduce it
under Solaris?
Also, removing the question marks from ptrace output is impossible AFAIK. A tool
like ptrace assumes that the processor registers are used according to some C
ABI, but GHC's stack and register layout is totally different from C's.
Creating a tracing tool for GHC-compiled executables seems like a fun project
though :)
|
| msg4850 (view) |
Author: karel.gardas |
Date: 2008-05-23.14:44:02 |
|
Actually I think Zooko on darcs-user might right claiming that this
might be issue in libcurl badly handling of time changes. i.e. I'm
running ntpd and perhaps at the right time it moved clock backward which
might confuse curl enough to break whole darcs. But this would require
some testing for a proof.
|
| msg4849 (view) |
Author: kowey |
Date: 2008-05-23.14:24:27 |
|
For what it's worth, I was able to get that repository under Linux (darcs 2.0.0
+ 264)
|
| msg4848 (view) |
Author: kgardas |
Date: 2008-05-23.12:33:52 |
|
While trying to get http://darcs.haskell.org/ghc-stable/packages/pretty
by invoking `darcs get --partial
http://darcs.haskell.org/ghc-stable/packages/pretty' command as part of
GHC buildprocess on GHC buildbot I've notices that this darcs never
finishes its `Identifying repository
http://darcs.haskell.org/ghc-stable/packages/pretty
' step. It seems there is some live-lock involved while calling licurl
functions. This is on Solaris 11, ghc 6.8.2.20080430, darcs from May 22,
linked against curl 7.18.0 with curl pipelining enabled.
As this is on Solaris I've saved few process's stacktraces by using
pstack(1) command. It seems darcs is again and again invoking following
trace:
----------------- lwp# 1 / thread# 1 --------------------
fea545b7 gettimeofday (80459c0, 8561e60, 80459d8, feed829d, 80459c0,
8561e64) + 7
feed82da curl_multi_perform (8561e60, 8045a2c, 0, 0, 0, 0) + 8e
0841e5db curl_wait_next_url (0, 0, 0, 0, 0, 0) + 37
08189e3d ???????? (fe3ec8c1, fe3ec8ce, fe3ec8e7, fe3ec8f0, 818b29c,
fe3ec8fa)
00000001 ???????? (c5830818, a79ae9fc, f689002e, 384ab0, 4, ffff0022)
b11c0045 ???????? ()
Anyway whole log saved by few seconds running `while (true) do pstack
<darcs pid>; done is attached.
Could anybody be so kind and try to duplicate this issue on Linux? Also
if anybody is able to tell me how to get rid of question marks in the
trace and replace them with appropriate GHC generated functions name,
then I would be glad to provide "fixed" stacktrace to show more
information about the issue. Thanks!
PS: Please note this is in Solaris running on real-metal, not in broken
VirtualBox!
|
Browse related patches:
unstable
|
stable
|
| Date |
User |
Action |
Args |
| 2008-08-25 09:31:42 | kowey | set | topic:
+ HTTP nosy:
droundy, tommy, beschmi, kowey, dagit, tux_rocker, dmitry.kurochkin, Serware, karel.gardas, kgardas, mornfall |
| 2008-08-23 07:57:53 | kowey | set | status: testing -> need-info nosy:
+ droundy, Serware topic:
+ ReleaseCritical_2.1, - ReleaseCritical_2.0.3 title: darcs2 live-locks on GHC subrepo (darcs get --partial http://darcs.haskell.org/ghc-stable/packages/pretty) -> darcs2 dead-locks on GHC subrepo (darcs get --partial http://darcs.haskell.org/ghc-stable/packages/pretty) |
| 2008-08-22 21:47:54 | dmitry.kurochkin | set | nosy:
+ dmitry.kurochkin |
| 2008-08-22 18:23:42 | kgardas | set | nosy:
tommy, beschmi, kowey, dagit, tux_rocker, karel.gardas, kgardas, mornfall messages:
+ msg5657 |
| 2008-08-22 17:39:29 | kowey | set | status: chatting -> testing nosy:
tommy, beschmi, kowey, dagit, tux_rocker, karel.gardas, kgardas, mornfall messages:
+ msg5655 topic:
+ ReleaseCritical_2.0.3 assignedto: karel.gardas superseder:
+ get => Possible bug in URL.waitNextUrl (2.0.2 + 20) |
| 2008-08-12 22:17:42 | mornfall | set | nosy:
+ mornfall, karel.gardas, - karel.gardas messages:
+ msg5444 |
| 2008-06-16 10:02:09 | kowey | set | nosy:
tommy, beschmi, kowey, dagit, tux_rocker, karel.gardas, kgardas messages:
+ msg5050 |
| 2008-06-11 12:23:12 | kowey | set | files:
- hang-windows nosy:
tommy, beschmi, kowey, dagit, tux_rocker, karel.gardas, kgardas |
| 2008-06-11 12:23:04 | kowey | set | nosy:
tommy, beschmi, kowey, dagit, tux_rocker, karel.gardas, kgardas messages:
+ msg5004 |
| 2008-06-11 10:55:35 | kowey | set | files:
+ hang-windows nosy:
tommy, beschmi, kowey, dagit, tux_rocker, karel.gardas, kgardas messages:
+ msg5003 |
| 2008-06-02 19:24:44 | kgardas | set | nosy:
tommy, beschmi, kowey, dagit, tux_rocker, karel.gardas, kgardas messages:
+ msg4925 |
| 2008-05-24 10:02:32 | tux_rocker | set | nosy:
+ tux_rocker, karel.gardas, - karel.gardas messages:
+ msg4855 |
| 2008-05-24 09:48:42 | kowey | link | issue878 superseder |
| 2008-05-24 09:48:27 | kowey | link | issue877 superseder |
| 2008-05-23 14:44:04 | karel.gardas | set | nosy:
+ karel.gardas messages:
+ msg4850 |
| 2008-05-23 14:24:50 | kowey | set | topic:
+ SunOS nosy:
tommy, beschmi, kowey, dagit, kgardas |
| 2008-05-23 14:24:29 | kowey | set | status: unread -> chatting nosy:
+ kowey messages:
+ msg4849 |
| 2008-05-23 12:33:54 | kgardas | create | |
|