[charm] how to run SimpleArrayHello on multiple PCs
Chiara Orsini
chiara.orsini at iet.unipi.it
Tue Jun 29 09:10:07 CDT 2010
Dear charm++ users,
I tried to run SimpleArrayHello program on multiple PCs, but the program
ends with this error:
Charmrun> node programs all connected
Charmrun: error on request socket--
Socket closed before recv.
I read the FAQ, but linking with *-memory paranoid* did not give me any
useful information.
Consider two host A and B. I configured host A and B in order to be
accessible with each other through ssh (without password).
I can run BasicHelloWorld on host A (launching it from host A) successfully.
I can run BasicHelloWorldon host B (launching it from host A) successfully.
But, I cannot run BasicHelloWorld on host A and B (launching it from host
A). This is my nodelist file:
group main ++shell ssh
host IP_B
host IP_A
This is the output that I obtain running this command: ./charmrun *
PROGRAM_NAME* +p2 ++verbose
Charmrun> charmrun started...
Charmrun> using ./nodelist as nodesfile
Charmrun> adding client 0: "IP_B", IP:IP_B
Charmrun> adding client 1: "IP_A", IP:IP_A
Charmrun> Charmrun = IP_A, port = 50501
Charmrun> Sending "0 IP_A 50501 2547 0" to client 0.
Charmrun> find the node program "/PATH/PROGRAM_NAME/prefix" at
"/PATH/PROGRAM_NAME" for 0.
Charmrun> node 0: xterm is xterm
Charmrun> Starting ssh IP_B -l user /bin/sh -f
Charmrun> remote shell (IP_B:0) started
Charmrun> Sending "1 IP_A 50501 2547 0" to client 1.
Charmrun> find the node program "/PATH/PROGRAM_NAME/prefix" at
"/PATH/PROGRAM_NAME" for 1.
Charmrun> node 1: xterm is xterm
Charmrun> Starting ssh IP_A -l user /bin/sh -f
Charmrun> remote shell (IP_A:1) started
Charmrun> node programs all started
Charmrun remote shell(IP_A.1)> remote responding...
Charmrun remote shell(IP_A.1)> using xterm /usr/X11R6/bin/xterm
Charmrun remote shell(IP_A.1)> starting node-program...
Charmrun remote shell(IP_A.1)> rsh phase successful.
Charmrun remote shell(IP_B.0)> remote responding...
Charmrun remote shell(IP_B.0)> using xterm /usr/X11R6/bin/xterm
Charmrun remote shell(IP_B.0)> starting node-program...
Charmrun remote shell(IP_B.0)> rsh phase successful.
Charmrun> Waiting for 0-th client to connect.
Charmrun> Waiting for 1-th client to connect.
Charmrun> client 1 connected (IP=IP_A data_port=54659)
Charmrun> client 0 connected (IP=IP_B data_port=34817)
Charmrun> All clients connected.
Charmrun> IP tables sent.
Charmrun> node programs all connected
Charmrun: error on request socket--
Socket closed before recv.
Could anyone explain how to solve this problem?
Any information you'll give me will be surely appreciated.
Thank you in advance.
Best regards,
Chiara
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.cs.uiuc.edu/pipermail/charm/attachments/20100629/a0818088/attachment.html
More information about the charm
mailing list