25-Nov-2006

Bind is Ok

it seems, since there were no reports of Internet connectivity not working. Si I guess it was just a glitch in the interaction between Diana and the router. No problems aftre yesterday (excpet that at some point, the PHP process ran out of file quota, but whether these were VMS of PHP-internal could not be found.

One  more (minor) issue with PHP – and I found this before:

%HTTPD-W-NOTICED, 23-NOV-2006 22:13:17, CGI:1969, not a strict CGI response
-NOTICED-I-SERVICE, http://www.grootersnet.nl:80
-NOTICED-I-CLIENT, 192.168.0.33
-NOTICED-I-URI, GET (18 bytes) /sysblog/index.php
-NOTICED-I-SCRIPT, /sysblog/index.php sysblog:[000000]index.php (cgi_exe:phpwasd.exe) SYSBLOG:[000000]index.php
-NOTICED-I-CGI, 254445425547424F4F542D572D43484E2C2061737369676E (62 bytes) %DEBUGBOOT-W-CHN, assign channel system service request failed
-NOTICED-I-RXTX, err:0/0 raw:495/0 net:495/0

It’s this message that triggers the “Not a strict CGI response” situation. Something within PHP fails to (re)connect to a channel. It’s a sprurious error and mostly nothing is wrong, just once in a while this happens. Stopping the sub-process heals – but it should  not happen….

(I guess the previous trouble posting had the same cause)

24-Nov-2006

Still problems with BINDThe problems yesterday did indeed cause a problem with mail. Since checking domains failed, delivery of mail was rejected as well. Therefore, I had to resent mail pushed out yesterday. I did that at 7:00 am – about that time.

But at 9, Kim called that there was again no connectivity to the Internet, and Kasper did as well around 11. Mail sent to grootersnet.nl failed again!

So The first thing to do was having a look once more. But $ TCPIP SHO HOST showed all I would expect. I looked in Operartor.log and teh BIND logs but there is no clue whatsoever why it suddenly fails to translate external addresses! After stopping and starting BIND, and waiting a few minutes, there was no more problem!

It will be one of the first things to check tomorrow.

Aphrodite has some problem once in a while. It will get all configuration data form Diana (set up to be configured by DHCP – as all Windows boxes) but connection for telnet and POP (for mail) will fail; Starting a DOS box and give the command ipconfig /renew solves the problem. In the BIND log, Aphrodite is often mentioned trying to make adjustments to the BIND database – which is, for good reasons, not allowed.

 

23-Nov-2006

Power outage

Yesterday I had to repair a light switch, and so power had to be taken off – and I expected that the datacentre would powered: the guide sheet stated “First floor” and did not mention the attic where the datacenter is. Alas – that’s correct where light is concerned but major power is wired to the first floor – so DOWN all went.

Nasty thing is I just found out today. That is: Kim complained that Internet connectivity didn’t work. But the local network seemed Ok. It was impossible to access the system directly (I was on another location) so I got into the issue tonight – and found out what happened.

On Irene – the Windows machine for ‘normal use’ – nslookup of an external site failed, but internal all was well. So I took a look to DIana’s TCPIP$BIND process – which was running fine but $ TCPIP SHOW HOST just showed the local database, not what is stored in DNS! So I restarted BIND (that is: $ @SYS$STARTUP:TCPIP$BIND_SHUTDOWN followed by $ @SYS$STARTUP:TCPIP$BIND_STARTUP) and next the data did properly show up. Next I had to renew Irene’s DHCP data, before nslookup did work.

But to find out WHY this happened: $ SHOW SYSTEM/NOPROC showed an uptime of only almost 24 hours, and yesterday’s Operator.log ended when I switched off power: at 19:15, and a second log started when power was restored (at 19:45). That’s a way to find out what miistake I made last night!

(If I could have checked the logfiles from my job location, something I usually do at another site, I would have noticed before)

Everthing worked fine after that, except for this blog. Simple reason: the logical where this log resides wasn’t properly defined. Just had to change that (manually, to be done in the startup files as well). Now all runs fine, only it seems mail is not delivered. I sent something this afternoon, there was nothing wong with it, it didn;t bounce but I have not got it here. It might come later…

21-Nov-2006

PHP + MySQL

is not fully stable: Today I tried to post a message into the security category, from elsewhere, but both attemps failed die to a crash in WASD_PHP, likely to be happened in PHPSHR:

%HTTPD-W-NOTICED, 21-NOV-2006 08:31:11, CGI:1969, not a strict CGI response
-NOTICED-I-SERVICE, http://www.grootersnet.nl:80
-NOTICED-I-CLIENT, 194.151.195.208
-NOTICED-I-URI, POST (26 bytes) /sysblog/wp-admin/post.php
-NOTICED-I-SCRIPT, /sysblog/wp-admin/post.php sysblog:[wp-admin]post.php (cgi_exe:phpwasd.exe) SYSBLOG:[wp-admin]post.php
-NOTICED-I-CGI, 2553595354454D2D462D485041524954482C206869676820 (129 bytes) %SYSTEM-F-HPARITH, high performance arithmetic trap, Imask=00000000, Fmask=00000002, summary=02, PC=00000000001E9C94, PS=0000001B
-NOTICED-I-RXTX, err:0/0 raw:6291/0 net:876/0
After another attempt, I gave up from there, and tried just a few minutes ago from tjhe LAN – and it went perfectly. But that also means I cannot reproduce it using WASD’s facilities.

Starting the admin pages a seceond time missed the stylesheets – but refresh gave them back…

Login failures

I created a commandprocedure (for a colleague, to start with) to run every night, scanning the accounting file for failed logins, using the /SINCE=YESTERDAY quelifier, so I’ll get only the most recent ones. I display name, account, and, if applicable, the address of the remote site.

Usually, it just shows:

================================================================================
21-NOV-2006 00:01:02.27 Login failures found
================================================================================
Time                 Username     UIC                  Account             
                     RemoteID             System
                     Code      TEXT
——————————————————————————–
================================================================================
No login failures found
but to have a clear view on what happened before, I used /SINCE=01-Jan-2004 – which gave me all entries since December 2005, and nothing before. Obvious, since that’s when this VMS instance came alive (I will need to run the test on the old accounting file to get more)

That revealed a few interesting attemps. What to think of a script from a Windows box:

13-MAY-2006 16:40:03 Administrato [TCPIP$AUX,TCPI                     
                     FTP_18F2BB1A         rrcs-24-242-187-26.sw.biz.rr.com
                     00D380F4  %LOGIN-F-NOSUCHUSER, no such user

** Repeated 2281 times **
13-MAY-2006 16:54:39 Administrato [TCPIP$AUX,TCPI                     
                     FTP_18F2BB1A         rrcs-24-242-187-26.sw.biz.rr.com
                     00D380F4  %LOGIN-F-NOSUCHUSER, no such user
About a month later, it was attempted a second time:

12-JUN-2006 21:50:29 Administrato [TCPIP$AUX,TCPI                     
                     FTP_DD8B3219         221.139.50.25
                     00D380F4  %LOGIN-F-NOSUCHUSER, no such user

** Repeated 2281 times **
12-JUN-2006 22:16:50 Administrato [TCPIP$AUX,TCPI                     
                     FTP_DD8B3219         221.139.50.25
                     00D380F4  %LOGIN-F-NOSUCHUSER, no such user
Quite recent, but seen the number of attempts, probabbly just an error or someone who is more or less knowing what he’s doing:

 8-OCT-2006 03:49:37 root         [TCPIP$AUX,TCPI                     
                     FTP_40394046         drizzle.bluegravity.com
                     00D380F4  %LOGIN-F-NOSUCHUSER, no such user
8-OCT-2006 03:49:37 root         [TCPIP$AUX,TCPI                     
                     FTP_40394046         drizzle.bluegravity.com
                     00D380F4  %LOGIN-F-NOSUCHUSER, no such user
I hope this time posting doestn’t fail. I tried twice today and both attempts went wrong due to a HPARITHM error