Recente Storingen

From Cncz
Revision as of 11:39, 15 December 2014 by Polman (talk | contribs) ([IMAP mailserver probleem][IMAP mail server problem])
Jump to: navigation, search

{{#customtitle:Recent Service Interruptions}}


Current Service Interruptions and Planned Maintenance

IMAP mail server problem

  Begin         : 20141215   6:30
  End           : 20141215  11:15
  Affected      : all users wanting to read Science mail 

There was a problem with a failed harddisk of the IMAP server. This slowed the machine down so that it was practically unusable. Problem was solved by replacing the disk.


Report a problem

Use this form to report less urgent problems. For urgent problems, call 20000 (helpdesk).

Recently Resolved Service Interruptions and Maintainance

To be quickly informed about service interruptions one can subscribe to the CPK mailinglist.

Network problem RU

  Begin         : 20141126     09:45
  End           : 20141126     10:00
  Affected      : all users of the network at RU

As reported by the ISC, the RU was victim of a DDOS attack. As a consequence of this is the connection to the Internet was interrupted. Measures have been taken to prevent a similar attack having this result in the future.

Network problems FNWI

  Begin1        : 20141111     06:00
  Begin2        : 20141119     13:34
  End           : 20141120     15:32
  Affected      : all users of the wired network at FNWI

After the network maintenance of 20141111 a single user reported later a problem with the wired network. These problems intensified and grew faculty wide on 20141119 at 13:34 hours. The network switches in the Faculty of Science received so many topology changes of the Spanning Tree Protocol (STP) that network traffic often stuttered. By curbing the STP traffic on 20141120 at 15:32 the stuttering disappeared. We still search for the exact cause, probably it is a housing move.

Network maintenance in Mercator III south and low-rise buildings

  Begin         : 20141119     18:30
  End           : 20141119     23:00
  Affected      : network in Mercator III north

The network switches in the buildings/floors/locations mentioned above will be replaced. This means that both wired en wireless networks at these locations will be interrupted and network traffic will not be possible. This includes the network for access control, climate control etc.

Server nachtwacht preventive maintenance

  Begin         : 20141117 16:30
  End           : 20141117 17:00
  Affected      : n/a (no disturbance for users expected)

The server nachtwacht reports having a hard drive that will soon fail. We will replace the disk as soon as possible.

DNS resolver problem

  Begin1         : 20141115     11:29
  End1           : 20141115     14:40
  Begin2         : 20141117     04:30 / 06:30
  End2           : 20141117     08:37 for DNS, 09:34 for network boot and websites
  Affected       : almost all computers within FNWI

The server that acts within FNWI as first DNS resolver crashed, probably because of a disk becoming defective. This made the network slow or even virtually unusable for a lot of computers in FNWI. Only after a reboot of the server, the problem1 was solved. Monday morning, at the weekly reboot of this server, it would only boot after manually acknowledging the defective disk (problem2), When other servers rebooted at 06:30, a lot of them had problems due to the missing DNS resolver. Further measures to reduce the inconvenience of such a crash have been taken. The move to new DNS servers is in preparation.

Fileserver bulk problems

  Begin         : 2014-11-12 14:45
  End           : 2014-11-12 15:05
  Affected      : Users of shared network disks: acfiles ARK-Backup Bargerveen Bargerveen-Gebruikers beevee bioniccell bio-orgchem B-Ware-Arnica B-Ware-Dotter B-Ware-Erica B-Ware-Knolrus B-Ware-Lobelia B-Ware-Stratiotes celbio1 celbio2 csgi-archief csg-staf cvi desda ds ehef1 evsf2dataoud evsf2schijf1 femtospin Floron geminstr2 giphouse gissig gmi hfml-backup hfml-backup2 highres impuls introcie isis janvanhest kaartenbak kangoeroe microbiology microbiology2 milkun2 milkun3 milkun5 milkun6 milkun7 milkun7rw molbiol mwstudiereis neuroinf neuroinf2 nfstest nwibackup olympus puc RandomWalkModel Ravon-Algemeen Ravon-Foto Ravon-Projecten secres2 sigma sigmaalmanak sigmacies sigmaexchange sigmasymposium splex spmdata3 spmdata4 spmdata5 spmdata6 spmdata7 stroom thalia ucm vsc vsc1 vsc10 vsc11 vsc12 vsc13 vsc14 vsc15 vsc2 vsc3 vsc4 vsc5 vsc6 vsc7 vsc8 vsc9 vscadmin wiskalg wiskunde

Resolved by a server reboot.

Network maintenance in Mercator III north

  Begin         : 20141111     18:30
  End           : 20141111     23:00
  Affected      : network in Mercator III north

The network switches in the buildings/floors/locations mentioned above will be replaced. This means that both wired en wireless networks at these locations will be interrupted and network traffic will not be possible. This includes the network for access control, climate control etc.

Network maintenance in server room Huygens

  Begin         : 20141111     06:00
  End           : 20141111     06:15
  Affected      : network in server room Huygens. Connection to many servers may be impossible.

Because of a software upgrade of the network switches in the buildings/floors/locations mentioned above, these network switches will be rebooted at 06:00h (AM) exactly. This means that both wired en wireless networks at these locations will be interrupted for 10 to 15 minutes and network traffic will not be possible. This includes the network for access control, climate control etc.

Network maintenance in part of Huygens

  Begin         : 20141106     06:00
  End           : 20141106     06:15
  Affected      : network in Huygens wing 3 all floors
                             Huygens wing 4 all floors
                             Huygens wing 8 all floors
                             Huygens central street between wing 6 and 8 all floors

Because of a software upgrade of the network switches in the buildings/floors/locations mentioned above, these network switches will be rebooted at 06:00h (AM) exactly. This means that both wired en wireless networks at these locations will be interrupted for 10 to 15 minutes and network traffic will not be possible. This includes the network for access control, climate control etc.

Network maintenance in peripheral buildings

  Begin         : 20141104     06:00
  End           : 20141104     06:15
  Affected      : network in peripheral buildings: FELIX, HFML, Nanolab, Goudsmit pavillion (NMR),
                  Lab DCN in A-2.002, Kinderopvang Heyendael (both buildings),
                  Logistiek Centrum FNWI, Linnaeus building, Mercator 1, 6th floor

Because of a software upgrade of the network switches in the buildings/floors/locations mentioned above, these network switches will be rebooted on Tuesday morning at 06:00h (AM) exactly. This means that both wired en wireless networks at these locations will be interrupted for 10 to 15 minutes and network traffic will not be possible. This includes the network for access control, climate control etc.

Network maintenance in Huygens wing 7

  Begin         : 20141029     19:00
  End           : 20141029     23:00
  Affected      : network in Huygens wing 7 and in the central street between wings 5 and 7, all floors.

Because of an upgrade of the network switches in the buildings/floors/locations mentioned above, these network switches will be replaced. This means that both wired en wireless networks at these locations will be interrupted and network traffic will not be possible. This includes the network for access control, climate control etc.

RU network problem

  Begin         : 20141029     09:38
  End           : 20141029     15:30
  Affected      : everything connected to the network within RU until 9:45, after that Eduroam users.

Due to a short DDOS-attack the RU was virtually disconnected from the Internet during ca 7 minutes. All network connected systems may have had problems during that time. Until 15:30 some Eduroam users still had problems acquiring an IP address.

FNWI network problem

  Begin         : 20141015     03:44
  End           : 20141015     ca. 11:36 
  Affected      : almost everything connected to the network within FNWI (Heyendaal East)

The router that regulates all IP traffic between subnets in the Huygens building and surrounding buildings, became too busy. All network traffic, especially UDP had problems. ISC network admins reduced the load by simplifying the ACLs and reducing logging. At about 12:00 the problem was temporarily solved. C&CZ will do a further inspection of all ACLs, with the aim to reduce the load on the router without reducing security for connected systems.

DNS resolver problem

  Begin         : 20140920     10:45
  End           : 20140920     14:04 
  Affected      : almost all computers within FNWI

The server that acts within FNWI as first DNS resolver crashed. This made the network virtually unusable for a lot of computers in FNWI. Only after a reboot of the server, the problem was solved. Measures to reduce the inconvenience of such a crash and to make such a crash less likely, have been partly taken and are partly in preparation.

Disturbance of Vodafone VWO data traffic

  Begin         : 20140924 08:30
  End           : 20140924 17:00
  Affected      : Employees with a RU Vodafone VWO mobile phone


On the ISC website we read:

Radboud University agreed on a new mobile telephone contract with - again - provider Vodafone. For this purpose, on Wednesday, September 24th during daytime the Radboud subscriptions will be adjusted. Employees with a Vodafone mobile wille have no disturbance in calling or being called that day. But the Vodafone data network might be temporarily unavailable. To restore the data connection it is necessary to turn the mobile phone off and then on again. Of course employees on campus can always use the eduroam Wi-Fi network.

Webserver (klopjacht) disk problem

  Begin         : 20140909 10:46
  End           : 20140909 11:50
  Affected      : Dozens of websites on this webserver

A defective disk made the server stop serving web requests. A reboot fixed the problem. The defective disk will be replaced soon.

Surfnet uplink offline

  Begin         : 20140908 22:00
  End           : 20140909  0:03
  Affected      : All network traffic in and outbound the RU.

Due to an error during the planned maintenance by Surfnet of the RU network interface, the uplink of Radboud University to Surfnet was down.

Fileserver goudsmit problems

  Begin         : 20140908  6:30
  End           : 20140908 10:43
  Affected      : Users with home directories / data on the goudsmit fileserver

One of the filesystems contained errors and prevented the system from starting up.

SMTP server offline for half an hour

  Begin         : 20140821 16:00
  End           : 20140821 16:30
  Affected      : Users of the science mail service

Both processor fans have failed and need to be replaced.


Fileserver bulk problems

  Begin         : 20140815 12:04
  End           : 20140815 14:25
  Affected      : Users of shared network disks: acfiles ARK-Backup Bargerveen Bargerveen-Gebruikers beevee bioniccell bio-orgchem B-Ware-Arnica B-Ware-Dotter B-Ware-Erica B-Ware-Knolrus B-Ware-Lobelia B-Ware-Stratiotes celbio1 celbio2 csgi-archief csg-staf cvi desda ds ehef1 evsf2dataoud evsf2schijf1 femtospin Floron geminstr2 giphouse gissig gmi hfml-backup hfml-backup2 highres impuls introcie isis janvanhest kaartenbak kangoeroe microbiology microbiology2 milkun2 milkun3 milkun5 milkun6 milkun7 milkun7rw molbiol mwstudiereis neuroinf neuroinf2 nfstest nwibackup olympus puc RandomWalkModel Ravon-Algemeen Ravon-Foto Ravon-Projecten secres2 sigma sigmaalmanak sigmacies sigmaexchange sigmasymposium splex spmdata3 spmdata4 spmdata5 spmdata6 spmdata7 stroom thalia ucm vsc vsc1 vsc10 vsc11 vsc12 vsc13 vsc14 vsc15 vsc2 vsc3 vsc4 vsc5 vsc6 vsc7 vsc8 vsc9 vscadmin wiskalg wiskunde

The fileserver had problems, which were resolved after a server reboot.

M1.04 no network

  Begin         : 20140814  11:32 partly - 16:09 complete
  End           : 20140815  09:28 partly - 11:46 complete
  Affected      : Users in Mercator1 4th floor

While administering the network switchports for the move of ISIS to Mercator1 floor 0 to 3, ISC network management erroneously also changed the configuration of the network switchports of floor 4. When this was reported this morning, it could be readily remedied.

Failure cn56 disk

  Begin         : 20140707   8:00
  End           : 20140707  11:00
  Affected      : Users of cluster node cn56 and Sun Grid Engine on Ubuntu 12.04 systems

One of the mirrored disks failed. We didn't manage to rebuild the raid set with a new disk. The system is reinstalled with two new disks. After that, data from /scratch could be recovered from one of the old disks.

Reboot of login servers after network drive migration

  Begin         : 20140605  16:00
  End           : 20140605  ~17:00
  Affected      : Users of e.g. the Linux login servers (lilo2, lilo3, lilo4)

The planned move of network drives from an old to a new server caused a lot of "stale NFS file handles", which made it necessary to reboot some login servers.

Network drive migration

  Begin         : 20140605  16:00
  End           : 20140605  ~17:00
  Affected      : Users of the following network drives:
                  acfiles, bioniccell, bio-orgchem, celbio1, CSGI-Archief,
                  Csg-Staf, ds, evsf2schijf1, giphouse, gissig, gmi, hfml-backup,
                  hfml-backup2, highres, impuls, introcie, isis, kaartenbak,
                  microbiology, microbiology2, microbiologyftp, milkun2, milkun3,
                  milkun5, milkun6, milkun7, milkun7rw, mwstudiereis, neuroinf,
                  olympus, permissies, puc, secres2, sigma, sigmaalmanak, sigmacies,
                  sigmaexchange, sigmasymposium, Soheto, splex, spmdata3, spmdata4, 
                  spmdata5, spmdata6, spmdata7, stroom, thalia, ucm, vsc, vsc1, vsc2,
                  vsc3, vsc4, vsc5, vsc6, vsc7, vsc8, vsc9, vsc10, vsc11, vsc12,
                  vsc13, vsc14, vsc15,wiskalg, wiskunde

The network drives will be migrated to a new server. During the migration, the network drives will be unavailable.

IMAP mail server problem

  Begin         : 20140605  14:05
  End           : 20140605  14:15
  Affected      : all users wanting to read Science mail 

The IMAP server got overloaded again. Restarting the IMAP service was required to bring it back in operation. We think it is related to snapshots and therefore regularly remove invalid snapshots from now on.

IMAP mail server problem

  Begin         : 20140527  14:00
  End           : 20140527  14:10
  Affected      : all users wanting to read Science mail 

The IMAP server got overloaded. Restarting the IMAP service was required to bring it back in operation.

Data/file/vol server (heap) problem

  Begin         : 20140526 06:45
  End           : 20140526 13:45
  Affected      : Users of this data/file/vol server

The server had a problem, which had started at the Monday morning reboot. A reboot of the machine solved the problem. Soon all data will be moved to a new server. This will not cause much problems for users, because everybody uses the aliases something-srv.science.ru.nl.

Poster printer "kamerbreed" broken

  Begin         : 20140421
  End           : 20140516
  Affected      : Users of the poster printer "kamerbreed"

The printer has to be repaired. Spare parts have been ordered. We hope to have the printer operational again mid-May.

Maintenance wireless infrastructure May 19

  Begin         : 20140519 8:00 pm
  End           : 20140519 12:00 pm
  Affected      : Users of Eduroam and Ru-guest

At Monday May 19, a major upgrade of the wireless infrastructure and its maintenance tool (called Prime) will take place by the ISC. Therefore the wireless service (i.e. Eduroam) will be interrupted several times from 8:00 pm. Existing wireless connections will be lost and new connections will not be possible for some time. Due to the complexity of this operation this maintenance will possibly be continued at Monday, May 26, again with interruptions from 8:00 pm.

Maintenance Goudsmit

  Begin         : 20140512 13:00
  End           : 20140512 14:21
  Affected      : Users of the goudsmit

One of the systemdisks reported SMART-errors and will be replaced.

lilo.science.ru.nl offline

  Begin         : 20140512  6:30
  End           : 20140512  9:36
  Affected      : Users of the linux loginserver lilo

Lilo did not shut down correctly during the planned weekly reboot. After a hardware reset, the machine booted just fine

And yet again Print/phpMyAdmin server problem

  Begin         : 20140508 16:24
  End           : 20140508 16:33
  Affected      : Users of the printto/printsmb/phpMyAdmin service

Just like yesterday, a reboot of the machine was necessary to restore the functionality. Tomorrow morning, the server will restart with a new kernel. It this doesn't solve the problem, we will look into a new version of Samba.

Yet again Print/phpMyAdmin server problem

  Begin         : 20140507 12:49
  End           : 20140507 13:14
  Affected      : Users of the printto/printsmb/phpMyAdmin service

Just like yesterday, a reboot of the machine was necessary to restore the functionality. We will investigate if we can remediate this problem with newer software versions (Samba, Ubuntu, ...)

Eduroam problem

  Begin        : 20140506 13:40
  End          : 20140506 18:29
  Affected     : Eduroam users

Some Eduroam users didn't get an IP number, because one of the IP ranges had no free leases. This ranges contained all @science.ru.nl users. The ISC networking department first decreased the lease time. Now they also make this range free for use by @science.ru.nl users. This will prevent the problem from occuring again soon.

U-number authentication problem Eduroam

  Begin        : 20140430 ca 13:00
  End          : 20140501 16:25
  Affected     : Employees using  for Eduroam authentication, primarily within FNWI

The new RU IdM system erroneously rewrote the RU password hash when data about a person changed. This primariliy affected employees of FNWI, because for this group the Science email address had been incorporated in IdM. This will make it possible to use the Science address as an external mail address when resetting RU passwords. When the problem was recognized, it was fixed swiftly by restoring the RU password hashes from backup.

And again Print/phpMyAdmin server problem

  Begin         : 20140506 12:14
  End           : 20140506 12:35
  Affected      : Users of the printto/printsmb/phpMyAdmin service

Again a reboot of the machine was necessary to restore the functionality. As a patch we will reboot this server each night.

Peage MFP Huygens building in maintenance April 28/29/30

  Begin         : 20140428 ca. 12:00
  End           : 20140430 ca. 12:00
  Affected      : Users of Peage in the Huygens building

April 28/29/30, the Peage MFP and the Peage POS near the restaurant in the Huygens building cannot be used. The ISC will then investigate whether Peage can also be made available for employees instead of only for students. April 28, the MFP will be moved to the test location, April 30 it will be moved back.

After replacement still Print/phpMyAdmin server problem

  Begin         : 20140417 13:35
  End           : 20140417 13:50
  Affected      : Users of the printto/printsmb/phpMyAdmin service

Although the hardware of the server was replaced, the server didn't react for unknown reasons. A reboot of the machine solved the problem. We will think about ways to tackle this problem.

Print/phpMyAdmin server replacement Thursday April 17 08:30-09:20

  Begin         : 20140417 08:30
  End           : 20140417 09:20
  Affected      : Users of the printto/printsmb/phpMyAdmin service

Because of the recent problems with this server, we are going to replace the server, in order to prevent further service interruptions.

U:/ home server pile problem

  Begin         : 20140416 17:42 (and earlier for some users)
  End           : 20140416 ca. 18:00
  Affected      : Users of this U:/ home server

The server had a problem with one partition, which had started during the creation of new snapshots. We waited with the reboot untill after working hours. A reboot of the machine solved the problem. To prevent these problems in the future, we will no longer make local snapshots of homeservers, but of course the daily backups of the homeservers by the backup server will be continued.

And yet again Print/phpMyAdmin server problem

  Begin         : 20140414 13:05
  End           : 20140414 13:12
  Affected      : Users of the printto/printsmb/phpMyAdmin service

Just like four days ago, the server didn't react for unknown reasons. A reboot of the machine solved the problem. We plan to replace the server next Thursday 08:30-09:00 hours, in order to prevent further service interruptions.

Again Print/phpMyAdmin server problem

  Begin         : 20140410 12:54
  End           : 20140410 13:15
  Affected      : Users of the printto/printsmb/phpMyAdmin service

Just like two weeks ago, the server didn't react for unknown reasons. A reboot of the machine solved the problem. Together with the supplier we will try to find out what to replace to prevent this in the future.

U: / home server bundle problems

  Begin         : 20140404 ca. 11:35
  End           : 20140404 12:25
  Affected      : Users of this U: / home server

From the moment of yesterday's snapshots (13:00 hours), more and more processes were hung at the server. The first complaints arrived at C&CZ ca. 11:15 hours this morning. Therefore we decided ca. 11:35 to restart the server. The reboot resolved the problem. The number of snapshots will be reduced in order to try to prevent problems due to snapshots in the future.

Printer pr-hg-00-002 did not print from Windows/Mac

 Begin         : 20140402 11:32
 End           : 20140404 10:27
 Getroffen     : students of the Faculty of Science

Probably due to a too large printjob, the printer queue on the server (printto/printsmb/ooievaar) for the pr-hg-00-002 printer got stuck onWednesday morning. C&CZ was only informed of this problem at the end of Thursday. Friday morning this has been fixed by emptying the printqueue with old jobs.

Print/phpMyAdmin server problem

  Begin         : 20140327 10:40
  End           : 20140327 11:00
  Affected      : Users of the printto/printsmb/phpMyAdmin service

The server didn't react for unknown reasons. A reboot of the machine solved the problem.

Linux login server lilo3 reboot due to NFS-problems

  Begin         : 20140310 12:31
  End           : 20140310 13:53
  Affected      : Users of lilo3

The Linux login server lilo3 didn't have the homedirectories of some users. It took a reboot to fix this.

U:/ home server pile problem

  Begin         : 20140324 17:25
  End           : 20140324 17:35
  Affected      : Users of this U:/ home server

The server had a problem with one partition. A reboot of the machine solved the problem.

File server heap problems

  Begin         : 20140310 06:30
  End           : 20140310 10:10
  Affected      : Users of shared network disks

The fileserver failed to start after the Monday morning reboot, because of filesystem errors (LVmd2-7). Manually fixing the errors solved the problem and the machine started successfully.

U: / home server pile and Linux login server lilo/stitch problems

  Begin         : 20140305 ca. 13:00
  End           : 20140305 16:40
  Affected      : Users of this U: / home server or thee Linux loginservers

The server had a high load and stopped servicing users, probably because of problems with the creation of new snapshots at 13:00 hours. A reboot solved the problem.

Scan to email sometimes doesn't work for Konica Minolta MFPs

  Begin         : 2013???? ??:??
  End           : 20140226  
  Affected      : Users of a KM MFP C364e as a scanner through e-mail

Update: we changed the configuration of the network switchports of the KM's, which resolves the problem. This was necessary, because the KM's do not try hard enough to make a connection with the SMTP-server.

Update: an alternative is scanning to a USB stick. Log in at the MFP with the scan-pin, put a document in the feeder or on the document glass, select Scan and then plug the USB stick into the USB port on the right side of the MFP , on the top side. After a few seconds the MFP recognizes the USB stick and presents the choice "Save a document to external memory". Choose OK and press Start. See also [RU http://www.ru.nl/publish/pages/687597/uitrol-mf-scan.pdf manual] on the RU-page about the MFP's.

Users report to us that the scanning to e-mail sometimes doesn't work, with an error like "Server not found. Scan deleted". The problem has been reported to KM. As far as we know, the problem occurs only seldom. It can temporarily be resolved by switching the machine off and on again.

BASS Java 7 problem

  Begin        : 20131201
  End          : 20140210
  Affected     : Users of BASS with Oracle forms (Java)

The ISC let us know that after the installation of patches installed in BASS during the weekend of December 1, the possibility to use a recent version of Java (version 7) had erroneously disappeared. As of February 11, 2014 this issue has been resolved. The function in BASS to work with all versions of Java SE (JRE) on the client (PC) has been reactivated. Furthermore all JAR-files in BASS have received a security certificate (they have been signed), with which it possible for BASS to work with the latest Java SE versions (Java SE 7.51 en hoger). BASS users who work with Forms, will see pop-up windows asking "Do you want to run this application?". Of this a description has been made on the RU Intranet.

Network interruptions in Huygens wing 5

  Begin        : 20140203 19:00 hrs (7:00 pm)
  End          : 20140203 24:00 hrs (12:00 pm)
  Affected     : Users in Huygens wing 5

On Monday evening, February 3rd between 19:00 - 24:00 (7:00 - 12:00 pm) maintanance will be carried out on the network devices in Huygens Wing 5. Therefore the wired and wireless networks will not be available at some moments at all locations in this Wing, on all floors.

Again network interruptions in Huygens wing 2

  Begin        : 20140128 7:00 pm
  End          : 20140128 12:00 pm
  Affected     : Users in Huygens wing 2 and corridors between wings 2 and 4

The announced proceedings on Tuesday, January 21st on the network equipment unfortunately did not take place due to a sudden instability of the network in Wing 1 the same day. In order to prevent further instability issues in Wing 2, it was necessary to solve the problems in Wing 1 first. These problems have been identified and corrected. The network maintenance in Wing 2 will now take place Tuesday, January 28th between 19:00 and 24:00 ( 7:00 pm to 12:00 ) during which the wireless and cabled networks will be off line for some time. This will affect all floors in Wing 2 and rooms along the corridors between wings 2 and 4 (all floors).

Network interruptions in Huygens wing 2

  Begin        : 20140121 7:00 pm
  End          : 20140121 12:00 pm
  Affected     : Users in Huygens wing 2 and corridors between wings 2 and 4

On Tuesday evening January 21 between 7:00 pm and 12:00 pm maintanance will be carried out on the network devices in Huygens wing 2. Therefore the wired and wireless networks will not be available at some moments at all locations in wing 2 and corridors between wings 2 and 4, on all floors.

Network interruptions in Huygens wing 1

  Begin        : 20140113 7:00 pm
  End          : 20140113 12:00 pm
  Affected     : Users in Huygens wing 1 and corridors between wings 1 and 3

On Monday evening January 13 between 7:00 pm and 12:00 pm maintanance will be carried out on the network devices in Huygens wing 1. Therefore the wired and wireless networks will not be available at some moments in many locations in wing 1 and corridors between wings 1 and 3, on all floors.

Network problems

  Begin        : 20140106 11:29
  End          : 20140106 11:40

Due to an error by an ISC network administrator, all internal and external RU network traffic was blocked for a short period.

Moving network shares to new servers

  Begin         : 20131223
  End           : 20131224
  Affected      : Users of the following network shares:
                  acfiles2 botany botany-general carta comsol digicd encapson encapson2
                  exoarchief felix gi2 gi3 hfml-45t hfml-data hfml-engineering ifl iris
                  mailmanincludes mbaudit1 mbaudit2 mbbioel mbcns mbcortex mbdata mbread
                  mbwrite mestrelab mi2 mi3 milkun4 milkun4rw molchem molchem2 molphtec
                  mol-secr multimedia neuropi ns3 nwi-backup onlyme owc pcb planthgl
                  plantkunde-hgl sdisk/software share snn sofie spmdata1 tdisk/cursus tece teceleiding
                  temp tracegastemp wallpaper xpcursus xpsoftware

In the course of these two days, above network shares will be moved to new servers. During the move action, a share will not be available. If you encounter any problems connecting to a network share using Windows, always connect using the "share-name minus srv" naming scheme: \\sharenaam-srv.science.ru.nl\sharenaam. See also Diskruimte#Naming.

Postponed replacement of home-server "bundle"

  Begin        : 20131223 07:30
  End          : 20131223 09:00 (expected)
  Affected     : Users with homedirectory server "bundle" (as can be seen on http://DIY.science.ru.nl)

The replacement of the old home server "bundle" has been postponed and will now guaranteed take place on Monday morning December 23. Because the data have been synchronized with the new server, there will not be much downtime. The new server should be very dependable: hardware RAID-6, double processors and power supplies and a 5-year support contract from the supplier. The performance has improved, e.g. by using hardware RAID with a 1 GB write cache with battery backup.


Interruption of network in Huygens wing 1

  Begin        : 20131217 19:00
  End          : 20131217 24:00
  Affected     : Users of the network in Huygens wing 1, especially the ground floor.

Tuesday night December 17 19:00-24:00 hours, maintenance work will be done affecting the data network in Huygens wing 1. The wired and wireless network will suffer service interruptions a few times during that period. The main inconvenience will be on the ground floor.

Service interruption Konica Minolta MFP's outside of Huygens building

  Begin         : 20131209 12:48
  End           : 20131209 13:40 after reset of a KM MFP
  Affected      : Users of the KM MFP's outside of the Huygens building

C&CZ changed the DHCP-configuration of the KM MFP's, because KM had told us all KM's had an identical configuration after the upgrade of the night of December 4. Soon thereafter the KM's outside of the Huygens building showed problems. After changing the DHCP-configuration back to the previous version and restarting the MFP's, the problem was resolved. The firmware upgrade of these machines still has to take place.

Konica Minolta MFP's firmware upgrade

  Begin         : 20131204 02:00
  End           : 20131204 06:00
  Affected      : Nightly users of the KM MFP's

Wednesday night, the firmware of the Konica Minolta multifunctionals will be upgraded. This should resolve existing problems, like the not waking up from sleep mode. Please report all remaining problems, for MFP-hardware and paper to KM via phone: 55955 option 4.

Power dip December 3

  Begin         : 20131203  ca 09:36
  End           : 20131203  ca 10:00
  Affected      : all RU/UMCN users

Tuesday morning around half past nine, there was a short power dip for RU/UMCN, probably due to a switch error at Liander. This power dip, that is not listed on the power interruption website, made all systems restart that were not on UPS power. Because only the network switches in Huygens wing 1 and 7 are on UPS power, a lot of users lost their connection to the network, including wireless and IP-telephony, for about 20 minutes. Apparatus that restarted faster than the network, might have needed an extra restart to restore the connection to the network.

Poster printer "kamerbreed" broken

  Begin         : 20130925 17:00
  End           : 20131120 14:00
  Affected      : Users of the poster printer "kamerbreed"

The motherboard of the printer had to be replaced. Because there is no maintenance contract for this old printer and spare parts were hard to get, repair has taken a long time.

Gipphoenix network problem

  Begin         : 20131118 06:33
  End           : 20131029 09:16
  Affected      : Users of virtual machines hosted by gipphoenix

For unknown reasons, the network interface of the server was not online. Running ifdown / ifup eth2 resolved the problem.

Web lectures unavailable for 20 minutes

  Begin         : 20131115 10:35
  End           : 20131029 17:17
  Affected      : Users of Blackboard / Web lectures

For unknown reasons one of the servers had a high load and stopped servicing users. A reboot solved the problem.

U: / home server bundle problems

  Begin         : 20131029 ca. 16:00
  End           : 20131029 17:17
  Affected      : Users of this U: / home server

For unknown reasons the server had a high load and stopped servicing users. A reboot solved the problem.

Again IMAP mail server problem

  Begin         : 20131028 13:35 
  End           : 20131028 14:08
  Affected      : all users wanting to read Science mail 

Just like 3 days ago, the IMAP server got overloaded after noon. It took a reboot to bring the service back in operation. We suspect that our making of a backup-snapshot triggers this and now have disabled the snapshot during working hours.

IMAP mail server problem

  Begin         : 20131025 13:23 
  End           : 20131025 14:18
  Affected      : all users wanting to read Science mail 

During an extra backup the IMAP server got overloaded. It took a reboot to bring the service back in operation.

print_mail_to_link('Wireless_', 'ru.nl_authentication_problem')">Wireless @ru.nl authentication problem

  Begin         : 20131023  ca 14:30
  End           : 20131023  ca 16:15
  Affected      : all wireless @ru.nl users 

Yesterday afternoon around 14:30 the ISC conducted a seemingly innocent maintenance on the LDAP-server, but immediately after that auth-requests from Radius were no longer serviced. This made it impossible for wireless users to authenticate with their u/s/e number. Users in the realm @science.ru.nl were not affected by this.

Power dip October 22, ca. 11:00

  Begin         : 20131022  ca 10:55
  End           : 20131022  ca 11:00
  Affected      : all RU/UMCN users

Tuesday morning around 11 o'clock, there was a short power dip for RU/UMCN. This power dip, that is not listed on the power interruption website, made all systems restart that were not on emergency power. Because only the network switches in Huygens wing 1 and 7 are on emergency power, a lot of users lost their connection to the network, including wireless and IP-telephony, for about 5 minutes. Apparatus that restarted faster than the network, might have needed an extra restart to restore the connection to the network. A department reported that a departmental printer did not survive the power dip.

Again mail problems after supplying password to phishers

 Begin         : 20131010 00:04
 End           : 20131010 00:33
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

Printto/printsmb/phpmyadmin server problems again

  Begin         : 20131001 15:18
  End           : 20131001 15:31
  Affected      : Users of phpmyadmin, printto and printsmb printer servers

The printserver crashed again, for yet unknown reasons. A reboot of the machine solved the problem.

U: / home server bundle problems

  Begin         : 20130930 13:46
  End           : 20130930 14:02
  Affected      : Users of this U: / home server

The server crashed. A reboot of the machine solved the problem.

Printto/printsmb/phpmyadmin server problems

  Begin         : 20130927 11:02
  End           : 20130927 11:10
  Affected      : Users of phpmyadmin, printto and printsmb printer servers

The printserver crashed. A reboot of the machine solved the problem.

Sound on Linux dual-boot PC's disabled

  Begin         : 20130923
  End           : 201?????
  Affected      : Users of sound in Linux on dual boot PC's

The availability of sound in Windows appeared to be depending on the state in which Linux had left it. Therefore the sound in Linux on dual boot PC's has been disabled, in order to have it available on Windows.

BASS RU problem

  Begin         : 20130603     00:00
  End           : 20130926     08:00
  Affected      : all FNWI users using the Port Forwarder to connect to BASS RU.

The central BASS RU environment has been updated last weekend. Part of this upgrade was a change of the web address of the second logon screen, as announced on our BASS page. This morning it became clear that access to the second logon screen via the Port Forwarder didn't work anymore. Therefore the UCI rolled back the change of the second logon screen. This means that AFTER LOGGING ON TO THE PROXY at https://admin.ru.nl/ BASS is available again via https://bassruap01.uci.ru.nl:8010/OA_HTML/AppsLocalLogin.jsp

This problem will be finally resolved when the Port Forwarder will be stopped on October 1, 2013.

Printto/printsmb server problems

  Begin         : 20130924 11:41
  End           : 20130924 12:22
  Affected      : Users of the printto and printsmb printer servers

The printserver crashed. A reboot of the machine solved the problem.

File server heap problems

  Begin         : 20130917 16:25
  End           : 20130917 17:15
  Affected      : Users of 75 shared network disks

The fileserver failed to see its disks. A reboot of the machine solved the problem.

File server chunk failed reboot

  Begin         : 20130916 06:30
  End           : 20130916 13:05
  Affected      : Users of the following disks:
                arb botgarden ccs4 excienwi exoarchief FIT gi1 gipsy isis-dhz
                itt iwwr1 leonardo mercator micord molspec olcwis ons pvs
                ratio tzacad vb vscxray WiskundeToernooi zeegras

The fileserver failed the regular Monday morning reboot. Only after a rescue boot and a removal of all snapshots rebooting the machine worked.

Mail problems after supplying password to phishers

 Begin         : 20130906 10:13
 End           : 20130906 10:20
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

Peter van Campen 6 sep 2013 11:17 (CEST)

B/W KM printer pr-hg-00-002 printed in color

 Begin         : 20130902 13:33
 End           : 20130903 15:02
 Getroffen     : students of the Faculty of Science

The b/w tuned Konica Minolta printer/MFP pr-hg-00-002 suddenly printed in color. C&CZ refunded the student's budget and stopped the printer until KM had corrected the settings.

Peter van Campen 3 sep 2013 17:49 (CEST)

And again mail problems after supplying password to phishers

 Begin         : 20130904 12:22
 End           : 20130904 12:30
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

Peter van Campen 4 sep 2013 12:38 (CEST)

And yet again mail problems after supplying password to phishers

 Begin         : 20130903 13:12
 End           : 20130903 13:30
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

Peter van Campen 3 sep 2013 13:33 (CEST)

DNS nameserver problem

  Begin         : 20130902     04:30
  End           : 20130902     08:30
  Affected      : DNS clients

The DNS server on ns1.science.ru.nl didn't start after the reboot, due to a syntax error in one of the zone files. When this had been corrected, it started without problems.

Peter van Campen 2 sep 2013 18:27 (CEST)

Wireless access to BASS problem

  Begin         : 20130812     00:00
  End           : 20130823     ca. 15:00
  Affected      : all FNWI users using the wireless network to connect to bass.ru.nl.

From August 12 the wireless network in the Science buildings is being replaced. The new IP numbers appear to not be able to connect directly to BASS. We expect that this will be changed soon.

Peter van Campen 20 aug 2013 11:56 (CEST)

Yet again mail problems after supplying password to phishers

 Begin         : 20130829 05:30
 End           : 20130829 06:30
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

Peter van Campen 29 aug 2013 06:36 (CEST)

Fileserver chunk needs reboot

 Begin         : 20130829 07:30
 End           : 20130829 08:00 (expected)
 Affected      : Users of file services:
                 arb botgarden ccs4 excienwi exoarchief FIT gi1 gipsy isis-dhz
                 itt iwwr1 leonardo mercator micord molspec olcwis ons pvs
                 ratio tzacad vb vscxray WiskundeToernooi zeegras

Wim Janssen 28 aug 2013 25:37 (CEST)


Again mail problems after supplying password to phishers

 Begin         : 20130822 22:38
 End           : 20130822 23:18
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

Peter van Campen 22 aug 2013 23:27 (CEST)

Mail problems after supplying password to phishers

 Begin         : 20130822 01:45
 End           : 20130822 06:30
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

Peter van Campen 22 aug 2013 06:34 (CEST)

Windows Roaming Profile problem

 Begin         : 20130801 00:00
 End           : 20130807 12:00
 Affected      : Windows B-Fac Users with roaming profiles

An error in distributing a Group Policy Object (GPO) caused roaming profiles to fail for the last week. The start date/time of the problem is unknown, This is an estimate.

Wim Janssen 7 aug 2013 12:23 (CEST)

Lilo3 reboot for different IP number

 Begin         : 20130715 11:00
 End           : 20130715 11:15
 Affected      : Users of lilo (lilo3)

Today lilo3 appeared to have an incorrectly chosen IP number. A reboot fixed this.

Peter van Campen 15 jul 2013 12:56 (CEST)

Mail problems after supplying password to phishers

 Begin         : 20130701 23:50
 End           : 20130702 00:35
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

Peter van Campen 2 jul 2013 00:42 (CEST)

Mail problems after supplying password to phishers

 Begin         : 20130627 17:49
 End           : 20130627 18:19
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

Wim Janssen 28 jun 2013 23:20 (CEST)

Mail problems after supplying password to phishers

 Begin         : 20130627 17:49
 End           : 20130627 18:19
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

Peter van Campen 27 jun 2013 18:23 (CEST)

Mail problems after supplying password to phishers

 Begin         : 20130624 09:00
 End           : 20130624 10:05
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

Wim Janssen 24 juni 2013 11:06 (CEST)

Mail problems after supplying password to phishers

 Begin         : 20130623 15:54
 End           : 20130623 16:30
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

DNS nameserver problem

  Begin         : 20130617     04:30
  End           : 20130617     08:45
  Affected      : DNS clients

The DNS server on ns1.science.ru.nl didn't start after the reboot, due to a syntax error in one of the zone files. When this had been coreected, it started without problems.

Peter van Campen 17 jun 2013 11:19 (CEST)

Mail problem (IMAP server)

  Begin         : 20130612     13:45
  End           : 20130612     14:27
  Affected      : all FNWI users wanting to read mail on the IMAP-server.

The IMAP server had a problem, which made a reboot necessary.

Peter van Campen 12 jun 2013 14:29 (CEST)

Maintenance Wireless@RU

  Begin         : 20130606     10:00 pm
  End           : 20130606     12:00 pm
  Affected      : all users using the wireless networks ru-wlan and eduroam.

On Thursday June 6th from 10:00 pm the wireless networks ru-wlan and eduroam will be unavailable for at least 2 hours. All existing connections will be cut off. The wireless network Science however will not be effected and will be kept available.

Marcel Kuppens 3 jun 2013 13:51 (CEST)

OpenID server problem

  Begin         : 20130524 ca. 03:30
  End           : 20130529     14:15
  Affected      : all FNWI users wanting to login on a wiki using OpenID.

During the renewal of the OpenID server a configuration error was made. When users reported problems logging in, we started searching for the origin of problem. When it was found, it could easily be corrected and the server could be restarted with the correct configuration.

Peter van Campen 29 mei 2013 16:49 (CEST)

Homeserver bundle crashed

  Begin         : 20130521 ca. 17:00
  End           : 20130521     17:27
  Affected      : all FNWI users with a homedirectory on fileserver bundle

The fileserver crashed. After a reboot everything was back to normal.

Peter van Campen 22 mei 2013 13:58 (CEST)

Fileserver stack crashed due to failed hard disk

  Begin         : 20130429 04:08
  End           : 20130501 09:15
  Affected      : Users of disk volumes/network shares on file server Stack.
  Problem       : Stack: Crash due to defective disk
  Solution      : Deactivated disk using hot spare and reboot

Wim Janssen 02 may 2013 10:00 (CET)

Homeserver bundle failed reboot

  Begin         : 20130429 06:30
  End           : 20130429 11:30
  Affected      : all FNWI users with a homedirectory on fileserver bundle

The fileserver failed to reboot during the regular Monday morning shutdown schedule. It was possible to gain access to the system console only after having removed all power from the chassis. Snapshots were removed using the rescue reboot but rebooting the machine resulted in a faulty filesystem. We were able to boot the system after all filesystems had been checked offline. These actions resulted in a unusual long downtime.

Erik Joost Visser 29 apr 2013 12:00 (CET)

Homeserver bundle failed reboot

  Begin         : 20130422 06:30
  End           : 20130422 09:50
  Affected      : all FNWI users with a homedirectory on fileserver bundle

The fileserver failed the regular Monday morning reboot. Only after a rescue boot and a manual removal of all snapshots rebooting the machine worked.

Disk server stack offline

  Begin        : 20130408 08:55
  End          : 20130408 09:30
  Affected     : Users of disk volumes/network shares on file server Stack.
  Problem      : Stack:  Crash due to failing disk
  Solution     : Deactivated disk using hot spare

Disk server pile offline

  Begin        : 20130408 06:30
  End          : 20130408 08:15
  Affected     : Users of disk volumes on file server Pile (userhomes).
  Problem      : Pile: Did not shutdown properly during weekly reboot due to a kernel panic which was 
  Solution     : Executed power-cycle of the system

Erik Joost Visser 8 apr 2013 9:30 (CET)

Mail problems after supplying password to phishers

 Begin         : 20130319 11:45
 End           : 20130319 12:14
 Affected      : Users of Science mail

Again a Science user supplied his Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam.

PLEASE: do not naively click on a link in an e-mail!

File server miii no private network

  Begin        : 20130315 17:30
  End          : 20130318 09:00
  Affected     : Users of network disks of the server miii.
  Problem      : While performing maintenance (removing a network card) a network cable was not inserted correctly.

Wim Janssen 18 mrt 2013 11:11 (CET)

Disk server pile offline

  Begin        : 20130318 06:30
  End          : 20130318 07:36
  Affected     : Users of disk volumes on file server Pile.
  Problem      : Pile: Was waiting for interactive input after reporting a warning (^d)

Network problems due to installation of Matlab R2013a

Begin         : 20130313 13:00
End           : 20130313 13:40
Affected      : users of the network

Yesterday Matlab R2013a has been installed. Today at 13:00 hours many servers started to automatically copy this 5.4 GB to their local disc. Some parts of the network were overloaded by all these copying, which made accessing the network slow for many users. The distribution of this software will now be scheduled to happen over a longer period, primarily outside of working hours.

Peter van Campen 13 mrt 2013 18:31 (CET)

More IP-numbers for ru-wlan and Science (wireless)

Monday, March 4th 2013 at 18:00 hours, the number of IP numbers that is available in the FNWI buildings for ru-wlan and Science will be doubled. Because ru-wlan moves to a new range, users of ru-wlan will lose connectivity for at most 15 minutes. There was already a plan to replace ru-wlan and Science within the FNWI buildings by the RU-wide Eduroam and ru-wlan, but the wireless network usage has grown so fast that we can not wait for this plan to be realized. Last week some wireless users at times could not even get an IP address, although the lease time had been brought down to 30 minutes. Therefore this temporary measure became necessary on such short notice.

Marcel Kuppens 4 mar 2013 12:20 (CET)

Short interval in wireless network service

On Monday feb 18 at 6:00 pm there will be some maintenance at the wireless network which will effect the following locations at Toernooiveld:

FNWI cellar A1
FEL
Huygens: Library of Science, terrace behind Huygens, cantine, room HG-1.132
KDV1 en KDV2
Linnaeus building
Logistic Centre
Mercator I
Mercator II, ground floor and 7nd floor
Mercator III, 2nd floor
Transitorium FNWI (ACSW and FELIX)
UBC

We expect the service will be completely available again within 30 minutes.

Marcel Kuppens 18 feb 2013 10:53 (CET)

Mail problems after supplying password to phishers

 Begin         : 20130212
 End           : 20130214 (for now)
 Affected      : Users of Science mail, specifically of horde webmail

The last few days three Science users have supplied their Science password to phishers. We notice that because Internet criminals use these passwords to get into the Science mail servers (horde webmail, smtp) in order to send spam. This time they even used a fake copy of the horde Science webmail website. The big differences with the real horde Science webmail website are:

  • the URL is not within science.ru.nl
  • the connection is not a secure https connection, there is no lock
  • the username and password do not arrive at C&CZ servers, but in the hands of Internet criminals.

PLEASE: do not naively click on a link in an e-mail!

New Radius server for ru-wlan and eduroam (wireless)

On Monday, January 28th 2013 at 8:00 am, one of the servers that is being used by the wireless network of the RU, will be replaced. This replacement will affect you as a user of the wireless networks ru-wlan and eduroam: There will appear a new certificate when connecting. You can just accept this, after which the connection should work. If this appears not to be the case, then it’s best that you remove your old Eduroam- respectively your old RU-WLAN settings first to activate the new connection .

Specifically for iPhone / iPad users: We recommend that you first remove your old Eduroam- respectively your old RU-WLAN profile before activating the new connection without a profile. If that unexpectedly fails, please review the information on www.ru.nl/wireless for iPhone/iPads. If necessary, you can also download a new profile from that site.

Marcel Kuppens 17 jan 2013 10:53 (CET)

Homeserver bundle crashed

  Begin         : 2013-01-16 ~ 13:30
  Einde         : 2013-01-16 ~ 14:00
  Affected      : all FNWI users with a homedirectory on fileserver bundle

Because the file server crashed, it had to be rebooted.

LDAP server vernieuwd

  Date         : 20121214
  Affected     : Users with a Fedora based desktop PC

Older Fedora desktop PC's may experience startup problems after an upgrade of one of our LDAP servers. A fix is available and has been applied. If you still encounter this problem, please contact C&CZ.

Mail problems after supplying password to phishers

  Begin        : 20121116 04:45
  End          : 20121117 ca 12:00
  Affected     : Users of horde webmail and users wanting to send mail to e.g. hotmail.com

Horde webmail again appeared to be misused for sending spam. This could happen because a naive user gave the Science password to phishers/spammers. After first stopping horde, early Friday morning we disabled the account of the naive user and restarted horde. Saturday morning it appeared that this short spam-outbreak had caused administrators of hotmail.com to add our mail server to their blacklist. Therefore we switched the IP-number of this mail server Saturday morning.

Homeserver bundle will be rebooted

  Begin         : 2012-10-24 ~ 12:45
  Einde         : 2012-10-24 ~ 13:00
  Affected      : all FNWI users with a homedirectory on fileserver bunlde

Because the file server refuses to accept a spare disk, it needs a reboot.

Homeserver bundle unavailable

  Begin         : 2012-10-22 12:15
  End           : 2012-10-22 13:00
  Affected      : all FNWI users with a homedirectory on fileserver bunlde

At the moment, we are solving the problem.

Services unavailable due to power and network outage

  Begin         : 20121018 03:00
  End           : 20121018 10:00
  Affected      : all users until 09:30; afterwards: "bundle" home directories, wireless, "plus" network shares and several websites

During the night of wednesday on thursday a power outage resulted in a network outage in the basement computing facilities. The power was restored to the network equipment using a bypass thus circumventing the UPS at about 09:15. Further checks implied that most servers had not become powerless so that most services became automatically available again. Network drivers on "bundle" had to be restarted in order to get access to home directories for a large number of users. Furthermore, several websites had to be restarted which made it possible for PC's to boot properly. During the day, an unrelated issue with the RAID storage of "plus" has been fixed as well granting access to the following network shares: sofie, ams*, molchem, mb*, encapson, milkun4, snn, neuropi, digicd. carta, ... Since wireless devices were unable to acquire IP addresses, i.e. gain access to the network, a split-brain situation was diagnosed within the DHCP service which was resolved around 13:00.

Announced downtime: home server "pile" down for reboot

  Begin        : 20121012 07:00
  End          : 20121012 09:00
  Affected     : Users with homedirectory server "pile" (as can be seen on http://DIY.science.ru.nl)

Next Friday morning, the home server "pile" will be rebooted. There are problems with the snapshots, which could make a reboot take more time. Therefore we schedule the reboot for early next Friday.

Peage top-up unit near Huygens restaurant in maintenance

In order to test new software, the Peage top-up unit near the Huygens restaurant was switched to maintenance mode. This unit is not used often yet, therefore this wil not have caused problems. Students that wanted to top-up their Peage account, could do that only elsewhere on campus. See the http://www.ru.nl/peage Peage website], locations are the halls of the Erasmus, Spinoza and Library buildings.

Eduroam incoming doesn't work for iPhone/iPad/iPod

  Begin         : spring 2012 (?)
  End           : 20121005
  Affected      : incoming Eduroam users with an iPhone/iPad/iPod

The UCI network management reports that at this moment the incoming version of Eduroam doesn't work for iPhone/iPad/iPod. A solution is being worked upon. Eduroam incoming means that one uses the wireless network of a remote institute, with authentication (login/password) being checked by RU or Science.

Horde webmail server down because of spam

  Begin        : 20120925 23:05
  End          : 20120926 10:20
  Affected     : Users of horde webmail

Yesterday evening, horde webmail appeared to be misused for sending spam. This could happen because a naive user gave the Science password to spammers. First we stopped horde. This morning we disabled the account of the naive user and restarted horde.

Disk server "Stack" offline

  Begin        : 20120924 06:30
  End          : 20120924 09:35
  Affected     : Users of disk volumes on file server Stack.

Disk server "Plenty" offline

  Begin        : 20120924 06:30
  End          : 20120924 09:00
  Affected     : Users of disk volumes on file server Plenty. The S and T disks that are used in the PC rooms.

During the weekly reboot (monday mornings), the server got stuck in the BIOS.

Announced downtime: home server "pile" down for replacement

  Begin        : 20120724 07:00
  End          : 20120724 09:00 (ca)
  Affected     : Users with homedirectory server "pile" (as can be seen on http://DIY.science.ru.nl)

Next Tuesday morning, the home server "pile" will be replaced by a new, more powerful server. Because the data have been synchronized with the new server, there will not be much downtime.

Postponed: Announced downtime: home server "pile" down for replacement

The downtime below has been postponed, because we had a few questions on the new server, that could not be answered in time. To be continued...

  Begin        : 20120724 07:00
  End          : 20120724 09:00 (ca)
  Affected     : Users with homedirectory server "pile" (as can be seen on http://DIY.science.ru.nl)

Next Tuesday morning, the home server "pile" will be replaced by a new, more powerful server. Because the data have been synchronized with the new server, there will not be much downtime. The new server should be very dependable: hardware RAID-6, double processors and power supplies and a 5-year support contract from the supplier. The performance has improved, e.g. by using hardware RAID with a 1 GB write cache with battery backup.

Partly announced downtime for mailman + horde webmail server

  Begin        : 20120712 09:09
  End          : 20120712 14:00 (ca)
  Affected     : Users of horde webmail and/or mailman mailing lists

This morning, horde webmail appeared to be misused for sending spam. This could happen because naive users gave their Science password to spammers. After we found out who the users were and had them change their password, we decide to also replace a defective cpu fan. Therefore also Mailman mailing lists will be down from 13:00 to 14:00 hours.


SMTP server blacklisted by MS Live Hotmail

  Begin        : 20120711 03:08
  End          : 20120711 14:55
  Affected     : Science mail users trying to send mail to MS-domains: hotmail.com, live.com, ...

This morning, users reported that mail from smtp.science.ru.nl to hotmail users was being bounced by hotmail. We have tried to let the hotmail administrators change this fast, but when this took too long, we changed the IP-number of our smtp-server.

Planned service interruption: file server with problems

  Begin        : 20120622 17:03
  End          : 20120624 19:30
  Affected     : stack fileservices

A hardware failure of a boot disk of the fileserver stack was reported Friday morning June 22. We decided to repair this after working hours. Thus at approximately 17:00 the defective boot disk was removed from the machine and replaced by a spare one. Enabling the disk, making it bootable, restoring file systems and rebooting the machine (after removing all snapshots) took a lot of time. When this was resolved Friday evening, the NFS/SMB fileservice was not active on the mounted filesystems. It took a reboot Sunday evening to resolve all problems.

Tracelab server poly defective

  Begin       : 20120621 14:12
  End         : 20120621 17:15
  Affected    : Tracelab for users. For administrators also Prism&Deploy and the WDS-service

A hardware failure of the server poly was reported at 2012-06-21 14:12. After a restart of the machine, it stopped working again. No more recoveries were attempted and an identical spare machine was outfitted with the disks from the defective server. Disks had to be synchronized before making the machine available again.

Servers without electric power

  Begin       : 20120607 13:45
  End         : 20120607 15:30
  Affected    : e-mail and users of the fileservers bundle, heap and stack

A power failure in a rack in a server room brought some C&CZ servers down. After less than two hours all problems were dealt with. Affected systems ware mainly: postvak (Science mail server), bundle (user homedisk), heap/stack (network discs), resser/kookpunt/brievenbus/rustug (mail transport smtp servers)

Planned Service: website-databases and maybe Linux clients

20 Apr 2012 17:00 - 17:15

A defective hard disc has been replaced in a server, but the server needs to be rebooted to ensure that this is reboot proof. The MySQL database of roughly 70 websites will therefore be down for a short time. Since this server also provides the Kerberos authentication for Linux clients, Linux clients might encounter service interruptions during a short period.

Windows server "plenty" with xpsoftware unavailable

Thursday July 7, around 13.00 hours the server "plenty" could not be reached. Because this server serves the "xpsoftware" share for the Managed Windows PC's, all these PC's had a problem. After the server was restarted and the disks had been checked, it was available again at 14:26.


Downtime Science servers: Sunday July 3, 09:00 - 12:00 hours

In order to improve the cooling of a server room, we plan to move three racks of Science servers a few meters on Sunday morning, July 3. We will have to switch off a lot of servers temporarily. Therefore several services will be unavailable some time starting July 3, 09:00 hours. We expect the downtime will last until 10:00 hours for servers with a lot of different users. The cn compute cluster will probably be fully operational again at 12:00 hours.

The servers/services affected are:

fileservers: plenty/pile/bundle with shares like:
             amsbackup2 bbb-priv botany bsweet comsol exoarchief gi3 hfml-data ifl iris
             lambiek mestrelab mi1/2/3 molchem2 molphtec morph multimedia olsen pcb planthgl
             sdisk share snn2 spmdata1 tdisk tece temp wallpaper xpcursus xpsoftware
potkast: films via Blackboard
ts2: Windows Terminal Server
lilo1: Linux Login Server, alternative: lilo/lilo2
cn compute cluster
horde webmail
License server for: Comsol

With apologies for the inconvenience
C&CZ

Peter van Campen 22 jun 2011 09:57 (UTC)

Network outage June 22, 10:55-11:30

This morning, in the network hub for Huygens South a UPS (battery power supply) went down, which made a set of network switches loose power. Because of this, users in Huygens wing 1 and spin-off companies lost their connection to the network. After bypassing the UPS, everything was up and running again at 11:30. We are still searching for the exact origin of this outage.

New SSH keys for new login servers

The LInux LOgin server lilo has been replaced. The name now points to the new machine lilo2, because that one is faster than the other login server lilo1. Therefore it is quite normal to accept once the new SSH-key.

Planned Service: Limited computer services

12 Feb 2011 7:00 - 11:00

A backup cooling system will be installed in our main computer room. Therefore the air conditioning system must be switched off, which means that most of the computer facilities in this room must be shut down. This includes the cluster nodes cn00 through cn53 and many of the web- and file- (network share) servers. It is advised to expect a very limited service level. We will try to keep all home directories and the mail system available. For detailed information about the impact please contact C&CZ.

Printer lp5

24 Jan 2011 - 11 Mar 2011

Printer lp5 has been moved to HG00.089. You can't use this printer at the moment, there's a problem with the power supply unit.

Fixed phone problem

7 Mrt 2011

You can't reach certain fixed phones at the university right now, mobile phones and Skype do work ok though.

Mailserver blacklisted

4 Feb 2011 9:00 - 12:00

One of our mail servers has been sending loads of spam after a successful phishing attack. Since then, our server has been blacklisted on several domains. Currently this affects the delivery of email to @hotmail and @live addresses.