[Ncep.nhc.nco_contacts] TSB Morning Rounds - Fri Jan 27, 2017
nhc.tsbadmin at noaa.gov
Fri Jan 27 14:17:29 UTC 2017
Here is today's summary of NHC computer operations:
--- Craig Mattocks
1. GFE/NCP issues: In addition to a lack of 00Z model data, there were
significant problems with AWIPS, both GFE and Textws. The LX2 and LX3
workstations froze. Called NCF and they attempted to restart, but the
workstations froze again. Atlantic text products were formatted and sent on
LX7 early based on the evening shift's database in order to get them out
before all workstations crashed. WFO-Miami reported similar issues. NCF
suggested we reboot manually by power-cycling (hard rebooting) the boxes
because the systems get hung up when trying to kill all of the processes
(according to AWIPS Sysadmin Chris Mello). This worked. Products resumed as
normal. 06Z data populated on time. Grids were then updated.
No new issues.
1. WCOSS Cray transfer jobs started failing due to Luna connection issues.
The problem was traced to a failure in the DDN (DirectData Networks)
high-performance raid storage controllers, which had gone into a
force-verify state. The file system became sluggish to unavailable and
users could not log in - stale NFS file handle errors. Cray is working with
the vendor to resolve the problem. The parallel production test originally
scheduled in Reston for today has been postponed until early next week.
2. Team ATCF is conducting a 30-day test of a new, streamlined version of
the NHC guidance suite/spaghetti models on the WCOSS Crays, which includes
a test of the ATCF systems at CPHC.
3. NHC is planning a series of PSurge test simulations in the coming weeks.
The first run will be on Monday at 15Z (10 am EST) for the storm AL812017.
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Ncep.nhc.nco_contacts