[Ncep.list.nems.announce] Planned commit to NEMS trunk: (step 3 under NEMS ticket #41)

Shrinivas Moorthi shrinivas.moorthi at noaa.gov
Wed Sep 18 15:54:10 UTC 2013


My regression test seems to be progressing; it is in the 9th test now.
Hopefully, it will finish.
Moorthi
On 09/18/2013 11:48 AM, Raghu Reddy wrote:
>
> It so happens I was also doing the same thing!  And it has completed 
> seven tests so far and seems to be progressing well.
>
> And as Yusong has explained, we are working on different ways of 
> addressing this issue.
>
> Thanks,
>
> --Raghu
>
> *From:*ncep.list.nems.announce-bounces at lstsrv.ncep.noaa.gov 
> [mailto:ncep.list.nems.announce-bounces at lstsrv.ncep.noaa.gov] *On 
> Behalf Of *Yusong Wang - NOAA Affiliate
> *Sent:* Wednesday, September 18, 2013 11:38 AM
> *To:* Ratko Vasic
> *Cc:* ncep.list.nems.announce at lstsrv.ncep.noaa.gov
> *Subject:* Re: [Ncep.list.nems.announce] Planned commit to NEMS trunk: 
> (step 3 under NEMS ticket #41)
>
> Ratko, Moorthi and others,
>
> We did implement wrappers for both qstat and qsub to work around the 
> PBS issues and replaced the default version during the maintenance.
>
> Under normal conditions, the new wrappers are much more stable than 
> the original qsub and qstat. While as Ratko pointed out, there are 
> massive number of jobs submitted in the last couple of days, which 
> makes the system (especially the Moab/Torque) over-loaded. We are 
> working with the vendor to find a feasible way to prevent this from 
> happening in the future.
>
> The Zeus team has been working with the user to reduce the workload in 
> the last couple of days. We have observed the throughput of PBS in the 
> last 24 hours has been improved significantly due to the reduced 
> workload on the system.
>
> I am running a previous version of NEMS regression test on Zeus this 
> morning. So far, 11 tests passed without any issue. Please give 
> another try to see if it is any better today.
>
> Thanks for your patience.
>
> On Wed, Sep 18, 2013 at 11:19 AM, Ratko Vasic <ratko.vasic at noaa.gov 
> <mailto:ratko.vasic at noaa.gov>> wrote:
>
> This is PBS error. I thought they solved problem: there was user
> submitting several thousands of jobs at same time(more than 6k).Now I
> see only ~1500 jobs on Hold (from same user).
>
> Ratko
>
>
> On 9/18/2013 7:29 AM, Shrinivas Moorthi wrote:
> > pbs_iff: Invalid credential MSG=cannot authenticate user. Client
> > connection not found
> > No Permission.
>
> --
> Ratko Vasic
> Meteorologist
> 301-683-3814 <tel:301-683-3814>
> National Oceanic and Atmospheric Administration
> NCEP/EMC,  Room 2791
> NCWCP  W/NP2
> 5830 University Research Court
> College Park, MD  20740-3818
>
>
> _______________________________________________
> Ncep.list.nems.announce mailing list
> Ncep.list.nems.announce at lstsrv.ncep.noaa.gov 
> <mailto:Ncep.list.nems.announce at lstsrv.ncep.noaa.gov>
> https://lstsrv.ncep.noaa.gov/mailman/listinfo/ncep.list.nems.announce
>
>
>
>
> -- 
> Yusong Wang,   Ph.D.
> High Performance Computing Application Specialist
> NOAA/ National Weather Service
> National Centers for Environmental Prediction
>
> Building: NCWCP, Room: 2028
> 5830 University Research Ct
> College Park,MD   20740
> Tel (Office):  (301)683-3690
> Fax: (301)683-3703
>
>
>
> _______________________________________________
> Ncep.list.nems.announce mailing list
> Ncep.list.nems.announce at lstsrv.ncep.noaa.gov
> https://lstsrv.ncep.noaa.gov/mailman/listinfo/ncep.list.nems.announce


-- 
Dr. Shrinivas Moorthi
Research Meteorologist
Global Climate and Weather Modeling Branch
Environmental Modeling Center / National Centers for Environmental Prediction
5830 University Research Court - (W/NP23), College Park MD 20740 USA
Tel:(301)683-3718

-------------- next part --------------
An HTML attachment was scrubbed...
URL: https://lstsrv.ncep.noaa.gov/pipermail/ncep.list.nems.announce/attachments/20130918/5909f446/attachment-0001.html 


More information about the Ncep.list.nems.announce mailing list