<?xml version="1.0" encoding="UTF-8"?>
<feed xml:lang="en-US" xmlns="http://www.w3.org/2005/Atom">
  <id>tag:status.rc.fas.harvard.edu,2005:/history</id>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu"/>
  <link rel="self" type="application/atom+xml" href="https://status.rc.fas.harvard.edu/history.atom"/>
  <title>FAS Research Computing Status - Incident history</title>
  <updated>2026-06-15T13:00:00.000+00:00</updated>
  <author>
    <name>FAS Research Computing</name>
  </author>
  
<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmouer4mn03r4amtc5shv8576</id>
  <published>2026-06-15T13:00:00.000+00:00</published>
  <updated>2026-06-15T13:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmouer4mn03r4amtc5shv8576"/>
  <title>2026 MGHPCC power downtime June 15-18, 2026</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    
    <p><strong>Affected Components:</strong> FASRC Two-Factor (OpenAuth), HolyLFS04 (Tier 0), GPU nodes (Holyoke), Coldfront, Infiniband - Holyoke/MGHPCC, FASSE Compute Cluster (Holyoke), SLURM Scheduler - FASSE, Boston Tier 2 NFS (new), Holyoke Tier 2 NFS (new), Holylabs, Holyoke Firewall, Network - Holyoke/MGHPCC, Holyoke-Boston fiber link (short path), Infiniband - Boston, HolyLFS06 (Tier 0), bosECS, Boston Specialty Storage, Samba Cluster, Software &amp; Modules, Holyoke/MGHPCC Data Center, Cannon Compute Cluster (Holyoke), Network - Boston, Cambridge firewall and other redundancy, FIINE billing portal, License Servers, seas_compute, Cannon Open OnDemand/VDI, FASSE login nodes, Kempner Cluster GPU, Kempner Cluster CPU, FASSE Open OnDemand/VDI, Starfish, Web Proxies, CEPH Storage Boston (Tier 2), Authentication, Globus Data Transfer, Holyoke-Boston fiber link (long path), Virtual Infrastructure - Holyoke, Holystore01 (Tier 0), Login Nodes - Holyoke, Boston Data Center, NESE (NorthEast Storage Exchange), SLURM Scheduler - Cannon, Isilon Storage Holyoke (Tier 1), holECS, Holyoke Specialty Storage, FASRC Downloads Site, Citrix, Login Nodes - Boston, HolyLFS05 (Tier 0), Virtual Infrastructure - Boston, Network - Cambridge, FASRC VPN (Cambridge) , FASRC VPN (Boston), Grafana Cloud (FASRC), BosLFS02 (Tier 0), Isilon Storage Boston (Tier 1), Home Directory Storage - Boston, portal.rc.fas.harvard.edu, Netscratch (Global Scratch), Tape - (Tier 3), Boston Compute Nodes</p>
    <p><small>Jun <var data-var='date'> 15</var>, <var data-var='time'>13:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  The yearly power downtime at our Holyoke data center, MGHPCC, has been scheduled by the facility. This year&#039;s power downtime will take place on Tuesday June 15th - 18th, 2025\. There will be no June monthly maintenance as a result.

Since the facility will be powered down for two days this year, we will not be performing the usual maintenance tasks.   
That said, networking and other key infrastructure will be doing maintenance.

**IMPORTANT NOTE**: FASRC storage at both Holyoke and Boston **will be** affected and should not be expected to be available throughout the downtime. Please plan ahead accordingly.

* **Monday June 15th** \- Power-down begins at 9AM
* **Tuesday June 16th** \- Power out at MGHPCC
* **Wednesday June 17th** \- Power out at MGHPCC
* **Thursday June 18th** \- Expected return to full service by 5PM
* **Friday June 19th** \- Please note that June 19th is a university holiday

![Monday June 15th -  Power-down begins at 9AM
Tuesday June 16th - Power out at MGHPCC
Wednesday June 17th - Power out at MGHPCC
Thursday June 18th - Expected return to full service by 5PM](https://www.rc.fas.harvard.edu/wp-content/uploads/2026/05/mghpcc_powerdown_2026-1.jpg)

**For more detailed information and follow-up, please see:**   
&lt;https://www.rc.fas.harvard.edu/mghpcc-yearly-shutdown&gt; **or this** [**Status Page**](https://status.rc.fas.harvard.edu/).</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmoagu0310052elrw0kbo3tbu</id>
  <published>2026-05-18T11:00:00.000+00:00</published>
  <updated>2026-05-18T11:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmoagu0310052elrw0kbo3tbu"/>
  <title>MGHPCC power work - Part 2 May 18</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    
    <p><strong>Affected Components:</strong> FASSE Compute Cluster (Holyoke), GPU nodes (Holyoke), SLURM Scheduler - FASSE, Cannon Compute Cluster (Holyoke), seas_compute, Kempner Cluster GPU, Kempner Cluster CPU, SLURM Scheduler - Cannon, Boston Compute Nodes</p>
    <p><small>May <var data-var='date'> 18</var>, <var data-var='time'>11:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Our Holyoke data center, MGHPCC, will be doing power work on Row 8A. This work, which is being completed over the course of 2 weeks, will bring online another power feed which will increase power capacity. 

In order to do this work, it will require us to idle half the nodes in 8a for the duration of the week. This means all partitions in this row will be at half capacity. Existing jobs should drain naturally and no job should need to be canceled. 

The impacted partitions are:

```
arguelles_delgado_h100
bigmem
bigmem_intermediate
blackhole_gpu
dvorkin
eddy
enos
gershman
gpu
gpu_h200
gpu_requeue
hejazi
hernquist_ice
hoekstra
hsph
hsph_gpu
huce_ice
iaifi_gpu_requeue
intermediate
itc_cluster
itc_gpu
janson_sapphire
joonholee
jshapiro
kempner
kempner_priority
kempner_dev
kempner_eng
kempner_h200_priority
kempner_h100
kempner_h100_priority
kempner_h100_priority2
kempner_h100_priority3
kempner_h100_priority4
kempner_interactive
kovac
kozinsky
kozinsky_gpu
kozinsky_requeue
murphy_ice
mweber_compute
mweber_gpu
olveczky_sapphire
ortegahernandez_ice
rivas
sapphire
seas_compute
siag
siag_combo
test
yao
yao_priority
zhuang
```.</p>
<p><small>May <var data-var='date'> 18</var>, <var data-var='time'>11:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Rescheduled to May 18.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmoagr8fc0009m6hqfi4wd20x</id>
  <published>2026-05-11T11:00:00.000+00:00</published>
  <updated>2026-05-11T11:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmoagr8fc0009m6hqfi4wd20x"/>
  <title>MGHPCC power work - Part 1 May 11</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 5 days and 12 hours</p>
    <p><strong>Affected Components:</strong> GPU nodes (Holyoke), FASSE Compute Cluster (Holyoke), SLURM Scheduler - FASSE, Cannon Compute Cluster (Holyoke), seas_compute, Kempner Cluster GPU, Kempner Cluster CPU, SLURM Scheduler - Cannon, Boston Compute Nodes</p>
    <p><small>May <var data-var='date'> 11</var>, <var data-var='time'>11:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Rescheduled to May 11.</p>
<p><small>May <var data-var='date'> 11</var>, <var data-var='time'>11:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Our Holyoke data center, MGHPCC, will be doing power work on Row 8A. This work, which will occur this week and next week, will bring online another power feed which will increase power capacity.

In order to do this work, it will require us to idle half the nodes in 8a for the duration of the week. This means all partitions in this row will be at half capacity. Existing jobs should drain naturally and no job should need to be canceled.

The impacted partitions are:

```
arguelles_delgado_h100
bigmem
bigmem_intermediate
blackhole_gpu
dvorkin
eddy
enos
gershman
gpu
gpu_h200
gpu_requeue
hejazi
hernquist_ice
hoekstra
hsph
hsph_gpu
huce_ice
iaifi_gpu_requeue
intermediate
itc_cluster
itc_gpu
janson_sapphire
joonholee
jshapiro
kempner
kempner_priority
kempner_dev
kempner_eng
kempner_h200_priority
kempner_h100
kempner_h100_priority
kempner_h100_priority2
kempner_h100_priority3
kempner_h100_priority4
kempner_interactive
kovac
kozinsky
kozinsky_gpu
kozinsky_requeue
murphy_ice
mweber_compute
mweber_gpu
olveczky_sapphire
ortegahernandez_ice
rivas
sapphire
seas_compute
siag
siag_combo
test
yao
yao_priority
zhuang
```.</p>
<p><small>May <var data-var='date'> 16</var>, <var data-var='time'>23:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>
<p><small>May <var data-var='date'> 11</var>, <var data-var='time'>11:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmou9kqfs000jano84c7xx9bc</id>
  <published>2026-05-06T16:20:55.382+00:00</published>
  <updated>2026-05-06T16:20:55.581+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmou9kqfs000jano84c7xx9bc"/>
  <title>www.rc.fas.harvard.edu is back up</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    
    <p><strong>Affected Components:</strong> www.rc.fas.harvard.edu</p>
    <p><small>May <var data-var='date'> 6</var>, <var data-var='time'>16:20:55</var> GMT+0</small><br /><strong>Investigating</strong> -
  www.rc.fas.harvard.edu is down at the moment. This incident was created automatically..</p>
<p><small>May <var data-var='date'> 6</var>, <var data-var='time'>16:30:56</var> GMT+0</small><br /><strong>Resolved</strong> -
  www.rc.fas.harvard.edu is back up. This incident was resolved automatically..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmoa68qts000ceyvghagi44uw</id>
  <published>2026-05-04T13:00:00.000+00:00</published>
  <updated>2026-05-04T13:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmoa68qts000ceyvghagi44uw"/>
  <title>Monthly maintenance May 4th 2026 9am-1pm</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 4 hours</p>
    <p><strong>Affected Components:</strong> FASRC Two-Factor (OpenAuth), GPU nodes (Holyoke), FASSE Compute Cluster (Holyoke), SLURM Scheduler - FASSE, Cannon Compute Cluster (Holyoke), seas_compute, Cannon Open OnDemand/VDI, FASSE login nodes, Kempner Cluster GPU, Kempner Cluster CPU, FASSE Open OnDemand/VDI, Login Nodes - Holyoke, SLURM Scheduler - Cannon, Login Nodes - Boston, Netscratch (Global Scratch), Boston Compute Nodes</p>
    <p><small>May <var data-var='date'> 4</var>, <var data-var='time'>13:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  FASRC monthly maintenance will take place on May 4th 2026\. Our maintenance tasks should be completed between 9am-1pm.

**NOTICES:**

* Annual data center power downtime: The annual downtime at MGHPCC will take place June 15 - June 18\. This year&#039;s downtime will be one day longer. More details will be sent to all users next month.
* Training: Upcoming training from FASRC and other sources can be found on our Training Calendar. at &lt;https://www.rc.fas.harvard.edu/upcoming-training/&gt;
* Status Page: You can subscribe to our status to receive notifications of maintenance, incidents, and their resolution at &lt;https://status.rc.fas.harvard.edu/&gt; (click Get Updates for options).

**MAINTENANCE TASKS**

Cannon cluster will be paused during this maintenance?: **YES**  
FASSE cluster will be paused during this maintenance?: **YES**

* Slurm 25.11.5 Upgrade  
   * Audience: All cluster users  
   * Impact: Jobs will be paused during the upgrade
* Reboot remaining stuck nodes from power outage  
   * Audience: N/A  
   * Impact: No visible impact to user
* Two-Factor/OpenAuth ([two-factor.rc.fas.harvard.edu](http://two-factor.rc.fas.harvard.edu)) replacement  
   * Audience: All account holders  
   * Impact: The server will be unavailable during maintenance. You will be unable to obtain a new or replacement OpenAuth token during this period.
* Domain controller replacement  
   * Audience: Internal  
   * Impact: End users should not see any impact
* OOD/Open OnDemand reboots  
   * Audience: All OOD users, reboot of the head nodes  
   * Impact: Running sessions will _not_ be affected
* Login node reboots  
   * Audience; All login node users  
   * Impact: Login nodes will reboot during the maintenance window
* Netscratch 90-day retention cleanup  
   * Audience; All netscratch users  
   * Impact: Files older than 90 days will be removed per our [scratch policy](https://docs.rc.fas.harvard.edu/kb/policy-scratch/). Please note that this cleanup can happen at any time, not just during maintenance.

Thank you,  
FAS Research Computing  
&lt;https://docs.rc.fas.harvard.edu/&gt;  
&lt;https://www.rc.fas.harvard.edu/&gt;.</p>
<p><small>May <var data-var='date'> 4</var>, <var data-var='time'>13:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>May <var data-var='date'> 4</var>, <var data-var='time'>14:56:02</var> GMT+0</small><br /><strong>Identified</strong> -
  The scheduler is re-opened and jobs un-paused. Other, non-impacting, work continues..</p>
<p><small>May <var data-var='date'> 4</var>, <var data-var='time'>17:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmo8wbelf004s7x6qrzk183z3</id>
  <published>2026-05-01T20:00:00.000+00:00</published>
  <updated>2026-05-01T20:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmo8wbelf004s7x6qrzk183z3"/>
  <title>Starfish maintenance</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 1 hour</p>
    <p><strong>Affected Components:</strong> Starfish</p>
    <p><small>May <var data-var='date'> 1</var>, <var data-var='time'>20:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Starfish will be upgraded to the latest version on Friday, May 1st from 4pm-5pm. The service and dashboard will be down during this time. .</p>
<p><small>May <var data-var='date'> 1</var>, <var data-var='time'>20:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>May <var data-var='date'> 1</var>, <var data-var='time'>21:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmohlvitv03efhvfzdaadj1lz</id>
  <published>2026-04-30T12:00:00.000+00:00</published>
  <updated>2026-04-30T12:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmohlvitv03efhvfzdaadj1lz"/>
  <title>OpenOnDemand maintenance</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 2 hours</p>
    <p><strong>Affected Components:</strong> Cannon Open OnDemand/VDI, FASSE Open OnDemand/VDI</p>
    <p><small>Apr <var data-var='date'> 30</var>, <var data-var='time'>12:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  At 8am on Thursday April 30th we will be upgrading from Open OnDemand version 4.0.7 to 4.1.4 on both the Cannon and FASSE clusters. 

This is not expected to impact running jobs. 

This upgrade adds the Jobs-&gt;Project Manager menu item and fixes an issue that affected access to the Clusters-&gt;Shell Access menu item when using Firefox..</p>
<p><small>Apr <var data-var='date'> 30</var>, <var data-var='time'>12:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Apr <var data-var='date'> 30</var>, <var data-var='time'>14:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmokvaw2500ewyhgkqgzfuort</id>
  <published>2026-04-30T02:31:25.900+00:00</published>
  <updated>2026-04-30T02:31:26.123+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmokvaw2500ewyhgkqgzfuort"/>
  <title>login.rc.fas.harvard.edu is responding normally</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    
    <p><strong>Affected Components:</strong> , Login Nodes - Holyoke, Login Nodes - Boston, , 
Login Nodes → 
login.rc.fas.harvard.edu →</p>
    <p><small>Apr <var data-var='date'> 30</var>, <var data-var='time'>02:31:26</var> GMT+0</small><br /><strong>Investigating</strong> -
  login.rc.fas.harvard.edu is not responding normally. This incident was automatically created..</p>
<p><small>Apr <var data-var='date'> 30</var>, <var data-var='time'>15:43:53</var> GMT+0</small><br /><strong>Resolved</strong> -
  \\\[login.rc.fas.harvard.edu\\\](http://login.rc.fas.harvard.edu) is responding normally. This incident was automatically resolved..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmokukngw04fg48dopehgznvm</id>
  <published>2026-04-30T02:11:01.535+00:00</published>
  <updated>2026-04-30T02:11:01.535+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmokukngw04fg48dopehgznvm"/>
  <title>Login and OOD node access restricted due to serious security issue - No ETA</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 16 hours and 2 minutes</p>
    <p><strong>Affected Components:</strong> Cannon Open OnDemand/VDI, FASSE login nodes, FASSE Open OnDemand/VDI, Login Nodes - Holyoke, Login Nodes - Boston</p>
    <p><small>Apr <var data-var='date'> 30</var>, <var data-var='time'>02:11:01</var> GMT+0</small><br /><strong>Identified</strong> -
  **Due to a serious in-the-wild exploit which can compromise Fedora-based Linux distributions including Rocky, which is used on the cluster, we need to restrict access. All login and OOD nodes are shut down until a fix can be put in place. Jobs running on the cluster will continue running.**

**No ETA, There is not fix at this time. We will update our status page in the morning once we have more information or a fix to roll out.**

**This is a serious exploit and we do not take this measure lightly. Please follow this status page for updates and eventual resolution.**.</p>
<p><small>Apr <var data-var='date'> 30</var>, <var data-var='time'>14:44:58</var> GMT+0</small><br /><strong>Identified</strong> -
  We are developing a plan of attack to mitigate this exploit. Please know that this is a very serious issue and so we are treating it as such. Thank you for your understanding.  
  
We are currently awaiting further information from the Redhat/Fedora/Rocky community but building a plan in the meantime with the information we have. More details to follow as we can share them.  
  
If you need to access storage (except scratch and home directories), [Globus ](https://docs.rc.fas.harvard.edu/kb/globus-file-transfer/)is still online and available. But again, login nodes and OOD are not available..</p>
<p><small>Apr <var data-var='date'> 30</var>, <var data-var='time'>15:40:41</var> GMT+0</small><br /><strong>Identified</strong> -
  To mitigate this exploit we will need to restart -all nodes- on the cluster. 

This **will begin at 1PM** and run until all nodes have restarted (no ETA).   

This **will mean** any un-finished jobs will be **terminated**. There is no way to avoid this.   
  
We will then be validating the fix before re-opening the login. OOD nodes, and scheduler.

Next steps and updates will be posted here..</p>
<p><small>Apr <var data-var='date'> 30</var>, <var data-var='time'>18:12:41</var> GMT+0</small><br /><strong>Resolved</strong> -
  The cluster has been rebooted and all nodes, including login and OOD, have been patched. 

The scheduler is re-opened and jobs which were preempted/requeued have priority for re-scheduling.

Some non-standard, lab-owned nodes may still require patching. The owners of these machines may be contacted about this.

Thank you for your patience. This is a global issue and is being addressed at centers everywhere..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmok8r4au00pamcnk0tz7d2ve</id>
  <published>2026-04-29T16:00:11.873+00:00</published>
  <updated>2026-04-29T16:00:11.873+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmok8r4au00pamcnk0tz7d2ve"/>
  <title>holylfs06 down</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 33 minutes</p>
    <p><strong>Affected Components:</strong> HolyLFS06 (Tier 0)</p>
    <p><small>Apr <var data-var='date'> 29</var>, <var data-var='time'>16:00:11</var> GMT+0</small><br /><strong>Identified</strong> -
  Holylfs06 storage is down. We are investigating. More details as they are known..</p>
<p><small>Apr <var data-var='date'> 29</var>, <var data-var='time'>16:32:45</var> GMT+0</small><br /><strong>Resolved</strong> -
  Holylfs06 is accessible again. 

This incident has been resolved..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmoiqv16102yytc5y4eabtkxp</id>
  <published>2026-04-28T17:00:00.000+00:00</published>
  <updated>2026-04-28T17:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmoiqv16102yytc5y4eabtkxp"/>
  <title>Website security maintenance (www.rc and docs.rc) 4-28-26 1pm</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 17 minutes</p>
    <p><strong>Affected Components:</strong> docs.rc.fas.harvard.edu, www.rc.fas.harvard.edu</p>
    <p><small>Apr <var data-var='date'> 28</var>, <var data-var='time'>17:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Security updates are required for [www.rc.fas.harvard.edu](http://www.rc.fas.harvard.edu) and [docs.rc.fas.harvard.edu](http://docs.rc.fas.harvard.edu)   
This work will take place today between 1pm and 2pm  
Both sites will be down for very short periods during the updates..</p>
<p><small>Apr <var data-var='date'> 28</var>, <var data-var='time'>17:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Apr <var data-var='date'> 28</var>, <var data-var='time'>17:16:58</var> GMT+0</small><br /><strong>Completed</strong> -
  Website maintenance has completed successfully..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmo7omopz0041p3pjvxgqi747</id>
  <published>2026-04-20T21:03:38.448+00:00</published>
  <updated>2026-04-20T21:03:38.448+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmo7omopz0041p3pjvxgqi747"/>
  <title>Starfish - Out of date scans </title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 7 days and 18 hours</p>
    <p><strong>Affected Components:</strong> Starfish</p>
    <p><small>Apr <var data-var='date'> 20</var>, <var data-var='time'>21:03:38</var> GMT+0</small><br /><strong>Identified</strong> -
  Some starfish scans restarted following earlier issue and may be up to a week out of date due to the delayed scans 

Two scanning agents are still down. So please bear this in mind when viewing Starfish data..</p>
<p><small>Apr <var data-var='date'> 21</var>, <var data-var='time'>13:48:40</var> GMT+0</small><br /><strong>Monitoring</strong> -
  The down agents are back in service. Scans are on-going but data will still be out of date until they catch up..</p>
<p><small>Apr <var data-var='date'> 28</var>, <var data-var='time'>15:03:28</var> GMT+0</small><br /><strong>Resolved</strong> -
  Scans are running normally. Any out of date stats will catch up shortly..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmo5pma7u0n7mjomagp7omaed</id>
  <published>2026-04-19T11:55:47.130+00:00</published>
  <updated>2026-04-19T11:55:47.329+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmo5pma7u0n7mjomagp7omaed"/>
  <title>login.rc.fas.harvard.edu is responding normally</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    
    <p><strong>Affected Components:</strong> , Login Nodes - Holyoke, Login Nodes - Boston, , 
Login Nodes → 
login.rc.fas.harvard.edu →</p>
    <p><small>Apr <var data-var='date'> 19</var>, <var data-var='time'>11:55:47</var> GMT+0</small><br /><strong>Investigating</strong> -
  login.rc.fas.harvard.edu is not responding normally. This incident was automatically created..</p>
<p><small>Apr <var data-var='date'> 19</var>, <var data-var='time'>12:55:25</var> GMT+0</small><br /><strong>Resolved</strong> -
  \\\[login.rc.fas.harvard.edu\\\](http://login.rc.fas.harvard.edu) is responding normally. This incident was automatically resolved..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmo5eigd3025we79u8go90tos</id>
  <published>2026-04-19T06:44:52.695+00:00</published>
  <updated>2026-04-19T06:44:52.709+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmo5eigd3025we79u8go90tos"/>
  <title>Authentication outage</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 7 hours and 30 minutes</p>
    <p><strong>Affected Components:</strong> Authentication</p>
    <p><small>Apr <var data-var='date'> 19</var>, <var data-var='time'>06:44:52</var> GMT+0</small><br /><strong>Investigating</strong> -
  Authentication issues with openauth/radius. This incident was created by an automated monitoring service..</p>
<p><small>Apr <var data-var='date'> 19</var>, <var data-var='time'>14:14:53</var> GMT+0</small><br /><strong>Resolved</strong> -
  Openauth/radius is now operational. This update was created by an automated monitoring service..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmo5mx8uq0wy7113vg0ey0q9t</id>
  <published>2026-04-19T06:18:00.000+00:00</published>
  <updated>2026-04-19T06:18:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmo5mx8uq0wy7113vg0ey0q9t"/>
  <title>MGHPCC Power Loss</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 8 hours and 54 minutes</p>
    <p><strong>Affected Components:</strong> FASSE Compute Cluster (Holyoke), FASRC Two-Factor (OpenAuth), HolyLFS04 (Tier 0), GPU nodes (Holyoke), Infiniband - Holyoke/MGHPCC, SLURM Scheduler - FASSE, Holyoke Tier 2 NFS (new), Holylabs, Holyoke Firewall, Network - Holyoke/MGHPCC, HolyLFS06 (Tier 0), Holyoke/MGHPCC Data Center, Cannon Compute Cluster (Holyoke), seas_compute, Cannon Open OnDemand/VDI, FASSE login nodes, Kempner Cluster GPU, Kempner Cluster CPU, FASSE Open OnDemand/VDI, Virtual Infrastructure - Holyoke, Holystore01 (Tier 0), Login Nodes - Holyoke, NESE (NorthEast Storage Exchange), SLURM Scheduler - Cannon, Isilon Storage Holyoke (Tier 1), holECS, Holyoke Specialty Storage, Login Nodes - Boston, HolyLFS05 (Tier 0), Netscratch (Global Scratch), Tape - (Tier 3), Boston Compute Nodes</p>
    <p><small>Apr <var data-var='date'> 19</var>, <var data-var='time'>06:18:00</var> GMT+0</small><br /><strong>Investigating</strong> -
  At 2:18am on April 19th MGHPCC (our Holyoke datacenter) lost cooling which caused the entire facility to shutdown. This caused the loss of all jobs that were running. Storage and data on that storage should be safe. The facility is working on restoring cooling and power. Unfortunately we do not have an ETA..</p>
<p><small>Apr <var data-var='date'> 19</var>, <var data-var='time'>13:06:16</var> GMT+0</small><br /><strong>Identified</strong> -
  Power was fully restored to MGHPCC at 7:39am on April 19th. FASRC staff has restored functionality to most systems except for FASSE Open OnDemand. All other services are up and operating normally. If you continue to see issues with any system that is marked operational please let us know. We will deal with any non urgent requests in normal working hours..</p>
<p><small>Apr <var data-var='date'> 19</var>, <var data-var='time'>15:12:24</var> GMT+0</small><br /><strong>Resolved</strong> -
  All services, including FASSE OOD, should be functional at this time. If you continue to see issues with any system that is marked operational please let us know by sending an email to [rchelp@rc.fas.harvard.edu](mailto:rchelp@rc.fas.harvard.edu)

We will deal with any non urgent requests in normal working hours.

This incident has been resolved..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmnyotr4d003l6i6q561of3ku</id>
  <published>2026-04-13T16:30:00.000+00:00</published>
  <updated>2026-04-13T16:30:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmnyotr4d003l6i6q561of3ku"/>
  <title>Starfish down</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 1 day and 48 minutes</p>
    <p><strong>Affected Components:</strong> Starfish</p>
    <p><small>Apr <var data-var='date'> 13</var>, <var data-var='time'>16:30:00</var> GMT+0</small><br /><strong>Investigating</strong> -
  Starfish is currently unavailable, due to a network card issue. Updates to come.

We are currently investigating this incident..</p>
<p><small>Apr <var data-var='date'> 14</var>, <var data-var='time'>14:17:21</var> GMT+0</small><br /><strong>Identified</strong> -
  Staff will be at the datacenter today to check on the physical status of the server. Updates to come. 

We are continuing to work on a fix for this incident..</p>
<p><small>Apr <var data-var='date'> 14</var>, <var data-var='time'>17:17:30</var> GMT+0</small><br /><strong>Resolved</strong> -
  The network card has been replaced, and Starfish is back up. 

This incident has been resolved..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmnq3714o000vs7adgfjwe485</id>
  <published>2026-04-08T13:31:31.202+00:00</published>
  <updated>2026-04-08T13:31:31.202+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmnq3714o000vs7adgfjwe485"/>
  <title>Coldfront is down.</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 43 minutes</p>
    <p><strong>Affected Components:</strong> Coldfront</p>
    <p><small>Apr <var data-var='date'> 8</var>, <var data-var='time'>13:31:31</var> GMT+0</small><br /><strong>Investigating</strong> -
  Coldfront logins are producing an error message. We are currently investigating this incident..</p>
<p><small>Apr <var data-var='date'> 8</var>, <var data-var='time'>14:14:35</var> GMT+0</small><br /><strong>Resolved</strong> -
  Coldfront is back up and accepting logins..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmnnabux30qmbq6fm9rbvrvx8</id>
  <published>2026-04-06T14:27:54.839+00:00</published>
  <updated>2026-04-06T14:27:54.839+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmnnabux30qmbq6fm9rbvrvx8"/>
  <title>Starfish dashboard inaccessible</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 1 hour and 3 minutes</p>
    <p><strong>Affected Components:</strong> Starfish</p>
    <p><small>Apr <var data-var='date'> 6</var>, <var data-var='time'>14:27:54</var> GMT+0</small><br /><strong>Investigating</strong> -
  The Starfish dashboard is inaccessible. We are looking into the issue..</p>
<p><small>Apr <var data-var='date'> 6</var>, <var data-var='time'>15:31:02</var> GMT+0</small><br /><strong>Resolved</strong> -
  Starfish has resolved the issue and the dashboard is once again available..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmn6ac2960e2v140x1wt0rf80</id>
  <published>2026-04-06T13:00:00.000+00:00</published>
  <updated>2026-04-06T13:00:01.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmn6ac2960e2v140x1wt0rf80"/>
  <title>FASRC monthly maintenance April 6th 2026 9am-1pm</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 4 hours</p>
    <p><strong>Affected Components:</strong> FASRC Two-Factor (OpenAuth), , , Cannon Open OnDemand/VDI, FASSE login nodes, FASSE Open OnDemand/VDI, Login Nodes - Holyoke, Login Nodes - Boston, Netscratch (Global Scratch), , 
Login Nodes → 
VDI/OpenOnDemand → 
login.rc.fas.harvard.edu →</p>
    <p><small>Apr <var data-var='date'> 6</var>, <var data-var='time'>13:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Apr <var data-var='date'> 6</var>, <var data-var='time'>17:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>
<p><small>Apr <var data-var='date'> 6</var>, <var data-var='time'>13:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  FASRC monthly maintenance will take place on April 6th 2026\. Our maintenance tasks should be completed between 9am-1pm.

**NOTICES:**

* Training: Upcoming training from FASRC and other sources can be found on our Training Calendar. at &lt;https://www.rc.fas.harvard.edu/upcoming-training/&gt;
* Status Page: You can subscribe to our status to receive notifications of maintenance, incidents, and their resolution at &lt;https://status.rc.fas.harvard.edu/&gt; (click Get Updates for options).
* We&#039;d love to hear success stories about your or your lab&#039;s use of FASRC. Submit your story [here](https://www.rc.fas.harvard.edu/user-stories/).

**MAINTENANCE TASKS**

Cannon cluster will be paused during this maintenance?: **NO**  
FASSE cluster will be paused during this maintenance?: **NO**

* [two-factor.rc.fas.harvard.edu](http://two-factor.rc.fas.harvard.edu) [OpenAuth](https://docs.rc.fas.harvard.edu/kb/openauth/) cut-over to new server  
   * Audience: New accounts or anyone requesting an OpenAuth token  
   * Impact: two-factor will be unavailable while moving to a new server
* RStudio Server (Open OnDemand)  
   * Audience: RStudio Server users on Cannon and FASSE  
   * Impact: We will be decommissioning some versions of RStudio Server so we can properly maintain all production versions. Versions to be decommissioned:  
         * R 4.1.3 (Bioconductor 3.14, RStudio 2022.02.0)  
         * R 4.1.0 (Bioconductor 3.13, RStudio 1.4.1717)  
         * R 4.0.3 (Bioconductor 3.12, Rstudio 1.3.1093)  
         * R 4.0.0 (Bioconductor 3.11, Rstudio 1.3.1093)  
   * If you use one of these versions, we recommend replacing it with the most recent version, R 4.4.2 (Bioconductor 3.20, RStudio 2024.12.0). You must reinstall previously installed libraries.
* Domain controller replacement  
   * Audience: Internal  
   * Impact: End users should not see any impact
* OOD/Open OnDemand reboots  
   * Audience: All OOD users, reboot of the head nodes  
   * Impact: Running sessions will _not_ be affected
* Login node reboots  
   * Audience; All login node users  
   * Impact: Login nodes will reboot during the maintenance window
* Netscratch 90-day retention cleanup  
   * Audience; All netscratch users  
   * Impact: Files older than 90 days will be removed per our [scratch policy](https://docs.rc.fas.harvard.edu/kb/policy-scratch/). Please note that this cleanup can happen at any time, not just during maintenance.

Thank you,  
FAS Research Computing  
&lt;https://docs.rc.fas.harvard.edu/&gt;  
&lt;https://www.rc.fas.harvard.edu/&gt;.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmnf48s6u02wctz2d7zg0wjmn</id>
  <published>2026-03-31T21:15:24.557+00:00</published>
  <updated>2026-04-01T12:11:15.205+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmnf48s6u02wctz2d7zg0wjmn"/>
  <title>Scheduler is degraded</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 14 hours and 56 minutes</p>
    <p><strong>Affected Components:</strong> , GPU nodes (Holyoke), , Cannon Compute Cluster (Holyoke), seas_compute, Kempner Cluster GPU, Kempner Cluster CPU, SLURM Scheduler - Cannon, Boston Compute Nodes, 
Cannon Cluster → 
Kempner Cluster →</p>
    <p><small>Apr <var data-var='date'> 1</var>, <var data-var='time'>12:11:15</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved. The scheduler is running normally..</p>
<p><small>Mar <var data-var='date'> 31</var>, <var data-var='time'>21:15:24</var> GMT+0</small><br /><strong>Investigating</strong> -
  The scheduler is in a degraded state due to [thrashing](https://en.wikipedia.org/wiki/Thrashing%5F%28computer%5Fscience%29)  
We are actively working to resolve this problem..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmnerzlwh0g8hr8lzi8oq5mr2</id>
  <published>2026-03-31T15:32:20.956+00:00</published>
  <updated>2026-03-31T15:32:20.956+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmnerzlwh0g8hr8lzi8oq5mr2"/>
  <title>two-factor.rc.fas.harvard.edu (openauth) error</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 1 hour and 12 minutes</p>
    <p><strong>Affected Components:</strong> FASRC Two-Factor (OpenAuth)</p>
    <p><small>Mar <var data-var='date'> 31</var>, <var data-var='time'>15:32:20</var> GMT+0</small><br /><strong>Investigating</strong> -
  We are currently investigating this incident. Requesting a new token or re-requesting your token from two-factor is not currently working. .</p>
<p><small>Mar <var data-var='date'> 31</var>, <var data-var='time'>16:44:08</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved. two-factor.rc.fas.harvard.edu is working normally again..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmnes00kk0cz8e9ndj1i7o3xk</id>
  <published>2026-03-25T14:30:00.000+00:00</published>
  <updated>2026-03-25T14:30:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmnes00kk0cz8e9ndj1i7o3xk"/>
  <title>The web front end to two-factor.rc.fas.harvard.edu is currently not allowing logins, generating new tokens is currently unavailable</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 6 days and 30 minutes</p>
    <p><strong>Affected Components:</strong> FASRC Two-Factor (OpenAuth)</p>
    <p><small>Mar <var data-var='date'> 25</var>, <var data-var='time'>14:30:00</var> GMT+0</small><br /><strong>Investigating</strong> -
  We are currently investigating this incident..</p>
<p><small>Mar <var data-var='date'> 31</var>, <var data-var='time'>15:00:25</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmn634bsn0co6fzrz7gjvlmtv</id>
  <published>2026-03-25T13:34:01.221+00:00</published>
  <updated>2026-03-25T14:10:34.144+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmn634bsn0co6fzrz7gjvlmtv"/>
  <title>Network issues - Cluster degraded</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 5 days, 7 hours and 7 minutes</p>
    <p><strong>Affected Components:</strong> , GPU nodes (Holyoke), Network - Holyoke/MGHPCC, Cannon Compute Cluster (Holyoke), seas_compute, SLURM Scheduler - Cannon, Isilon Storage Holyoke (Tier 1), Boston Compute Nodes, 
Cannon Cluster →</p>
    <p><small>Mar <var data-var='date'> 25</var>, <var data-var='time'>14:10:34</var> GMT+0</small><br /><strong>Identified</strong> -
  Mounts to Holyoke Isilon (specifically /n/sw) are broken on numerous nodes across the cluster. We have a check rolling out to find these nodes so we can remediate them individually. Until remediated the cluster will be in a degraded state. Running jobs may randomly die or fail as they hit nodes that have stale mounts.

It will be risky to run jobs for the next hour and then, after that point, the cluster will have a large number of nodes closed waiting for them to drain so we can reboot them and fix the mounts..</p>
<p><small>Mar <var data-var='date'> 25</var>, <var data-var='time'>13:34:01</var> GMT+0</small><br /><strong>Investigating</strong> -
  A network issue affecting storage critical to the cluster is It&#039;s causing instability. The cluster is currently in a degraded state as a result. We are looking into the problem. Updates to follow...</p>
<p><small>Mar <var data-var='date'> 25</var>, <var data-var='time'>14:31:18</var> GMT+0</small><br /><strong>Monitoring</strong> -
  Mounts to Holyoke Isilon (specifically /n/sw) are broken on numerous nodes across the cluster. We have a check rolling out to find these nodes so we can remediate them individually. Until remediated the cluster will be in a degraded state. Running jobs may randomly die or fail as they hit nodes that have stale mounts.

It will be risky to run jobs for the next hour and then, after that point, the cluster will have a large number of nodes closed waiting for them to drain so we can reboot them and fix the mounts.

At this time we are unaware of any holy-isilon problems other than the effect this had on cluster nodes/running jobs. We will update should we identify any data storage concerns..</p>
<p><small>Mar <var data-var='date'> 30</var>, <var data-var='time'>20:41:25</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved by draining and rebooting any nodes with stuck mounts..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmmxh7njd00qve87a87odxjyz</id>
  <published>2026-03-19T12:58:36.175+00:00</published>
  <updated>2026-03-19T12:58:36.175+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmmxh7njd00qve87a87odxjyz"/>
  <title>ColdFront is down.</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 1 hour and 54 minutes</p>
    <p><strong>Affected Components:</strong> Coldfront</p>
    <p><small>Mar <var data-var='date'> 19</var>, <var data-var='time'>12:58:36</var> GMT+0</small><br /><strong>Identified</strong> -
  ColdFront is down. We are working to bring it back up. The instance got replaced last night, but it had trouble configuring itself on the way up again..</p>
<p><small>Mar <var data-var='date'> 19</var>, <var data-var='time'>14:52:54</var> GMT+0</small><br /><strong>Resolved</strong> -
  Cold front is back up. Thank you for your patience..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmmpcvot800j93wkll4ba6gwu</id>
  <published>2026-03-13T20:35:07.736+00:00</published>
  <updated>2026-03-13T20:35:07.736+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmmpcvot800j93wkll4ba6gwu"/>
  <title>Key access issue to CSBN, HERS, FIINE, Portal Approve (p3approve)</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 5 days, 21 hours and 59 minutes</p>
    <p><strong>Affected Components:</strong> portal.rc.fas.harvard.edu</p>
    <p><small>Mar <var data-var='date'> 13</var>, <var data-var='time'>20:35:07</var> GMT+0</small><br /><strong>Investigating</strong> -
  We are currently investigating this incident. This only affects specific services. Users of CSBN, HERS, FIINE, Portal Approve (p3approve) may be affected. Email coming from these systems may also be delayed.  
No ETA.</p>
<p><small>Mar <var data-var='date'> 19</var>, <var data-var='time'>18:33:57</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmm9e0oz109onj5z9yxz1dcjr</id>
  <published>2026-03-02T16:22:42.655+00:00</published>
  <updated>2026-03-02T16:22:42.655+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmm9e0oz109onj5z9yxz1dcjr"/>
  <title>Starfish dashboard unavailable</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 1 hour and 16 minutes</p>
    <p><strong>Affected Components:</strong> Starfish</p>
    <p><small>Mar <var data-var='date'> 2</var>, <var data-var='time'>16:22:42</var> GMT+0</small><br /><strong>Investigating</strong> -
  The Starfish dashboard is not responding. We are currently investigating this issue with the vendor..</p>
<p><small>Mar <var data-var='date'> 2</var>, <var data-var='time'>17:38:26</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmls7rae90625fg9xy9b3jc5m</id>
  <published>2026-03-02T14:00:00.000+00:00</published>
  <updated>2026-03-02T14:00:01.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmls7rae90625fg9xy9b3jc5m"/>
  <title>FASRC monthly maintenance Monday March 2nd, 2026 9am-1pm</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 4 hours</p>
    <p><strong>Affected Components:</strong> Login Nodes - Holyoke, FASSE Compute Cluster (Holyoke), SLURM Scheduler - Cannon, , , GPU nodes (Holyoke), , SLURM Scheduler - FASSE, , Cannon Compute Cluster (Holyoke), , seas_compute, Cannon Open OnDemand/VDI, FASSE login nodes, Kempner Cluster CPU, Kempner Cluster GPU, FASSE Open OnDemand/VDI, Login Nodes - Boston, Netscratch (Global Scratch), , Boston Compute Nodes, 
Login Nodes → 
Cannon Cluster → 
VDI/OpenOnDemand → 
Kempner Cluster → 
FASSE Cluster → 
login.rc.fas.harvard.edu →</p>
    <p><small>Mar <var data-var='date'> 2</var>, <var data-var='time'>14:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Mar <var data-var='date'> 2</var>, <var data-var='time'>14:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Monthly maintenance will take place on Monday March 2nd, 2026\. Our maintenance tasks should be completed between 9am-1pm.

**NOTICES:**

* Training: Upcoming training from FASRC and other sources can be found on our Training Calendar. at &lt;https://www.rc.fas.harvard.edu/upcoming-training/&gt;
* Status Page: You can subscribe to our status to receive notifications of maintenance, incidents, and their resolution at &lt;https://status.rc.fas.harvard.edu/&gt; (click Get Updates for options).
* We&#039;d love to hear success stories about your or your lab&#039;s use of FASRC. Submit your story [here](https://www.rc.fas.harvard.edu/user-stories/).

**MAINTENANCE TASKS**

Cannon cluster will be paused during this maintenance?: **YES**  
FASSE cluster will be paused during this maintenance?: **YES**

* Slurm scheduler update  
   * Audience: All cluster users  
   * Impact: Jobs will be paused during maintenance
* OOD node reboots  
   * Audience; All Open OnDemand users  
   * Impact: OOD nodes will reboot during the maintenance window
* Login node reboots  
   * Audience: All login node users  
   * Impact: Login nodes will reboot during the maintenance window
* Netscratch retention purge  
   * Audience: All users of Netscratch  
   * Impact: Files older than 90 days will be removed. Please note that retention cleanup can and does run at any time, not just during the maintenance window.

Thank you,  
FAS Research Computing  
&lt;https://docs.rc.fas.harvard.edu/&gt;  
[https://www.rc.fas.harvard.edu/](https://www.rc.fas.harvard.edu/upcoming-training/).</p>
<p><small>Mar <var data-var='date'> 2</var>, <var data-var='time'>18:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmm5ekmnr0020fzjvxkjv1iox</id>
  <published>2026-02-27T21:27:09.251+00:00</published>
  <updated>2026-03-04T15:09:37.681+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmm5ekmnr0020fzjvxkjv1iox"/>
  <title>Tape outage</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 4 days, 17 hours and 42 minutes</p>
    <p><strong>Affected Components:</strong> NESE (NorthEast Storage Exchange), Tape - (Tier 3)</p>
    <p><small>Mar <var data-var='date'> 4</var>, <var data-var='time'>15:09:37</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved. Normal tape operations are restored..</p>
<p><small>Feb <var data-var='date'> 27</var>, <var data-var='time'>21:27:09</var> GMT+0</small><br /><strong>Investigating</strong> -
  NESE Tape service will be down or operating with degraded service (no store and recall) Friday from 12 Noon EST until as late as Monday, 2 March at 9 AM.  
  
SUMMARY OF ISSUE:  
  
NESE Tape service is currently not able to store or recall files to and from tape due to vendor firmware issues in the IBM TS4500 tape library. The issue is related to the library robotics and cartridge database and we do NOT expect any data loss from this issue.  
  
The issue is apparently due to an issue with the inventory database related to a recent firmware update. This database can be scrubbed and reconstructed by the library, which will scan the bar code labels on all the cartridges to rebuild the inventory. Association of files in Globus to tapes is handled separately from the tape library and is not affected by the firmware update..</p>
<p><small>Mar <var data-var='date'> 2</var>, <var data-var='time'>14:03:01</var> GMT+0</small><br /><strong>Identified</strong> -
  NESE Tape Service is still working with IBM technical support at restoring the inventory. The expected downtime is extended until Tuesday March 3rd, 9am.  
Apologies for the inconvenvenience..</p>
<p><small>Mar <var data-var='date'> 3</var>, <var data-var='time'>14:04:46</var> GMT+0</small><br /><strong>Monitoring</strong> -
  The tape library outage is further extended to Wednesday March 4th at 9am awaiting a hardware replacement part due today. Data can still be uploaded to lab collections via Globus, but be mindful of the 10 TB buffer file limit. The outage affects storage and recall from tape..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmm3jn9o500pvn3vlsrovaqif</id>
  <published>2026-02-26T14:13:35.695+00:00</published>
  <updated>2026-02-27T22:04:05.371+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmm3jn9o500pvn3vlsrovaqif"/>
  <title>Starfish dashboard is unavailable</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 1 day, 7 hours and 50 minutes</p>
    <p><strong>Affected Components:</strong> Starfish</p>
    <p><small>Feb <var data-var='date'> 27</var>, <var data-var='time'>22:04:05</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved. The Starfish dashboard is available..</p>
<p><small>Feb <var data-var='date'> 26</var>, <var data-var='time'>14:13:35</var> GMT+0</small><br /><strong>Investigating</strong> -
  The starfish dashboard is unavailable. We are currently investigating this issue with Starfish...</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmlvay24p0nlme0oeqgve2zzk</id>
  <published>2026-02-25T14:00:00.000+00:00</published>
  <updated>2026-02-25T14:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmlvay24p0nlme0oeqgve2zzk"/>
  <title>Starfish maintenance Feb 25, 2026 all day</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 1 day</p>
    <p><strong>Affected Components:</strong> Starfish</p>
    <p><small>Feb <var data-var='date'> 25</var>, <var data-var='time'>14:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Starfish will be unavailable starting Wednesday, February 25th at 9AM until Thursday, February 26th at 9AM, for routine maintenance. The online dashboard will be inaccessible during this time..</p>
<p><small>Feb <var data-var='date'> 26</var>, <var data-var='time'>14:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>
<p><small>Feb <var data-var='date'> 25</var>, <var data-var='time'>14:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmm0ruk3v0117ca6jfq7m0o6l</id>
  <published>2026-02-24T15:39:56.874+00:00</published>
  <updated>2026-02-24T15:39:56.889+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmm0ruk3v0117ca6jfq7m0o6l"/>
  <title>Authentication outage</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 5 minutes</p>
    <p><strong>Affected Components:</strong> Authentication</p>
    <p><small>Feb <var data-var='date'> 24</var>, <var data-var='time'>15:39:56</var> GMT+0</small><br /><strong>Investigating</strong> -
  Authentication issues with openauth/radius. This incident was created by an automated monitoring service..</p>
<p><small>Feb <var data-var='date'> 24</var>, <var data-var='time'>15:44:57</var> GMT+0</small><br /><strong>Resolved</strong> -
  Openauth/radius is now operational. This update was created by an automated monitoring service..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmkx2dbd201zt5svdsbm0pm92</id>
  <published>2026-02-19T13:00:00.000+00:00</published>
  <updated>2026-02-19T13:00:01.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmkx2dbd201zt5svdsbm0pm92"/>
  <title>NESE tape maintenance Feb 19th 2026</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 9 hours</p>
    <p><strong>Affected Components:</strong> NESE (NorthEast Storage Exchange)</p>
    <p><small>Feb <var data-var='date'> 19</var>, <var data-var='time'>13:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Feb <var data-var='date'> 19</var>, <var data-var='time'>13:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  From our partners at NESE. Details follow:

We are installing four new tape frames, which will bring the tape system raw storage capacity to 253 petabytes.

**Service Affected:** NESE Tape Service

**Maintenance Window:** 8:00 AM - 5:00 PM (EST)

* The tape service will be unavailable.
* All upgrade activities are expected to be completed on the same day.

NOTES:

* Monitor the MGHPCC Slack #nese channel for status updates and announcements
* Monitor &lt;https://nese.instatus.com/&gt; for real-time updates on progress

Subscribe to &lt;https://nese.instatus.com/subscribe/email&gt; for updates and announcements.</p>
<p><small>Feb <var data-var='date'> 19</var>, <var data-var='time'>22:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmlidbff700e2v2z4sn8zkflz</id>
  <published>2026-02-11T16:15:00.000+00:00</published>
  <updated>2026-02-11T19:45:06.985+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmlidbff700e2v2z4sn8zkflz"/>
  <title>OOD inaccessible</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 3 hours and 30 minutes</p>
    <p><strong>Affected Components:</strong> , Cannon Open OnDemand/VDI, FASSE Open OnDemand/VDI, 
VDI/OpenOnDemand →</p>
    <p><small>Feb <var data-var='date'> 11</var>, <var data-var='time'>19:45:06</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved and OOD is working normally..</p>
<p><small>Feb <var data-var='date'> 11</var>, <var data-var='time'>16:15:00</var> GMT+0</small><br /><strong>Investigating</strong> -
  OpenOnDemand for both Cannon and FASSE may be inaccessible for some users. Errors may include: 

&quot;Error -- can&#039;t find user for &lt;username&gt;&quot;

&quot;502 proxy errors&quot;

For users that are able to access OOD, performance may be degraded or sessions may get stuck. 

We are currently investigating the root causes of this incident. Updates to follow .</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmlfnw5qt0xla10jy3plyxdx0</id>
  <published>2026-02-09T21:10:00.000+00:00</published>
  <updated>2026-02-09T21:10:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmlfnw5qt0xla10jy3plyxdx0"/>
  <title>Security updates needed for www.rc.fas.harvard.edu and docs.rc.fas.harvard.edu</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 8 minutes</p>
    <p><strong>Affected Components:</strong> docs.rc.fas.harvard.edu, www.rc.fas.harvard.edu</p>
    <p><small>Feb <var data-var='date'> 9</var>, <var data-var='time'>21:10:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Security updates will require a brief interruption for our primary websites [www.rc.fas.harvard.edu](http://www.rc.fas.harvard.edu) and [docs.rc.fas.harvard.edu](http://docs.rc.fas.harvard.edu)

We will endeavour to keep this update as short as possible. Each site may be unavailable for a few minutes..</p>
<p><small>Feb <var data-var='date'> 9</var>, <var data-var='time'>21:10:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Feb <var data-var='date'> 9</var>, <var data-var='time'>21:17:51</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmlfj7meu0zfk5p7vmpixtiwd</id>
  <published>2026-02-09T18:54:59.922+00:00</published>
  <updated>2026-02-09T18:54:59.922+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmlfj7meu0zfk5p7vmpixtiwd"/>
  <title>License server issue</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 18 minutes</p>
    <p><strong>Affected Components:</strong> Software &amp; Modules, , FIINE billing portal, License Servers, Starfish, FASRC Downloads Site, Citrix, Grafana Cloud (FASRC), docs.rc.fas.harvard.edu, portal.rc.fas.harvard.edu, Spinal, Bauer MiniLIMS, www.rc.fas.harvard.edu, FASRC Offsite Hosting, FASRC Ticket System (ServiceNow), Coldfront, 
Websites &amp; Tools →</p>
    <p><small>Feb <var data-var='date'> 9</var>, <var data-var='time'>18:54:59</var> GMT+0</small><br /><strong>Investigating</strong> -
  New sessions of Matlab are hanging. 

We are currently investigating this incident..</p>
<p><small>Feb <var data-var='date'> 9</var>, <var data-var='time'>19:04:52</var> GMT+0</small><br /><strong>Identified</strong> -
  The affected softwares include: 

Matlab

Mathematica

Gurobi

We are continuing to work on a fix for this incident..</p>
<p><small>Feb <var data-var='date'> 9</var>, <var data-var='time'>19:13:20</var> GMT+0</small><br /><strong>Resolved</strong> -
  The license server is back up, and all software should be performing as expected. 

This incident has been resolved..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmkx2d5pn01lsytjor57r552r</id>
  <published>2026-02-09T13:00:00.000+00:00</published>
  <updated>2026-02-09T13:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmkx2d5pn01lsytjor57r552r"/>
  <title>NESE tape maintenance Feb 9th 2026</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 9 hours</p>
    <p><strong>Affected Components:</strong> NESE (NorthEast Storage Exchange)</p>
    <p><small>Feb <var data-var='date'> 9</var>, <var data-var='time'>13:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  From our partners at NESE. Details follow:

In the process of the tape front-end file caching system upgrade, we will be installing a new IBM Storage Scale System 6000\. We will provide an additional update for when the software integration and data transfer from the current IBM Elastic Storage System 5000 will be performed.

**Service Affected:** NESE Tape Service

**Maintenance Window: No Downtime expected**

NOTES:

* Monitor the MGHPCC Slack #nese channel for status updates and announcements
* Monitor &lt;https://nese.instatus.com/&gt; for real-time updates on progress
* Subscribe to &lt;https://nese.instatus.com/subscribe/email&gt; for updates and announcements.</p>
<p><small>Feb <var data-var='date'> 9</var>, <var data-var='time'>13:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Feb <var data-var='date'> 9</var>, <var data-var='time'>22:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmlbcsgmd000u3t5jsy7daltl</id>
  <published>2026-02-06T20:44:10.404+00:00</published>
  <updated>2026-02-06T20:44:11.631+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmlbcsgmd000u3t5jsy7daltl"/>
  <title>Grafana Cloud (FASRC) is down</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    
    <p><strong>Affected Components:</strong> Grafana Cloud (FASRC)</p>
    <p><small>Feb <var data-var='date'> 6</var>, <var data-var='time'>20:44:11</var> GMT+0</small><br /><strong>Investigating</strong> -
  Grafana Cloud (FASRC) is down at the moment. This incident was automatically created by Instatus monitoring..</p>
<p><small>Feb <var data-var='date'> 6</var>, <var data-var='time'>20:48:52</var> GMT+0</small><br /><strong>Resolved</strong> -
  Grafana Cloud (FASRC) is back up. This incident was automatically resolved by Instatus monitoring..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmkvgvpg4095lzxnkkb2spait</id>
  <published>2026-02-02T14:00:00.000+00:00</published>
  <updated>2026-02-02T14:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmkvgvpg4095lzxnkkb2spait"/>
  <title>FASRC monthly maintenance Monday February 2nd, 2026 9am-1pm</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 4 hours</p>
    <p><strong>Affected Components:</strong> Cannon Open OnDemand/VDI, Login Nodes - Holyoke, , , FASSE Compute Cluster (Holyoke), GPU nodes (Holyoke), , SLURM Scheduler - FASSE, , SLURM Scheduler - Cannon, Cannon Compute Cluster (Holyoke), , FASSE Open OnDemand/VDI, seas_compute, FASSE login nodes, Kempner Cluster CPU, Kempner Cluster GPU, Login Nodes - Boston, Netscratch (Global Scratch), , Boston Compute Nodes, 
Login Nodes → 
Cannon Cluster → 
VDI/OpenOnDemand → 
Kempner Cluster → 
FASSE Cluster → 
login.rc.fas.harvard.edu →</p>
    <p><small>Feb <var data-var='date'> 2</var>, <var data-var='time'>14:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Monthly maintenance will take place on Monday February 2nd, 2026\. Our maintenance tasks should be completed between 9am-1pm.

**NOTICES:**

* Training: Upcoming training from FASRC and other sources can be found on our Training Calendar. at &lt;https://www.rc.fas.harvard.edu/upcoming-training/&gt;
* Status Page: You can subscribe to our status to receive notifications of maintenance, incidents, and their resolution at &lt;https://status.rc.fas.harvard.edu/&gt; (click Get Updates for options).
* We&#039;d love to hear success stories about your or your lab&#039;s use of FASRC. Submit your story [here](https://www.rc.fas.harvard.edu/user-stories/).

**MAINTENANCE TASKS**

Cannon cluster will be paused during this maintenance?: **YES**  
FASSE cluster will be paused during this maintenance?: **YES**

* MaxTime change  
   * Audience: Cluster users  
   * Impact: In order to improve scheduling efficiency and stability, we will be setting a maximum run time on all partitions that have MaxTime set to UNLIMITED to a MaxTime of 3 days. The unrestricted partition will be set to 365 days. Partitions that already have MaxTime set will retain their current setting. Partition owners wishing to set a different MaxTime for their partition should contact FASRC. Note that we do no guarantee uptime and so users should utilize checkpointing to save state in case of node failure.
* Slurm upgrade to 25.11.2  
   * Audience: All cluster users  
   * Impact: Jobs will be paused during maintenance
* OOD node reboots  
   * Audience; All Open OnDemand users  
   * Impact: OOD nodes will reboot during the maintenance window
* Login node reboots  
   * Audience; All login node users  
   * Impact: Login nodes will reboot during the maintenance window

Thank you,  
FAS Research Computing  
&lt;https://docs.rc.fas.harvard.edu/&gt;  
[https://www.rc.fas.harvard.edu/](https://www.rc.fas.harvard.edu/upcoming-training/).</p>
<p><small>Feb <var data-var='date'> 2</var>, <var data-var='time'>14:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Feb <var data-var='date'> 2</var>, <var data-var='time'>18:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Incident/cmkpl3ux008buvdviozj30q33</id>
  <published>2026-01-22T15:06:02.990+00:00</published>
  <updated>2026-01-22T16:04:46.351+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/incident/cmkpl3ux008buvdviozj30q33"/>
  <title>Coldfront down</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 59 minutes</p>
    <p><strong>Affected Components:</strong> Coldfront</p>
    <p><small>Jan <var data-var='date'> 22</var>, <var data-var='time'>16:04:46</var> GMT+0</small><br /><strong>Resolved</strong> -
  Coldfront is operational. Thank you for your patience..</p>
<p><small>Jan <var data-var='date'> 22</var>, <var data-var='time'>15:06:02</var> GMT+0</small><br /><strong>Investigating</strong> -
  We are currently investigating an issues with Coldfront. No ETA..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmk1ivije002e49jtjj5n83yl</id>
  <published>2026-01-12T14:00:00.000+00:00</published>
  <updated>2026-01-12T14:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmk1ivije002e49jtjj5n83yl"/>
  <title>FASRC monthly maintenance Monday January 12th, 2026 9am-1pm</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 4 hours</p>
    <p><strong>Affected Components:</strong> , , FASSE Compute Cluster (Holyoke), SLURM Scheduler - Cannon, GPU nodes (Holyoke), , SLURM Scheduler - FASSE, , Cannon Compute Cluster (Holyoke), , Cannon Open OnDemand/VDI, FASSE Open OnDemand/VDI, seas_compute, Login Nodes - Holyoke, FASSE login nodes, Kempner Cluster CPU, Kempner Cluster GPU, Login Nodes - Boston, , Boston Compute Nodes, 
Login Nodes → 
Cannon Cluster → 
VDI/OpenOnDemand → 
Kempner Cluster → 
FASSE Cluster → 
login.rc.fas.harvard.edu →</p>
    <p><small>Jan <var data-var='date'> 12</var>, <var data-var='time'>14:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Monthly maintenance will take place on January 12th, 2026\. Our maintenance tasks should be completed between 9am-1pm.

**NOTICES:**

* Changes to SEAS partitions, please see tasks below.
* Changes to job age priority weighting, please see tasks below.
* Status Page: You can subscribe to our status to receive notifications of maintenance, incidents, and their resolution at &lt;https://status.rc.fas.harvard.edu/&gt; (click Get Updates for options).
* We&#039;d love to hear success stories about your or your lab&#039;s use of FASRC. Submit your story [here](https://www.rc.fas.harvard.edu/user-stories/).

**MAINTENANCE TASKS**

Cannon cluster will be paused during this maintenance?: **YES**  
FASSE cluster will be paused during this maintenance?:**YES**

* Slurm upgrade to 25.11.1  
   * Audience: All cluster users (Cannon and FASSE)  
   * Impact: Jobs will be paused during maintenance
* In conjunction with SEAS we will modify seas\_gpu and seas\_compute time limits  
   * Audience: SEAS users  
   * Impact:  
   seas\_gpu: will be set to 2 days maximum  
   seas\_compute: will be set to 3 days maximum  
   Existing pending jobs longer than these limits will be set to 2 day and 3 day run times depending on partition.
* Job Age Priority Weight Change  
   * Audience: Cluster users  
   * Impact: We will be adjusting the weight applied to the priority earned by jobs by virtue of their age. Currently job priority is made up of two factors, Fairshare and Job Age. The Job Age factor is currently set such that jobs gain priority over 3 days with a maximum priority equivalent to jobs with Fairshare of 0.5\. This keeps low fairshare jobs from languishing at the bottom of the queue. With the current settings though, users with low fairshare can gain significant advantage over users with higher relative fairshare. To remedy this we will be adjusting the Job Age weight to cap out at an equivalent Fairshare of 0.1\. This will still allow jobs with 0 fairshare to gain priority and thus not languish while letting fairshare govern a wider range of higher priority jobs.
* Login node reboots  
   * Audience; All login node users  
   * Impact: Login nodes will reboot during the maintenance window
* Open OnDemand (OOD) node reboots  
   * Audienc:; All OOD users  
   * Impact: OOD nodes will reboot during the maintenance window
* Netscratch retention will run  
   * Audience: All cluster netscratch users  
   * Impact: Files older than 90 days will be removed. Please note that retention cleanup can and does run at any time, not just during the maintenance window.

Thank you,  
FAS Research Computing  
&lt;https://docs.rc.fas.harvard.edu/&gt;  
[https://www.rc.fas.harvard.edu/](https://www.rc.fas.harvard.edu/upcoming-training/).</p>
<p><small>Jan <var data-var='date'> 12</var>, <var data-var='time'>14:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Jan <var data-var='date'> 12</var>, <var data-var='time'>18:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmi4yi1m400u3o9chvqic8p37</id>
  <published>2025-12-08T11:00:00.000+00:00</published>
  <updated>2025-12-08T11:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmi4yi1m400u3o9chvqic8p37"/>
  <title>Monthly Maintenance and MGHPCC Power Work - Dec. 8, 2025 6am-6pm</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 12 hours</p>
    <p><strong>Affected Components:</strong> Login Nodes - Holyoke, , GPU nodes (Holyoke), FASSE Compute Cluster (Holyoke), Isilon Storage Holyoke (Tier 1), , SLURM Scheduler - FASSE, Virtual Infrastructure - Holyoke, , Cannon Compute Cluster (Holyoke), , Cannon Open OnDemand/VDI, SLURM Scheduler - Cannon, FASSE Open OnDemand/VDI, License Servers, seas_compute, , FASSE login nodes, Kempner Cluster CPU, Kempner Cluster GPU, Login Nodes - Boston, Virtual Infrastructure - Boston, Isilon Storage Boston (Tier 1), , Boston Compute Nodes, 
Login Nodes → 
VDI/OpenOnDemand → 
Kempner Cluster → 
FASSE Cluster → 
Cannon Cluster → 
login.rc.fas.harvard.edu →</p>
    <p><small>Dec <var data-var='date'> 8</var>, <var data-var='time'>11:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Monthly maintenance will take place on December 8th. Our maintenance tasks should be completed between 9am-1pm. However: 

_Additionally_, MGHPCC will be performing power upgrades on the odd side of Row 8A where much of our computer resides. This is the final upgrade for this row. Current estimate for this work is a 12 hour window 6am-6pm.

A list of the affected partitions is provided at the bottom of this notice. The nodes in those partitions will be drained prior to the work and will be powered down. Once the work is completed, those nodes will be returned to service. 

**Notices:**

* New FASSE partition `fasse_gpu_h200`. This partitions has 2 H200 nodes and a 3day limit. It is available now.
* 11/26 - 11/28 are university holidays (Thanksgiving). No on-site support, FASRC staff will return on 12/1.
* Training: Upcoming training from FASRC and other sources can be found on our Training Calendar. at &lt;https://www.rc.fas.harvard.edu/upcoming-training/&gt;
* Status Page: You can subscribe to our status to receive notifications of maintenance, incidents, and their resolution at &lt;https://status.rc.fas.harvard.edu/&gt; (click Get Updates for options).
* We&#039;d love to hear success stories about your or your lab&#039;s use of FASRC. Submit your story [here](https://www.rc.fas.harvard.edu/user-stories/).

**MAINTENANCE TASKS**

Cannon cluster will be paused during this maintenance?: **PARTIAL OUTAGE/YES**  
FASSE cluster will be paused during this maintenance?: **PARTIAL OUTAGE/YES**

* Power work on Row 8A odd  
   * Audience: Users of the partitions listed below  
   * Impact: These nodes and partitions will be fully or partially down all day
* OneFS (Isilon) upgrade  
   * Audience: All Isilon (Tier 1) shares  
   * Impact: Some VMs will be impacted including Cannon OOD, CBScentral, MCZapps/MCZbase, Portal, and Rclic1 (license server)
* Slurm upgrade to 25.05.5  
   * Audience: All cluster users  
   * Impact: Jobs will be paused during maintenance
* Login node reboots  
   * Audience: All login node users  
   * Impact: Login nodes will reboot during the maintenance window

**Impacted Cannon Partitions (Full or Partial Outage):**

* arguelles\_delgado\_gpu\_a100
* arguelles\_delgado\_gpu\_mixed
* bigmem\_intermediate
* blackhole\_gpu
* eddy
* gershman
* gpu\_requeue
* hejazi
* hernquist\_ice
* hoekstra
* huce\_ice
* iaifi\_gpu
* iaifi\_gpu\_priority
* iaifi\_gpu\_requeue
* itc\_gpu
* jshapiro
* kempner
* kempner\_dev
* kempner\_priority
* kempner\_h100
* kempner\_h100\_priority
* kempner\_h100\_priority2
* kempner\_h100\_priority3
* kempner\_interactive
* kempner\_requeue
* kovac
* kozinsky
* kozinsky\_gpu
* kozinsky\_priority
* kozinsky\_requeue
* murphy\_ice
* ortegahernandez\_ice
* rivas
* seas\_compute
* seas\_gpu
* serial\_requeue
* siag\_combo
* siag\_gpu
* sur
* zhuang.</p>
<p><small>Dec <var data-var='date'> 8</var>, <var data-var='time'>11:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Dec <var data-var='date'> 8</var>, <var data-var='time'>23:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmisykt3v0ayak7rmnsl4btnt</id>
  <published>2025-12-05T14:00:00.000+00:00</published>
  <updated>2025-12-05T14:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmisykt3v0ayak7rmnsl4btnt"/>
  <title>holylfs04 migrations</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 4 days, 1 hour and 10 minutes</p>
    <p><strong>Affected Components:</strong> HolyLFS04 (Tier 0)</p>
    <p><small>Dec <var data-var='date'> 5</var>, <var data-var='time'>14:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  The holylfs04 migration to holylfs06 has begun. All holylfs04 folders will be **read-only** for the duration of the migration, from **Friday, December 5th at 9AM until end of day on Monday, December 8th.** 

All labs with holylfs04 have been informed via email; please email [rdm@rc.fas.harvard.edu](mailto:rdm@rc.fas.harvard.edu) if you have any questions..</p>
<p><small>Dec <var data-var='date'> 9</var>, <var data-var='time'>15:09:30</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmhjab8ow00xbfmtmh0g1qmos</id>
  <published>2025-12-01T11:00:00.000+00:00</published>
  <updated>2025-12-01T11:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmhjab8ow00xbfmtmh0g1qmos"/>
  <title>NESE tape system maintenance 12/1/25-12/5/25</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 5 days</p>
    <p><strong>Affected Components:</strong> NESE (NorthEast Storage Exchange), Tape - (Tier 3)</p>
    <p><small>Dec <var data-var='date'> 1</var>, <var data-var='time'>11:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  NESE, the Northeast Storage Exchange at MGHPCC which supplies the Tier3 tape service used by FASRC, will be offline for maintenance on the system Dec 1st - 5th. There will be ongoing performance-affecting maintenance until Dec 12th. Please see below for details.

WHO: Any lab who has or is moving data to tape.

IMPACT: No access 12/1/25 - 12/5/25\. Reduced performance 12/5/25 - 12/12/25.

&gt; NESE tape system maintenance and major software upgrade is scheduled to begin on December 1, 2025\. As a result, the NESE Tape service will be offline from December 1 to December 5.
&gt; 
&gt; Starting December 8 through December 12, the service will be back online with reduced performance. All maintenance activities are planned to conclude on December 12, 2025.
&gt; 
&gt; * Monitor: &lt;https://nese.instatus.com/&gt; for real-time updates on progress
&gt; * Subscribe to &lt;https://nese.instatus.com/subscribe/email&gt; for updates and announcements.</p>
<p><small>Dec <var data-var='date'> 1</var>, <var data-var='time'>11:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Dec <var data-var='date'> 6</var>, <var data-var='time'>11:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmhnyg8g701g4yv5cvuwz2hjz</id>
  <published>2025-11-14T22:00:00.000+00:00</published>
  <updated>2025-11-14T22:00:01.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmhnyg8g701g4yv5cvuwz2hjz"/>
  <title>Starfish dashboard maintenance Nov. 14th 5-6PM</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 1 hour</p>
    <p><strong>Affected Components:</strong> Starfish</p>
    <p><small>Nov <var data-var='date'> 14</var>, <var data-var='time'>22:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Nov <var data-var='date'> 14</var>, <var data-var='time'>23:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>
<p><small>Nov <var data-var='date'> 14</var>, <var data-var='time'>22:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  There is a planned upgrade of the Starfish dashboard scheduled for Friday November 14th starting at 5PM.   
The dashboard will be down for an hour while the upgrade is performed..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmggy7q9801vdtke67ycm4dxq</id>
  <published>2025-11-03T11:00:00.000+00:00</published>
  <updated>2025-11-03T11:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmggy7q9801vdtke67ycm4dxq"/>
  <title>Monthly Maintenance and MGHPCC Power Work - Nov. 3, 2025 6am-6pm</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 9 hours and 4 minutes</p>
    <p><strong>Affected Components:</strong> , , SLURM Scheduler - FASSE, , Kempner Cluster CPU, Cannon Compute Cluster (Holyoke), , Cannon Open OnDemand/VDI, SLURM Scheduler - Cannon, FASSE Open OnDemand/VDI, seas_compute, Kempner Cluster GPU, Login Nodes - Holyoke, , GPU nodes (Holyoke), FASSE Compute Cluster (Holyoke), FASSE login nodes, Login Nodes - Boston, Netscratch (Global Scratch), Boston Compute Nodes, 
Login Nodes → 
VDI/OpenOnDemand → 
Kempner Cluster → 
FASSE Cluster → 
Cannon Cluster →</p>
    <p><small>Nov <var data-var='date'> 3</var>, <var data-var='time'>11:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Monthly maintenance will take place on November 3rd. Additionally, MGHPCC will be performing power upgrades on the even side of Row 8A where much of our computer resides. A further upgrade will take place Dec. 8th on the odd side.

A list of the affected partitions is provided at the bottom of this notice. The nodes in those partitions will be drained prior to the work and will be powered down. Once the work is completed, those nodes will be returned to service. Current estimate is a 12 hour window. We will adjust as we know more.

**MAINTENANCE TASKS**  
Cannon cluster will be paused during this maintenance?: **PARTIAL OUTAGE/YES**  
FASSE cluster will be paused during this maintenance?: **PARTIAL OUTAGE/YES**

* Power work on Row 8A Even  
   * Audience: Users of the partitions listed below  
   * Impact: These nodes and partitions will be fully or partially down all day
* Slurm upgrade to 25.05.4  
   * Audience: All cluster users  
   * Impact: Jobs will be paused during maintenance
* Block repo.anaconda.com cluster wide  
   * Audience: Anyone attempting to use repo.anaconda.com  
   * Impact: This change should not impact your Python workflow on the cluster. But if it does, consider using the open-source channel, `conda-forge`, through Miniforge distribution to install Python packages. This can be done by following our instructions on &lt;https://docs.rc.fas.harvard.edu/kb/python-package-installation/&gt;
* Change Slurm User to Local User  
   * Audience: All cluster users  
   * Impact: Behind the scenes. No impact to users
* Login node reboots (morning)  
   * Audience: Anyone logged into a FASRC Cannon or FASSE login node  
   * Impact: All login nodes will rebooted during this maintenance window
* Netscratch cleanup ( &lt;https://docs.rc.fas.harvard.edu/kb/policy-scratch/&gt; )  
   * Audience: Cluster users  
   * Impact: Files older than 90 days will be removed. Please note that retention cleanup can and does run at any time, not just during the maintenance window.

**AFFECTED PARTITIONS** 
Nov. 3, 2025 - All Day Power Work  
Partial or Full Outage Apples to:

arguelles\_delgado\_h100

bigmem

dvorkin

eddy

enos

gpu

gpu\_h200

gpu\_requeue

hsph

hsph\_gpu

intermediate

itc\_cluster

joonholee

jshapiro

kempner\_dev

kemkpner\_eng

kempner\_requeue

mweber\_compute

mweber\_gpu

olveczky\_sapphire

sapphire

seas\_compute

seas\_gpu

serial\_requeue

yao

yao\_gpu

yao\_priority

test.</p>
<p><small>Nov <var data-var='date'> 3</var>, <var data-var='time'>11:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Nov <var data-var='date'> 3</var>, <var data-var='time'>20:04:13</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully including power work at MGHPCC.

A reminder that additional all-day power work will take place on Dec 8th, along with our maintenance from 9am-1pm.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmg1cda8s098z4gs4ylropic7</id>
  <published>2025-10-06T13:00:00.000+00:00</published>
  <updated>2025-10-06T13:00:01.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmg1cda8s098z4gs4ylropic7"/>
  <title>FASRC monthly maintenance Monday October 6th, 2025 9am-1pm</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 4 hours</p>
    <p><strong>Affected Components:</strong> , Network - Cambridge, Network - Boston, Network - Holyoke/MGHPCC, Login Nodes - Boston, Netscratch (Global Scratch), Login Nodes - Holyoke, FASSE login nodes, 
Login Nodes →</p>
    <p><small>Oct <var data-var='date'> 6</var>, <var data-var='time'>13:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Oct <var data-var='date'> 6</var>, <var data-var='time'>13:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  FASRC monthly maintenance will take place Monday October 6th, 2025 from 9am-1pm

**NOTICES**

* Training: Upcoming training from FASRC and other sources can be found on our Training Calendar. at &lt;https://www.rc.fas.harvard.edu/upcoming-training/&gt;
* Status Page: You can subscribe to our status to receive notifications of maintenance, incidents, and their resolution at &lt;https://status.rc.fas.harvard.edu/&gt; (click Get Updates for options).
* Upcoming holidays: Columbus / Indigenous Peoples’ Day - October 13

**MAINTENANCE TASKS**  
Cannon cluster will be paused during this maintenance?: **NO**  
FASSE cluster will be paused during this maintenance?: **NO**

* DNS server reboots  
   * Audience: All FASRC services  
   * Impact: Rolling reboot should have no impact
* Login node reboots  
   * Audience: Anyone logged into a FASRC Cannon or FASSE login node  
   * Impact: All login nodes will rebooted during this maintenance window
* Netscratch cleanup ( &lt;https://docs.rc.fas.harvard.edu/kb/policy-scratch/&gt; )  
   * Audience: Cluster users  
   * Impact: Files older than 90 days will be removed. Please note that retention cleanup can and does run at any time, not just during the maintenance window.

Thank you,  
FAS Research Computing  
&lt;https://docs.rc.fas.harvard.edu/&gt;  
&lt;https://www.rc.fas.harvard.edu/&gt;.</p>
<p><small>Oct <var data-var='date'> 6</var>, <var data-var='time'>17:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance is now in progress.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmg5awjiw04n710wcsl3zi0gv</id>
  <published>2025-09-30T06:00:00.000+00:00</published>
  <updated>2025-09-30T07:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmg5awjiw04n710wcsl3zi0gv"/>
  <title>VPN concentrator rolling updates overnight</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    
    
    <p><small>Sep <var data-var='date'> 30</var>, <var data-var='time'>07:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>
<p><small>Sep <var data-var='date'> 30</var>, <var data-var='time'>06:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Networking will be patching the VPN concentrators overnight. This will be done in a rolling order so that one ore more are always online.   
  
This may cause active VPN connections to drop, but they can be re-connected shortly after. ETA is one hour total..</p>
<p><small>Sep <var data-var='date'> 30</var>, <var data-var='time'>06:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Networking will be patching the VPN concentrators overnight. This will be done in a rolling order so that one ore more are always online.   
  
This may cause active VPN connections to drop, but they can be re-connected shortly after. ETA is one hour total..</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmevo0dad0012rwy6qv5mv34j</id>
  <published>2025-09-08T13:00:00.000+00:00</published>
  <updated>2025-09-08T13:00:01.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmevo0dad0012rwy6qv5mv34j"/>
  <title>FASRC monthly maintenance Monday September 8th, 2025 9am-1pm</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 4 hours</p>
    <p><strong>Affected Components:</strong> Kempner Cluster CPU, Kempner Cluster GPU, , Cannon Compute Cluster (Holyoke), Boston Compute Nodes, Netscratch (Global Scratch), , , SLURM Scheduler - FASSE, , GPU nodes (Holyoke), Login Nodes - Boston, FASSE login nodes, Login Nodes - Holyoke, , FASSE Open OnDemand/VDI, Cannon Open OnDemand/VDI, SLURM Scheduler - Cannon, FASSE Compute Cluster (Holyoke), seas_compute, 
Login Nodes → 
Cannon Cluster → 
FASSE Cluster → 
VDI/OpenOnDemand → 
Kempner Cluster →</p>
    <p><small>Sep <var data-var='date'> 8</var>, <var data-var='time'>13:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Sep <var data-var='date'> 8</var>, <var data-var='time'>17:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>
<p><small>Sep <var data-var='date'> 8</var>, <var data-var='time'>13:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  FASRC monthly maintenance will take place Monday September 8th, 2025 from 9am-1pm

**NOTICES**

* Training: Upcoming training from FASRC and other sources can be found on our Training Calendar. at &lt;https://www.rc.fas.harvard.edu/upcoming-training/&gt;
* Status Page: You can subscribe to our status to receive notifications of maintenance, incidents, and their resolution at &lt;https://status.rc.fas.harvard.edu/&gt; (click Get Updates for options).
* Upcoming holidays: Labor Day, Monday September 1st

**MAINTENANCE TASKS**  
Cannon cluster will be paused during this maintenance?: **YES**  
FASSE cluster will be paused during this maintenance?: **YES**

* Slurm Upgrade to 25.05.2  
   * Audience: All cluster users  
   * Impact: Jobs and the scheduler will be paused during this upgrade
* Domain controller work  
   * Audience: Internal network  
   * Impact: No impact expected
* Login node reboots  
   * Audience: Anyone logged into a FASRC Cannon or FASSE login node  
   * Impact: All login nodes will rebooted during this maintenance window
* Netscratch cleanup ( &lt;https://docs.rc.fas.harvard.edu/kb/policy-scratch/&gt; )  
   * Audience: Cluster users  
   * Impact: Files older than 90 days will be removed. Please note that retention cleanup can and does run at any time, not just during the maintenance window.

Thank you,  
FAS Research Computing  
&lt;https://docs.rc.fas.harvard.edu/&gt;  
&lt;https://www.rc.fas.harvard.edu/&gt;.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cmehjmf9q000dam879fafq3vq</id>
  <published>2025-08-20T21:00:00.000+00:00</published>
  <updated>2025-08-20T21:00:01.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cmehjmf9q000dam879fafq3vq"/>
  <title>Starfish upgrade Wednesday, August 20th 5PM-7PM</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 2 hours</p>
    <p><strong>Affected Components:</strong> Starfish</p>
    <p><small>Aug <var data-var='date'> 20</var>, <var data-var='time'>21:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Aug <var data-var='date'> 20</var>, <var data-var='time'>21:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Starfish will be performing an upgrade on Wednesday, August 20th from 5PM-7PM. The web interface will be unavailable during that timeframe..</p>
<p><small>Aug <var data-var='date'> 20</var>, <var data-var='time'>23:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.rc.fas.harvard.edu,2005:Maintenance/cme1un2le0626okcc24iokqze</id>
  <published>2025-08-11T10:00:00.000+00:00</published>
  <updated>2025-08-11T10:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.rc.fas.harvard.edu/maintenance/cme1un2le0626okcc24iokqze"/>
  <title>SEAS: seas_gpu partition GPU upgrades 8/11 - 8/14</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 4 days, 9 hours and 39 minutes</p>
    <p><strong>Affected Components:</strong> seas_compute</p>
    <p><small>Aug <var data-var='date'> 11</var>, <var data-var='time'>10:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  Between 8/11/25 6AM - 8/14/25 5PM FASRC will be upgrading 14 of the H100 GPU nodes in the `seas_gpu `partition to H200 GPUs. This sill also affect `mweber_gpu`

A reservation has been set which will drain the nodes of jobs prior to the maintenance. The SEAS GPU partition will be running at 75% capacity during these updates. FASRC has hundreds of GPU, so users should feel free to utilize `gpu_requeue` if needed for their jobs.

Affected nodes: 

`mweber_gpu` nodes (13):

```
holygpu8a[18204,18301-18304,18401-18404,18501-18502,18601-18602]
```

seas\_gpu nodes (14):

```
holygpu8a[16101-16104,16201-16204,16301-16304,16401-16402]
```

Please reach out to [rchelp@rc.fas.harvard.edu](mailto:rchelp@rc.fas.harvard.edu) if you have any questions or concerns..</p>
<p><small>Aug <var data-var='date'> 15</var>, <var data-var='time'>14:13:24</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is still in progress - imaging of the new H200 nodes is ongoing. Current ETA is end of day Friday. For further questions, please contact rchelp@rc.fas.harvard.edu.</p>
<p><small>Aug <var data-var='date'> 15</var>, <var data-var='time'>19:39:27</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully. SEAS H200 nodes have been imaged and are back in service. .</p>
<p><small>Aug <var data-var='date'> 11</var>, <var data-var='time'>10:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>

        ]]>
  </content>
</entry>

</feed>