{"id":3068,"date":"2009-10-08T18:07:14","date_gmt":"2009-10-08T22:07:14","guid":{"rendered":"https:\/\/www.bu.edu\/tech\/research\/scv_import\/computation\/storage\/proj-diskspace\/"},"modified":"2026-06-01T12:59:36","modified_gmt":"2026-06-01T16:59:36","slug":"proj-diskspace","status":"publish","type":"page","link":"https:\/\/www.bu.edu\/tech\/support\/research\/computing-resources\/file-storage\/proj-diskspace\/","title":{"rendered":"Project Disk Space"},"content":{"rendered":"<h2>Overview<\/h2>\n<p>The Project Disk Space file system comprises twelve petabytes of usable high performance online storage for research computing projects. Project Disk Space is allocated to individual research projects for exclusive use by its members, facilitating collaboration.<\/p>\n<p>Each project is allocated a limited amount of Free Baseline quota. Those projects requiring additional may either purchase Project Disk through the <a title=\"Buy-in\" href=\"https:\/\/www.bu.edu\/tech\/support\/research\/computing-resources\/service-models\/buy-in\/\">Buy-in program<\/a> or rent additional Project Disk through the <a title=\"SAAS\" href=\"https:\/\/www.bu.edu\/tech\/support\/research\/computing-resources\/file-storage\/proj-diskspace\/#SAAS\">Storage-as-a-Service<\/a> program.<\/p>\n<p>All Project Disk Space is protected by hardware RAID (protecting against disk failures) and daily <a title=\"Snapshots\" href=\"https:\/\/www.bu.edu\/tech\/support\/research\/computing-resources\/file-storage\/#Snapshots\">Snapshots<\/a> (protect against accidental deletion of files).<\/p>\n<h2>Kinds of Project Disk Partitions<\/h2>\n<p>There are four Project Disk Space partitions on the SCC: <strong>\/project<\/strong>, <strong>\/projectnb<\/strong>, <strong>\/restricted\/project<\/strong>, and <strong>\/restricted\/projectnb<\/strong>. These four partitions have identical performance characteristics. The two <strong>\/restricted<\/strong> partitions are dbGaP compliant for data that needs it (primarily Genomics projects). The two <strong>\/project<\/strong> partitions are backed up nightly to an independent off-site system for disaster recovery and the two <strong>nb<\/strong> partitions are not-backed-up. Regardless, <a title=\"Snapshots\" href=\"https:\/\/www.bu.edu\/tech\/support\/research\/computing-resources\/file-storage\/#Snapshots\">Snapshots<\/a> are implemented on all four partitions enabling users to easily retrieve accidentally deleted files.<\/p>\n<h2>Data Protection Requirements<a name=\"DBGAP\" href=\"#DBGAP\">&#x1f517;<\/a><\/h2>\n<p>A portion of the SCC Project Disk Space is set up to be used for processing and storing Confidential data such as a HIPAA Limited Data Set (DOB, DOD, dates of treatment, City, Zip Code) and <a href=\"http:\/\/www.ncbi.nlm.nih.gov\/gap\/\">dbGaP<\/a> data. <strong>Restricted Use data<\/strong>, such as <strong>HIPAA <\/strong>or individually identifiable health information <strong>may not be stored<\/strong> on any partition of the SCC. The allowed Confidential data may be stored only in the \/restricted\/projectnb and \/restricted\/project partitions and can be accessed from all SCC compute nodes but only the\u00a0<strong>scc4.bu.edu<\/strong>\u00a0login node and <strong>scc-ondemand.bu.edu<\/strong>\u00a0web interface.<\/p>\n<p><em><strong>Public<\/strong><\/em> and <em><strong>Internal <\/strong><\/em>data may be stored on \/project and \/projectnb and accessed from the other login nodes as well as all compute nodes.<\/p>\n<p>Please see <a href=\"http:\/\/www.bu.edu\/policies\/data-classification-policy\/\">http:\/\/www.bu.edu\/policies\/data-classification-policy<\/a>\u00a0 for definitions and more information.<\/p>\n<p>For questions about how your data is classified, please send email to <a href=\"mailto:bumcinfosec@bu.edu\">bumcinfosec@bu.edu<\/a>.For questions about using SCC and Project Disk Space, send email to <a href=\"mailto:help@scc.bu.edu\">help@scc.bu.edu<\/a>.<\/p>\n<h2>Allocations<\/h2>\n<p>Project Disk allocations can be in the form of any of three types: Free, Buy-in, or Storage-as-a-Service. Functionally, rented and purchased Project Disk augment and are largely indistinguishable from free storage.<\/p>\n<p>Forms for requesting both Free and Storage-as-a-Service space can be found with the other project management web pages on TechWeb on your <a href=\"https:\/\/acct.bu.edu\/cgi-bin\/perl\/secure\/redirect_sccmgmt.pl\">SCC Management Page<\/a>. For Buy-in space, email <a href=\"mailto:buyin@rcs.bu.edu\">buyin@rcs.bu.edu<\/a>.<\/p>\n<h3>Free Baseline Quota<a name=\"BASELINE\" href=\"#BASELINE\">&#x1f517;<\/a><\/h3>\n<p>By default, new projects on the SCC are generally created with 200 GB in <b>\/project<\/b> and 800 GB in <b>\/projectnb<\/b>. The Lead Project Investigator (LPI) can specify whether or not it should be dbGaP compliant. Additional Not Backed-Up space can be purchased through either <a href=\"https:\/\/www.bu.edu\/tech\/support\/research\/computing-resources\/service-models\/buy-in\/\">Buy-In<\/a> or <a href=\"https:\/\/www.bu.edu\/tech\/support\/research\/computing-resources\/file-storage\/proj-diskspace\/#SAAS\">Storage-as-a-Service<\/a>. For LPIs with multiple projects, there is an additional limit of a maximum of 3000 GB (with a maximum of 600 GB of that backed up) of Free Baseline quota across all projects.<\/p>\n<p>Application form: <a href=\"https:\/\/acct.bu.edu\/cgi-bin\/perl\/secure\/redirect_sccmgmt.pl\">SCC Management Page<\/a><\/p>\n<h3>Purchasing\/Renting Storage through the <a name=\"BUYIN\"><\/a>Buy-in and <a name=\"SAAS\"><\/a>Storage-as-a-Service programs<a name=\"BUYINandSAAS\" href=\"#BUYINandSAAS\">&#x1f517;<\/a><\/h3>\n<p>The highly successful Buy-in Program is a convenient way to acquire dedicated storage at highly subsidized rates for an extended period of time (5 years). Any Researcher interested should contact <a href=\"mailto:buyin@rcs.bu.edu\">buyin@rcs.bu.edu<\/a> or review the <a href=\"https:\/\/www.bu.edu\/tech\/support\/research\/computing-resources\/service-models\/buy-in\/storage\/\">Buy-in options<\/a> web pages. The current cost is [TBD, expected to be ~$100\/Terabyte\/5years].<\/p>\n<p>The Storage-as-a-Service program offers researchers an option to acquire additional disk quota for a flexible time duration at a subsidized rate of [TBD, expected ~$25\/Terabyte\/year]. Allocations are in whole Terabyte (1000 Gigabyte) units only. To purchase an allocation through this program, the LPI should fill in the request form for Storage-as-a-Service and include their Financial Contact information. The Financial Contact will receive details on how to send an Internal Service Request for transmitting payment. The application form is found on each LPI&#8217;s <a href=\"https:\/\/acct.bu.edu\/cgi-bin\/perl\/secure\/redirect_sccmgmt.pl\">SCC Management Page<\/a>.<\/p>\n<p>All grant rules apply when using grant funds.<\/p>\n<p><strong>Buy-in vs Storage-as-a-Service Comparison<a name=\"BUYINvsSAAS\" href=\"#BUYINvsSAAS\" style=\"text-decoration: none;\">&#x1f517;<\/a><\/strong><\/p>\n<div style=\"overflow: auto;\">\n<table class=\"research\" width=\"580\" border=\"1\">\n<tbody>\n<tr class=\"section-heading\" valign=\"top\" align=\"left\">\n<th style=\"background-color: #ffffff;\"><\/th>\n<th>Buy-in<\/th>\n<th>Storage-as-a-Service<\/th>\n<\/tr>\n<\/tbody>\n<tbody>\n<tr valign=\"top\" align=\"left\">\n<td><b>Model<\/b><\/td>\n<td>Purchase<\/td>\n<td>Rental<\/td>\n<\/tr>\n<tr valign=\"top\" align=\"left\">\n<td><b>Time Horizon<\/b><\/td>\n<td>5 years<\/td>\n<td>6 months+<\/td>\n<\/tr>\n<tr valign=\"top\" align=\"left\">\n<td><b>Annual Cost<\/b><\/td>\n<td>TBD (expected ~$20\/TB)<\/td>\n<td>TBD (expected ~$25\/TB)<\/td>\n<\/tr>\n<tr valign=\"top\" align=\"left\">\n<td><b>Billing Schedule<\/b><\/td>\n<td>Full five year cost paid up front when large storage array purchase occurs. This is generally 0-4 months after a request comes in.<\/td>\n<td>Billed (and pro-rated for periods less than one year) annually by fiscal year<\/td>\n<\/tr>\n<tr valign=\"top\" align=\"left\">\n<td><b>Minimum Purchase<\/b><\/td>\n<td>10 TB<\/td>\n<td>1 TB<\/td>\n<\/tr>\n<tr valign=\"top\" align=\"left\">\n<td><b>Storage Availability<\/b><\/td>\n<td>Generally immediately via &#8220;Loaner&#8221; space until actual purchase but not always<\/td>\n<td>Immediately<\/td>\n<\/tr>\n<tr valign=\"top\" align=\"left\">\n<td><b>Capital Expense?<\/b><\/td>\n<td>Yes<\/td>\n<td>No<\/td>\n<\/tr>\n<tr valign=\"top\" align=\"left\">\n<td><b>Fully fungible\/<br \/>\nTransferable between projects<\/b><\/td>\n<td>Yes<\/td>\n<td>Yes<\/td>\n<\/tr>\n<tr valign=\"top\" align=\"left\">\n<td><b>Recommended For<\/b><\/td>\n<td>Large, long-term purchases<\/td>\n<td>Small and\/or short term purchases<\/td>\n<\/tr>\n<tr valign=\"top\" align=\"left\">\n<td><b>How to begin purchase?<\/b><\/td>\n<td>Email <a href=\"mailto:buyin@rcs.bu.edu\">buyin@rcs.bu.edu<\/a><\/td>\n<td>Submit appropriate form on your <a href=\"https:\/\/acct.bu.edu\/cgi-bin\/perl\/secure\/redirect_sccmgmt.pl\">SCC Management Page<\/a>. Note that only Lead Project Investigators (LPIs) can submit this form, not IT\/Administrative Contacts.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<h2>Accessing Project Disk Space<\/h2>\n<p>When a project is created on the SCC, subdirectories will be created for the project under the appropriate <b>\/project<\/b>, <b>\/projectnb<\/b>, <span style=\"white-space: nowrap;\"><b>\/restricted\/project<\/b><\/span>, and\/or <span style=\"white-space: nowrap;\"><b>\/restricted\/projectnb<\/b><\/span> directories. These subdirectories will have the same name as the project and will be writable by any member of the project. The structure and access to the files and subdirectories created under the project\u2019s directory is entirely at the discretion of the project members. The Unix \u201cgroup\u201d file permission mechanism can be used to control permissions for the project\u2019s subdirectories (see the <a href=\"http:\/\/scv.bu.edu\/cgi-bin\/perl\/manscript\/SGI\/chmod\/1\/\">man page for \u201cchmod\u201d<\/a> for more details).<\/p>\n<h2>Limitations on Number of Files<a name=\"NUMBEROFFILES\" href=\"#NUMBEROFFILES\">&#x1f517;<\/a><\/h2>\n<p>In addition to the quota on the total size of your files, there is also a limitation on the number of files you can have. The system does not operate well if people have many millions of very small files. It is much better to have a smaller number of somewhat larger files. This limitation only affects a fairly small number of people. There are three formulas in play for this calculation. If your directory has a file size quota of 16TB or more, you are allowed 2 files per MB of quota. This works out to 33.5 million files at 16TB and 200 million files (which is the maximum allowable in any partition) at 100TB or higher. For smaller allocations, you are allowed 1 file for each 32KB of space you have up to 1 TB or 33 million files. After that, the file quantity limit only goes up after 16TB as explained earlier. Here is a table with the # of files limits for certain quotas.<\/p>\n<table>\n<tbody>\n<tr>\n<th>Quota (GB)<\/th>\n<th>Quota (Files)<\/th>\n<\/tr>\n<tr>\n<td>200<\/td>\n<td>6.5 Million<\/td>\n<\/tr>\n<tr>\n<td>1,024<\/td>\n<td>33.5 Million<\/td>\n<\/tr>\n<tr>\n<td>16,384<\/td>\n<td>33.5 Million<\/td>\n<\/tr>\n<tr>\n<td>100,000<\/td>\n<td>200 Million<\/td>\n<\/tr>\n<tr>\n<td>200,000<\/td>\n<td>200 Million<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2>Quota Enforcement<a name=\"QUOTA\" href=\"#QUOTA\">&#x1f517;<\/a><\/h2>\n<p>Project Disk Space quotas on the SCC are enforced by the file system. Daily email reminders are sent to the project&#8217;s Lead Project Investigator and all project members to let them know when the project is over its quota, including breaking down how much space each user is using. Projects have a <b>soft limit<\/b> equal to their granted quota and a <b>hard limit<\/b> 10% greater (with a maximum of 100GB over the quota, regardless of its size). Projects can never exceed their hard limit and can only go over their soft limit for a maximum of 7 days. A project over its limit simply needs to delete enough files to get under the soft limit to have full write access restored immediately.<\/p>\n<p>To help manage the project members\u2019 Project Disk usage, LPIs may specify a limit for each individual researcher. By default, each project member\u2019s limit is set to project\u2019s full allocation. A LPI may reassign individual quotas at any time using the Project Disk Space update form found at the link above. These individual quotas are enforced by the honor system, with email reminders sent daily to the Lead Project Investigator and user who is over his or her personal quota.<\/p>\n<p>LPIs and users may review a daily record of their project\u2019s and individual Project Disk usage via their <a href=\"https:\/\/acct.bu.edu\/cgi-bin\/perl\/secure\/redirect_sccmgmt.pl\">SCC Management Page<\/a>; more detailed information is available for LPIs. They may also use the command <a href=\"https:\/\/www.bu.edu\/tech\/support\/research\/rcs-archive\/system-usage-old\/using-scc\/managing-files\/#QUOTA\"><code><span style=\"white-space: nowrap;\"><span class=\"command\">pquota -u<\/span> <span class=\"placeholder\">projectname<\/span><\/span><\/code><\/a> on the system to see a breakdown of Project Disk usage for a given project.<\/p>\n<p>Please note that the <code><span class=\"command\">quota -v<\/span><\/code> command will display a user\u2019s home directory usage, not Project Disk Space usage.<\/p>\n<p>Two helpful Linux commands for determining disk usage are <code><span class=\"command\">du<\/span><\/code> and <code><span class=\"command\">df -h .<\/span><\/code>. Researchers who keep all of their files in their own subdirectory can <code><span class=\"command\">cd<\/span><\/code> to that directory and type <code><span class=\"command\">du -sk<\/span><\/code> to display their usage. You can see your project\u2019s overall usage, available space, and <b>hard limit<\/b> quota by running the command <code><span class=\"command\">df -h .<\/span><\/code> anywhere inside of your group&#8217;s appropriate Project Disk Space directory.<\/p>\n<h2>Backed up vs. Not-backed-up Project Disk<a name=\"BACKEDUPvsNOTBACKEDUP\" href=\"#BACKEDUPvsNOTBACKEEDUP\">&#x1f517;<\/a><\/h2>\n<p>Most computational research projects will need a combination of <strong>\/project<\/strong> (backed-up) and <strong>\/projectnb<\/strong> (not-backed-up) disk space. Files on the <strong>\/project<\/strong> partitions are backed up nightly while those on the <strong>\/projectnb<\/strong> partitions are not backed up. The <strong>\/projectnb<\/strong> partitions are appropriate for most files used in computational research on the SCC.<\/p>\n<p>Backing up files requires additional resources and expense. We ask that you use the <strong>\/project<\/strong> partitions only for files that need to be backed up. These will be restorable in the event of catastrophic failure.<\/p>\n<p>Files that should be stored in <strong>\/project<\/strong> and backed up are those that are being edited (e.g. codes), files that do not have a copy elsewhere, and files that cannot be regenerated.<\/p>\n<p>Files that should be stored in <strong>\/projectnb<\/strong>:<\/p>\n<ol><\/ol>\n<ul>\n<li>Data which exists elsewhere and is copied to the Project Disk Space for high performance access during computation.<\/li>\n<li>Data which can be easily regenerated.<\/li>\n<li>Data which is needed for only a short time.<\/li>\n<li>Newly generated data which will be copied to another system for storage.<\/li>\n<\/ul>\n<ol><\/ol>\n<p>If you accidentally delete or corrupt files stored in any of the Project Disk partitions, you may locate them in the <a title=\"Snapshots\" href=\"https:\/\/www.bu.edu\/tech\/about\/research\/computation\/file-storage\/#Snapshots\">Snapshots<\/a> and copy them into your directory.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Overview The Project Disk Space file system comprises twelve petabytes of usable high performance online storage for research computing projects. Project Disk Space is allocated to individual research projects for exclusive use by its members, facilitating collaboration. Each project is allocated a limited amount of Free Baseline quota. Those projects requiring additional may either purchase&#8230;<\/p>\n","protected":false},"author":1692,"featured_media":0,"parent":15386,"menu_order":1,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/www.bu.edu\/tech\/wp-json\/wp\/v2\/pages\/3068"}],"collection":[{"href":"https:\/\/www.bu.edu\/tech\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.bu.edu\/tech\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.bu.edu\/tech\/wp-json\/wp\/v2\/users\/1692"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bu.edu\/tech\/wp-json\/wp\/v2\/comments?post=3068"}],"version-history":[{"count":51,"href":"https:\/\/www.bu.edu\/tech\/wp-json\/wp\/v2\/pages\/3068\/revisions"}],"predecessor-version":[{"id":161924,"href":"https:\/\/www.bu.edu\/tech\/wp-json\/wp\/v2\/pages\/3068\/revisions\/161924"}],"up":[{"embeddable":true,"href":"https:\/\/www.bu.edu\/tech\/wp-json\/wp\/v2\/pages\/15386"}],"wp:attachment":[{"href":"https:\/\/www.bu.edu\/tech\/wp-json\/wp\/v2\/media?parent=3068"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}