From: Jhon Mojica - NOAA Affiliate <jhon.mojica@noaa.gov>
Date: Sun, Mar 31, 2024 at 2:15 PM
Subject: NCEI datasets accession number # 0247167 and #0248362
To: Rebecca Wenker - NOAA Affiliate <rebecca.wenker@noaa.gov>

Dear Rebeca,

As we talked couple of months ago, before the end of March 2024, I want to share and submit in the CPD with you the update datasets for the projects with NCEI accession number:

# 0247167: Water Quality Data from the R/V Walton Smith along the Florida Reef Tract, Florida Bay, and Southwest Florida.

# 0248362: Water Quality, Nutrients, Chlorophyll-A and Microbial Source Tracking Data in Biscayne Bay, Government Cut and Offshore Reefs - Florida

Regarding # 0247167. I send you the files: SFER_data_complete

Regarding # 0248362. I send you the files: BB data (Win standards) complete.

In the middle of 2023, we started a new project called Coral Gables Water Wase (CGWW), soon we will get a first-year data sampling, and I want to know the procedure to get a new NCEI accession number or protocol to submit this new dataset.

Thanks for your help, and please let me know if you need more details about the data shared and the new one.

Have a great weekend,

Best.
__________
From: Jhon Mojica - NOAA Affiliate <jhon.mojica@noaa.gov>
Date: Fri, Apr 5, 2024 at 11:27 AM
Subject: Re: NCEI datasets accession number # 0247167 and #0248362
To: Rebecca Wenker - NOAA Affiliate <rebecca.wenker@noaa.gov>

Dear Rebeca, Please check below the replies (in red):

Accession 0247167:

The updated file that you sent for this accession, "SFER_data_complete.xlsx", contains an additional sheet ("Sheet1") that I believe isn't needed in the data file?
Reply: Yes, that's correct. I sent the document again deleting this sheet_1. New documents: AOML_SFP_regional_WQ_surface_v18.CSV (DATA), AOML_SFP_regional_WQ_Metadata.CSV (METADATA).

Also, the format of this file differs from the previous data file currently archived in accession 0247167, as well as the one that Alexandra sent before her departure back in December, which I have attached ("AOML_SFP_regional_WQ_surface_v17 (updated 10-11-23).csv"). The file you sent has several additional columns with data that wasn't previously archived, nor were they in the file Alexandra sent as a potential update, so I wanted to make sure that they were intended to be included and in this format!
Reply: I'm sorry I shared with you a raw file instead of the one in the right format (mistake to be in a rush before fieldwork departure). Please check the file attached where I just include the new data that keeps the same format (number of columns). New file Version 18.

If the new data file format is indeed the one you'd like to use, we would need a data dictionary file defining the column headers and contents, units used, any codes in the data, etc. For example, in the accession link above, the file "AOML_SouthFlorida_WaterQualityMetadata.csv" was submitted as a data dictionary to describe the data file. I also attached the file "SFER_Cruise_Metadata.csv" which Alexandra included in her email in December as a data dictionary.
Reply: No need to include the new data columns or more information in the dictionary. I'm sharing with you the new version 18, where you can see the new data from line 10631, that covers from March 2, 2023, until November 15, 2023, keeping the same previous format. On the side, I'll talk with my colleagues here to define and be sure if we can/need/want to share the new data columns. In a positive case, I'll include more info in the data dictionary and include the new columns so that not perturb the previous data shared. I'll keep you posted about it.

Finally, Alexandra mentioned in her email that "...it might also be worth changing the description [for this accession] slightly since we are no longer collecting samples on the R/V Walton Smith. It should say something like 'Water Quality Data from the South Florida Ecosystem Restoration Cruises along the Florida Reef Tract, Florida Bay and Western Florida Shelf.'" I can make that change to the title if that is the case, and update the abstract to note which years the R/V Walton Smith was used. Just let me know!
Reply: To contextualize you. We recorded the data on board the R/V Hogarth for November 2023, January 2024, and March 2024; but we got a green light to come back and use the R/V Walton Smith for the next expeditions. Then to avoid in a future this possible confusion seems a better idea to update the name of this dataset to: 'Water Quality Data from South Florida Ecosystem Restoration Cruises along the Florida Reef Track'.

Accession 0248362:

The updated file that you sent for this accession, "BB data (WIN Standards)_complete.xlsx", also differs in format from the data file currently archived in accession 0248362, so I wanted to ensure that this was what you meant to update it with! To me the format in the file you sent is slightly more confusing for a potential data user, compared to the previous format. However, let me know if this new format is the one you intend to move forward with. If so, we would need a data dictionary for it as well.
Reply: Agree with you, this new format is more confusing for potential data users. I shared this with you because we follow the regulations as a NELAC lab certified. However right now to avoid confusion with potential users and keep consistency, I'm sharing the file in the previous format. File name: AOML_BiscayneBay_WaterQualityData_FY23.CSV. I'll ask here at AOML if we want to update the files with this new NELAC format. In the affirmative case, I'll share it again with you, including a new data dictionary. So far, we can keep the same previous format.

You included the Site Descriptions as a sheet in the Excel file which is helpful, and I am curious if it differs from the "AOML_BiscayneBay_SiteDescriptions.csv" file currently archived. If so, we would need to remove that current file to ensure the accuracy of the metadata.
Reply: The site description document is right and matches the previous one I shared. But as I'm sharing a new file in the previous format, the site description document is correct and enough.

Finally, a note for both this accession and the previous one - while we accept Excel files with multiple sheets, separate CSV files are preferred if possible! An example would be in the 0248362 accession link above, where there are separate CSV files for the site descriptions, water quality data, and data dictionary/metadata.
Reply: All the files shared are in CSV format, I'll keep this in mind for the ones in the future.

Submitting new data via S2N:

To submit the first-year sampling data from the Coral Gables Water Wase (CGWW) project, you can follow the instructions in the attached PDF file "Archiving instructions using Send2NCEI.pdf" to submit via S2N: https://www.ncei.noaa.gov/archive/send2ncei/. Once submitted, we will evaluate it and once it is ready to go will assign it a NCEI accession number. If the document doesn't cover everything you need help with, please feel free to reach out to me for clarification!
Reply: I'll explore this document and come back to you when we get a full year of data.

Thanks for all your assistance with this submission.

Best regards.