An informatics seminar was given by Roy Mendelssohn, Env Research Division, SWFisheries Center, Pacific Grove on 18August05 at the NOAA Southwest Fisheries Science Center conference room titled “Data Integration and Interoperability in PACOOS, in NOAA, in IOOS and Beyond”. From the perspective of a domain scientist engaged with data analysis work and participating in ongoing national data committees, he provided insights drawing on recent experience putting together a working data system model in a short period of time.

Some informatics extensibility todo’s
-make rdb spatially enabled
-make metadata fgdc compliant
-make semantic & syntactic metadata conform to a given std
-take steps now to ease implementation/federation
-participate so to influence final requirements

Some unresolved issues
-how to stop people from misusing data
-how to balance serving products versus data
-how to balance data release versus data quality
-where do heavy lifting so to build in flexibility: local system or transport layer

Some notes and notions
-renaming: NMF->NOAA Fisheries->National Fisheries Service -> One NOAA
-DB systems have data and metadata separated while netcdf/hsd are self-describing file formats that have them reside together.
-data redundency as strategy for serving data in different ways
-dimension information; HDF not allow you to share dimensions in viewing cruise data but must choose view, ie all data at one station or look across stations

Three taking-back steps to avoid
-focusing on maps (describing) rather than data (analyzing)
-focusing on 2D (GIS) rather than 4D (netcdf space and time)
-using data strctures that have no scientific meaning (ie polygon or vector in GIS where no one collects a polygon)

Interoperability involving 3 interrelated issues:
-have the DATA
-describe the data with METADATA
-produce something that works DATA TRANSPORT

Since categories matter, note that DMAC has 6 categories of expert teams
http://dmac.ocean.us/dacsc/about_steering.jsp

  • standards process
  • archive
  • sys eng/enterprise architecture
  • modeling
  • metadata & discovery
  • data transport and access