Tue 28 Mar 2006
O’Reilly’s Ambient Findability (Peter Morville) provided a broad contemporary overview of data and information issues. Here are three recent publications relevant to today’s data environments:
-A Nature journal issue with Microsoft on 2020 Vision
http://research.microsoft.com/towards2020science/nature.htm;
http://www.nature.com/nature/journal/v440/n7083/index.html
-The term “dataspace” creates a wider-than-one-system conceptual umbrella for collections
MFranklin,. AHalevy, and DMaier, 2005.
From Databases to Dataspaces: A New Abstraction for Information Management,
ACM SIGMOD Record;
http://portal.acm.org/citation.cfm?id=1107499.1107502
-A look at the handling large amounts of streamed data - historic and real-time
Schandrasekaran and MFranklin. 2004.
Remembrance of Streams Past: Overload-Sensitive Management of Archived Streams.
Proceedings of the 30th VLDB Conference, Toronto, Canada
http://www.cs.berkeley.edu/~franklin/Papers/ChandrasekaranVLDB2004.pdf
One Response to “Data environment reads”
Leave a Reply
You must be logged in to post a comment.


April 5th, 2006 at 3:28 pm
The dataspaces paper was an exciting validation of the way it looks like our data management system will have to be set up, unfortunately, I’m left feeling like it created more questions than it answered. First and foremost would be the question of how. How do we, or how have others, set up the broad spectrum querying tools, or implement an index of multiple database tables and flat files? I think that what I really wanted from this was a real world example. Not a cursory overview of a potentially applicable example, but a well documented case study in the sciences. Short of that, I’m thinking that it might be beneficial to start looking into technologies like spotlight and google desktop search for insight into the methods they’ve used for indexing and searching heterogeneous data models.
Any ideas on websites or papers that might further develop the ideas set forth here?