Data mining middleware for wide-area high-performance networks

In this paper, we describe two distributed, data intensive applications that were demonstrated at iGrid 2005 (iGrid Demonstration US109 and iGrid Demonstration US121). One involves transporting astronomical data from the Sloan Digital Sky Survey (SDSS) and the other involves computing histograms fro...

Full description

Saved in:
Bibliographic Details
Published in:Future generation computer systems 2006-10, Vol.22 (8), p.940-948
Main Authors: Grossman, Robert L., Gu, Yunhong, Hanley, David, Sabala, Michal, Mambretti, Joe, Szalay, Alex, Thakar, Ani, Kumazoe, Kazumi, Yuji, Oie, Lee, Minsun, Kwon, Yoonjoo, Seok, Woojin
Format: Article
Language:eng
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In this paper, we describe two distributed, data intensive applications that were demonstrated at iGrid 2005 (iGrid Demonstration US109 and iGrid Demonstration US121). One involves transporting astronomical data from the Sloan Digital Sky Survey (SDSS) and the other involves computing histograms from multiple high-volume data streams. Both rely on newly developed data transport and data mining middleware. Specifically, we describe a new version of the UDT network protocol called Composible-UDT, a file transfer utility based upon UDT called UDT-Gateway, and an application for building histograms on high-volume data flows called BESH (for Best Effort Streaming Histogram). For both demonstrations, we include a summary of the experimental studies performed at iGrid 2005.
ISSN:0167-739X
1872-7115