Sharing Large Data Files

10/29/2003

Joe's back! While away, he had an aaaHHHAAA!!! moment, similar to his first browsing experience, and thinks he may just have spotted the "Internet2 Killer App." Read on to see why.

------------------------------------------

by Joe St. Sauver
University of Oregon Computing Center


This year's Fall Internet2 Member Meeting took place in Indianapolis, from October 12th to 17th. Besides being a nice opportunity to learn what's been going on in the Internet2 community (while also providing a chance to hash out issues with colleagues from other I2 schools face-to-face over a beer and a bowl of Cincinnati-style 3-, 4-, or 5-way chili), the I2 Member Meeting included the widely overlooked announcement of what may very well be the long-awaited "Internet2 killer app," a program from the University of Tennessee Knoxville Computer Science Department that goes by the somewhat odd name of LoRS, part of the LoCI project.

Watching the LoRS demo at the Indy I2 meeting gave me the same sort of "aaaHHHAAA!!!" moment that I recall from when I first saw someone use an early version of Netscape to access a simple Web page: clearly, here was something that's going to profoundly change the way we do things online.

If you happened to attend the LoRS session like I did, then you had the chance to see an application that satisfies a fundamental need, much in the way that e-mail or the Web does. The need that LoRS satisfies is the need to be able to efficiently distribute large files: files that are too big to conveniently send by e-mail, files too big to conveniently download via the Web. You know the sort of files I'm talking about - large multi-gigabyte (or even multi-terabyte) experimental physics datasets, or CD-sized Linux ISOs, or those wonderful multi-hundred megabyte PowerPoint marketing presentations we all so love.

Yes, I know: people currently do move large (if not huge) files all the time via ftp, or chopped into digestible chunks via e-mail, or via Web pages.

Unfortunately, when folks move files using traditional tools, they don't tend to get very good network throughput, even over well-engineered, high-capacity, lightly loaded networks like Internet2's Abilene. (For example, the median throughput on Abilene for bulk file transfers is still less than 2.5Mbps; see Table 1 in the I2 weekly NetFlow report.) One reason you're not seeing experimental physicists with fast Ethernet connections routinely saturating 100Mbps links is no mystery: it is simply a manifestation of our old friend, the TCP bandwidth-delay product, and its negative impact on the throughput of untuned, single-threaded network applications. (For a nice discussion of this, see: http://www.psc.edu/networking/perf_tune.html.)
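To put rough numbers on the bandwidth-delay product, here is a minimal Python sketch. It relies on the standard rule of thumb that a single TCP stream can have at most one window of data in flight per round trip. The 64 KB window and 70 ms round-trip time are illustrative assumptions, not measurements from Abilene, and the function names are made up for this example.

```python
def max_tcp_throughput_bps(window_bytes: int, rtt_seconds: float) -> float:
    """Upper bound on single-stream TCP throughput: one window per round trip."""
    if rtt_seconds <= 0:
        raise ValueError("rtt_seconds must be positive")
    return window_bytes * 8 / rtt_seconds


def bandwidth_delay_product_bytes(link_bps: float, rtt_seconds: float) -> float:
    """Bytes that must be in flight to keep a link of the given rate full."""
    if rtt_seconds <= 0:
        raise ValueError("rtt_seconds must be positive")
    return link_bps * rtt_seconds / 8


if __name__ == "__main__":
    rtt = 0.070          # assumed: 70 ms coast-to-coast round-trip time
    window = 64 * 1024   # assumed: 64 KB untuned TCP window
    link = 100e6         # fast Ethernet, 100 Mbps

    ceiling = max_tcp_throughput_bps(window, rtt)
    needed = bandwidth_delay_product_bytes(link, rtt)

    print(f"Ceiling with a {window} byte window: {ceiling / 1e6:.2f} Mbps")
    print(f"Window needed to fill the link: {needed / 1024:.0f} KB")
```

Under these assumptions a single untuned stream tops out well below 10 Mbps no matter how fast the link is, and filling a 100 Mbps path would take a window of several hundred kilobytes. Smaller windows, longer paths, or any packet loss push the ceiling lower still.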

Serving large files from a single location, or even from a comparatively small set of distributed mirrors, also doesn't scale very well. Ask anyone who hosts a Linux distribution mirror what they see when a Linux distributor kicks a new release out the door!


