src/www/templates/index.html
{% extends "base.html" %}
{% block content %}
{% load staticfiles %}
<main class="bs-docs-masthead" id="content" role="main">
<div class="container">
<p class="lead">
<span class="logo-home center-block"></span>
<span class="logo-text"><span class="logo-first-half">Data</span>Hub</span>
<br>
A Data Ecosystem for Individuals, Teams and People
</p>
<!-- <p class="lead">
<span class="logo-text"></span>
<span><span class="logo-home center-block"></span>
<span class="logo-text" style="top:50%; right:50%"><span style="color:#00aeef">Data</span><span>Hub</span></span></span>
<br />
A Data Ecosystem for Individuals, Teams and People
</p> -->
</div>
</main>
<br />
<div class="container bs-docs-container">
<div class="row">
<div class="col-md-9" role="main">
<!-- Overview
================================================== -->
<div class="bs-docs-section">
<h1 id="overview" class="page-header">What is DataHub?</h1>
<h3><small>For End Users:</small></h3>
<ul>
<li>A way to store your data centrally, without having to set up your own database</li>
<li>A way to collaborate with others</li>
<li>A way to seamlessly share your data with friends and colleagues</li>
<li>A suite of tools to process your data</li>
</ul>
<h3><small>For Developers:</small></h3>
<ul>
<li>A database-agnostic, language-agnostic, <a href="https://github.com/mitreid-connect/">MITREid (OpenID + OAuth2)</a> integrated platform for your mobile and web apps</li>
<li>A web client for easy manipulation of your data</li>
<li>An application ecosystem for data processing, including ingestion, curation, integration, discovery, query, analytics, visualization, and machine learning</li>
<li>A restful API</li>
<li>An MIT licensed open source project from <a href="http://csail.mit.edu">MIT CSAIL's</a> <a href="http://livinglab.mit.edu">Living Lab</a></li>
</ul>
</div>
<!-- Reading Materials
================================================== -->
<div class="bs-docs-section">
<h1 id="all-publications" class="page-header">Publications</h1>
<h3 id="papers"><small>Papers</small></h3>
<ul>
<li>
<a href="https://dl.acm.org/citation.cfm?id=2947619">Decibel: the relational dataset branching system</a> [PVLDB, 2016]
</li>
<li>
<a href="https://dl.acm.org/citation.cfm?id=2814584">Towards a Unified Query Language For Provenance and Versioning</a> [PUSENIX, 2015]
</li>
<li>
<a href="https://dl.acm.org/citation.cfm?id=2824035">Principles of dataset versioning: exploring the recreation/storage tradeoff</a> [PVLDB, 2015]
</li>
<li>
<a href="www-cs-students.stanford.edu/~adityagp/papers/datahubdemo.pdf">Collaborative Data Analytics with DataHub</a> [PVLDB, 2015]
</li>
<!--
<li>
<a href="{% static "www/papers/datahub-generic.pdf" %}">DataHub: A Collaborative Dataset Management Platform</a> [Technical Report, 2015]
</li>
-->
<li>
<a href="http://db.csail.mit.edu/pubs/datahubcidr.pdf">
DataHub: Collaborative Data Science & Dataset Version Management at Scale
</a>
[Paper, CIDR 2015]
</li>
<!-- <li>
<a href="http://people.csail.mit.edu/anantb/public/docs/research/datahub/datahub-nedb.pdf">DataHub: A hosted platform for organizing, managing, sharing, collaborating, and processing data
</a>
[Technical Report]
</li> -->
</ul>
<h3 id="talks"><small>Talks</small></h3>
<ul>
<li>
<a href="http://research.microsoft.com/apps/video/default.aspx?id=238430">
DataHub: A hosted platform for organizing, managing, sharing, collaborating, and processing data
</a>
[Talk by Anant Bhardwaj, Microsoft Research, 2015] [<a href="http://research.microsoft.com/apps/video/default.aspx?id=238430">Video</a> | <a href="http://nwds.cs.washington.edu/files/nwds/pdf/anantb-datahub-talk.pdf">Slides</a>]
</li>
<li>
<a href="http://db.cs.washington.edu/nwds/past_talks.html#anant_bhardwaj_01_19_15">
DataHub: A hosted platform for organizing, managing, sharing, collaborating, and processing data
</a>
[Talk by Anant Bhardwaj, University of Washington, 2015] [<a href="http://db.cs.washington.edu/nwds/past_talks.html#anant_bhardwaj_01_19_15">Abstract</a> | <a href="http://nwds.cs.washington.edu/files/nwds/pdf/anantb-datahub-talk.pdf">Slides</a>]
</li>
<li>
<a href="http://www.cse.iitb.ac.in/page20?talkdetails=836">
DataHub: A hosted platform for organizing, managing, sharing, collaborating, and processing data
</a>
[Talk by Anant Bhardwaj, IIT Bombay, 2015] [<a href="http://www.cse.iitb.ac.in/page20?talkdetails=836">Abstract</a> | <a href="http://nwds.cs.washington.edu/files/nwds/pdf/anantb-datahub-talk.pdf">Slides</a>]
</li>
<li>
<a href="http://www.cidrdb.org/cidr2015/Slides/18_CIDR15_Slides_Paper18.pdf">
DataHub: Collaborative Data Science & Dataset Version Management at Scale
</a>
[Talk by Aditya Parmeshwaran, CIDR 2015] [<a href="http://www.cidrdb.org/cidr2015/Slides/18_CIDR15_Slides_Paper18.pdf">Slides</a>]
</li>
<li>
<a href="http://www.gbcacm.org/seminars/evening/2014/datahub-collaborative-data-analytics-and-visualization-platform.html">
DataHub: A Collaborative Data Analytics and Visualization Platform
</a>
[Talk by Sam Madden, Greater Boston Chapter of the ACM, 2014]
</li>
<li>
<a href="http://www.vldb.org/2013/keynotes.html">
The DataHub: A Collaborative Data Analytics and Visualization Platform
</a>
[Keynote by Sam Madden, VLDB 2013]
</li>
</ul>
<h3 id="blogs"><small>Blogs</small></h3>
<ul>
<li>
<a href="https://www.cs.umd.edu/~amol/DBGroup/2015/06/26/datahub.html">Why Git and SVN Fail at Managing Dataset Versions</a>
</li>
<li>
<a href="http://istc-bigdata.org/index.php/beyond-data-lakes-the-datahub/">
Beyond Data Lakes: The DataHub
</a>
[ISTC Blog, Oct 2014]
</li>
<li>
<a href="http://istc-bigdata.org/index.php/datahub-a-hosted-data-platform-for-large-scale-analytics-in-process-of-being-deployed-at-mit/">
DataHub, A Hosted Data Platform for Large-Scale Analytics, in Process of Being Deployed at MIT
</a>
[ISTC Blog, Oct 2013]
</li>
</ul>
</div>
<!-- Team
================================================== -->
<div class="bs-docs-section">
<h1 id="team" class="page-header">Team</h1>
<p>DataHub is hosted at MIT Computer Science & Artificial Intelligence Lab (CSAIL) with collaborators from the University of Maryland - College Park, and the University of Illinois at Urbana-Champaign.</p>
<h3><small>Contributors:</small></h3>
<dl class="dl-horizontal">
<dt><a class="team-member" href="http://people.csail.mit.edu/anantb/"> Anant Bhardwaj</a></dt>
<dd>PhD Student, MIT CSAIL</dd>
<dt><a class="team-member" href="http://people.cs.uchicago.edu/~aelmore/"> Aaron Elmore</a></dt>
<dd>Asst. Professor, University of Chicago</dd>
<dt><a class="team-member" href="http://db.lcs.mit.edu/madden/"> Sam Madden</a></dt>
<dd>Professor, MIT CSAIL</dd>
<dt><a class="team-member" href="http://people.csail.mit.edu/karger/"> David Karger</a></dt>
<dd>Professor, MIT CSAIL</dd>
<dt><a class="team-member" href="http://web.engr.illinois.edu/~adityagp/">Aditya Parameswaran</a></dt>
<dd>Asst. Professor, UIUC</dd>
<dt><a class="team-member" href="http://www.cs.umd.edu/~amol/">Amol Deshpande</a></dt>
<dd>Assoc. Professor, UMD</dd>
<dt><a class="team-member" href="#"> Michael Maddox</a></dt>
<dd>PhD Student, MIT CSAIL</dd>
<dt><a class="team-member" href="#"> David Goehring</a></dt>
<dd>PhD Student, MIT CSAIL</dd>
<dt><a class="team-member" href="https://www.cs.umd.edu/~amitc/"> Amit Chavan</a></dt>
<dd>PhD Student, UMD</dd>
<dt><a class="team-member" href="https://www.cs.umd.edu/~bsouvik/">Souvik Bhattacherjee</a></dt>
<dd> PhD Student, UMD</dd>
<dt><a class="team-member" href="#">Silu Huang</a></dt>
<dd>PhD Student, UIUC</dd>
<dt><a class="team-member" mailto="sbuckley@mit.edu"> Stephen C. Buckley</a></dt>
<dd>Executive Director, MIT Big Data Initiative</dd>
<dt><a class="team-member" href="https://github.com/justinanderson"> Justin Anderson</a></dt>
<dd>Programmer, MIT Big Data Initiative</dd>
<dt><a class="team-member" href="https://github.com/RogerTangos"> Albert Carter</a></dt>
<dd>Programmer, MIT Big Data Initiative</dd>
<dt><a class="team-member" href="https://github.com/DNSServer"> Denis Babani</a></dt>
<dd>Volunteer Programmer, MIT Big Data Initiative</dd>
</dl>
<h3><small>Alumni:</small></h3>
<dl class="dl-horizontal">
<dt><a class="team-member" href="http://bigdata.csail.mit.edu/user/9">Elizabeth Bruce</a></dt>
<dd>Former Executive Director, MIT Big Data Initiative</dd>
<dt><a class="team-member" href="http://www.mit.edu/~eugenewu/">Eugene Wu</a></dt>
<dd>Asst. Professor, Columbia University</dd>
</dl>
</div>
<div class="bs-docs-section">
<h1 id="resources" class="page-header">Resources</h1>
<dl class="dl-horizontal">
<dt><a href="{% static 'docs/html/index.html' %}">Documentation</a></dt>
<dd>API documentation</dd>
<dt><a href="https://github.com/datahuborg/datahub/tree/master/src/examples">Example Code</a></dt>
<dd>sample code in various programming languages including C++, Java, Go, Python, and JavaScript</dd>
<dt><a href="https://github.com/datahuborg/datahub">GitHub Repo</a></dt>
<dd>the DataHub source code repository on GitHub</dd>
</dl>
</div>
<div class="bs-docs-section">
<h1 id="acknowledgements" class="page-header">Acknowledgements</h1>
<p>This research is funded by NSF under grants 1513972, 1513407, 1513443, by British Telecom, by EMC, and by Intel Science and Technology Center for Big Data</p>
</div>
<!-- Contact -->
<div class="bs-docs-section">
<h1 id="contact" class="page-header">Contact</h1>
<p>Please email <a href="mailto:sbuckley@mit.edu">sbuckley@mit.edu</a> or subscribe to our <a href="https://mailman.mit.edu/mailman/listinfo/datahub">discussion list</a>.</p>
</div>
<!-- /Contact -->
</div>
<div class="col-md-3">
<div class="bs-docs-sidebar hidden-print hidden-xs hidden-sm" role="complementary">
<ul class="nav bs-docs-sidenav">
<li><a href="#overview">What is DataHub?</a></li>
<li><a href="#papers">Papers</a></li>
<li><a href="#talks">Talks</a></li>
<li><a href="#blogs">Blogs</a></li>
<li><a href="#team">Team</a></li>
<li><a href="#resources">Resources</a></li>
<li><a href="#acknowledgements">Acknowledgements</a></li>
<li><a href="#contact">Contact</a></li>
</ul>
</div>
</div>
</div>
</div>
{% endblock %}