Open-source projects: (49)

    Project name: Apache Falcon
    Project description: [apache] Apache Falcon is a data processing and management solution for Hadoop designed for data motion, coordination of data pipelines, lifecycle management, and data discovery. Falcon enables end consumers to quickly onboard their data and its associated...

    Project name: Apache Hadoop
    Project description: [apache] Hadoop is a distributed computing platform. This includes the Hadoop Distributed Filesystem (HDFS) and an implementation of MapReduce. [openhub] Hadoop is a framework for running applications on large clusters of commodity hardware. The Hadoop f...

    Project name: Apache Tajo
    Project description: [apache] The main goal of the Apache Tajo project is to build an advanced open source data warehouse system in Hadoop for processing web-scale data sets. Tajo provides standard SQL as its query language. Tajo is designed for both interactive and batch...

    Project name: Apache HBase
    Project description: [apache] Use Apache HBase software when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. HBase is an open...

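    Project name: Nanocubes
    Project description: [oschina] Nanocubes is a big-data visualization tool: 32Tb of Twitter data can be visualized smoothly and interactively on a machine with 16GB of memory. To run Nanocubes you need a browser that supports WebGL; it has been tested successfully on Chrome and Firefox, but development is done mainly on Chrome. Brightkite: 4.5M checkins from the BrightKite social network, from the SNAP repository (tablet-friend...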