SQL layers on NoSQL databases

What are the SQL layer solution over NoSQL databases such as key/value stores?

Phoenix: A SQL layer on HBase:

They also show some performance results:

https://github.com/forcedotcom/phoenix/wiki/Performance

F1 – The Fault-Tolerant Distributed RDBMS Supporting Google’s Ad Business:

http://research.google.com/pubs/pub38125.html

With F1, we have built a novel hybrid system that combines the
scalability, fault tolerance, transparent sharding, and cost beneﬁts
so far available only in “NoSQL” systems with the usability,
familiarity, and transactional guarantees expected from an RDBMS.

Tenzing A SQL Implementation On The MapReduce Framework:

http://research.google.com/pubs/pub37200.html

Tenzing is a query engine built on top of MapReduce for ad hoc
analysis of Google data. Tenzing supports a mostly complete SQL
implementation (with several extensions) combined with several key
characteristics such as heterogeneity, high performance, scalability,
reliability, metadata awareness, low latency, support for columnar
storage and structured data, and easy extensibility. Tenzing is
currently used internally at Google by 1000+ employees and serves
10000+ queries per day over 1.5 petabytes of compressed data. In this
paper, we describe the architecture and implementation of Tenzing, and
present benchmarks of typical analytical queries.

HAWQ from EMC:

http://www.emc.com/about/news/press/2013/20130225-04.htm

HAWQ (pronounced hawk) represents the EMC Greenplum engineering effort
that brings 10 years of large-scale data management research and
development to the Apache Hadoop framework. Leveraging the feature
richness and maturity of the industry leading Greenplum MPP analytical
database, this innovation has resulted in the world’s first true SQL
parallel database on top of the Hadoop Distributed File System (HDFS).

http://www.theregister.co.uk/2013/02/25/emc_pivotal_hd_hadoop_hawq_database/

Project Hawq, the SQL database layer that rides atop of HDFS rather
than trying to replace it with a NoSQL data store

Apache Hive: http://hive.apache.org/

It defines a SQL-like language called HiveQL.

Stinger Initiative: Making Apache Hive 100 Times Faster: http://hortonworks.com/blog/100x-faster-hive/

Cloudera Impala

http://blog.cloudera.com/blog/2012/10/cloudera-impala-real-time-queries-in-apache-hadoop-for-real/

Source code:

https://github.com/cloudera/impala

it uses the same metadata, SQL syntax (Hive SQL), ODBC driver and user
interface (Hue Beeswax) as Apache Hive, providing a familiar and
unified platform for batch-oriented or real-time queries.

Spire:

Home: https://drawntoscalehq.com/

Spire is the first SQL database for large, user-facing applications
built on Hadoop. Spire is built to power large-scale websites, mobile
apps, and machine-to-machine data.

Unlike any other Hadoop and SQL solution, Spire scales to tens of
thousands of reads and writes per second, with full ANSI SQL and
intuitive management tools.

Architecturally similar to Google F1, Spire makes it simple to build
applications for the Big Data Era.

Hadapt: http://hadapt.com/

Hadapt unifies SQL and Hadoop, enabling customers to analyze all of their data (structured, unstructured, and multi-structured) in a single platform – no connectors, complexities, or rigid structure.

How to use encfs on Windows 10?

ByQ A Mar 24, 2018Nov 22, 2019

I am happy using encfs on Linux. But how to use encfs on Windows 10? I would suggest EncFS MP. It support Encfs on Windows. Features of EncFSMP: Mounts EncFS folders on Windows and OS X Can create, edit, export and change the password of EncFS folders Is 100% compatible with EncFS 1.7.4 on Linux…

How to print the name of the current file being edited in Emacs?

ByEric Ma Mar 24, 2018Mar 24, 2018

In Emacs, how to print the name of the current file that I am editing? The built-in function buffer-file-name gives the full path of your file. To get the file name: M-: buffer-file-name Read more: How to merge a commit from another branch to my current branch in git? Linux Kernel: xt_quota: report initial quota…

How to play .swf files on Linux?

ByEric Ma Mar 24, 2018Mar 24, 2018

How to play the flash (.swf) files downloaded from the Web on Linux? The best solution that I find to play .swf files on Linux is run Adobe Flash Player Projector (download from here, a single .EXE file) and run it under wine. Read more: How to Get Rid of DTS/AC3 Audio using ffmpeg on…

QA | Tutorial

How to enable user themes in Ubuntu 18.04?

ByQ A Sep 14, 2018Nov 22, 2019

The way for Ubuntu 17 to installing the gnome-shell-extensions package does not work any more for Ubuntu 18.04. How to enable user themes in Ubuntu 18.04? The updated gnome-shell-extensions package actually adds the User Theme extension back. You can use that. First, install the package sudo apt install gnome-shell-extensions Second, log out and login again…

Redis Architecture, consistency model, etc.

ByQ A Mar 24, 2018Jun 26, 2018

Technical discussions on Redis. Redis internal documentation: http://redis.io/topics/internals Redis manifesto, the philosophy behind Redis: http://oldblog.antirez.com/post/redis-manifesto.html Redis Architecture: Overview Of Redis Architecture Redis data model and eventual consistency: http://antirez.com/news/36 Read more: Linear Consistency Model for Computer Systems What are the differences between NUMA architecture and SMP architecture? Linux boots failed with “sulogin: can not open password…

How to convert .pptx slides to .jpg or .png images on Linux in command line?

ByEric Ma Mar 24, 2018Mar 24, 2018

How to convert .pptx slides to .jpg or .png images on Linux in command line? This following method works best for me. First, convert .pptx file to .pdf using libreoffice: libreoffice –headless –convert-to pdf file.pptx –headless makes libreoffice run in batch mode and not start the GUI. The pdf file will be named file.pdf by…

Similar Posts

Leave a Reply Cancel reply