2012年7月15日星期日

How Will SQL 2012 Integrate With Hadoop?

Microsoft plans to integrate the "Big Data" capabilities of Hadoop into SQL 2012 by mid-2012. Microsoft hopes that Hadoop's capability to process information sets also large and unstructured for classic implies (such as SQL 2012) will give SQL 2012 the power to manage data of all sizes and sorts, no matter if stored on one particular database or across dozens of laptop or computer clusters. In short, Microsoft believes that integration will address the weaknesses of each database management systems when giving consumers a purpose to purchase SQL 2012.
SQL 2012 is a relational database management program (RDBMS) designed to use tables to shop and present information or to show relationships among information sets. It functions mainly as a means to mine homogeneous information on a small-to-medium scale, not as a means to analyze daunting amounts of facts spread out more than a large number of computers. SQL 2012's table presentation fails when applied to discordant, unrelated information. Even so, SQL's interface and implementation is typically viewed as far more user-friendly and much easier to learn than that of Hadoop's MapReduce. Specifically, Microsoft will use Hadoop Connectors for SQL Server along with the SQL Server Parallel Information Warehouse to permit information to become transferred from Hadoop into SQL. Customers will then windows 7 anytime upgrade key advantage from employing SQL's relatively easy tools to analyze "big information."

Hadoop has emerged as 1 with the preeminent platforms when coping with large-scale information evaluation. Capable of storing, analyzing, and presenting discordant information on a petabyte scale via Hadoop's MapReduce software program, the platform nonetheless contains drawbacks. MapReduce demands a moderate degree of Javascripting to use correctly, and its time-intensive queries have so far created it inadequate for small-scale data mining.

With open-source and qualified third-party assistance, it can be doable Microsoft will succeed in its vision of merging the strengths of each applications while eliminating their weaknesses. As it is in Microsoft's finest business interests to accommodate developer-level customers too as less-qualified users, a single can anticipate they are going to strive to provide a comparatively friendly interface and shallower understanding curve.

Microsoft has partnered with Hadoop community partners like Cloudera, Hortonworks, and open-source developers to address MapReduce's issues of speed and efficiency. The new Hadoop v0.23 and subsequent releases will enhance performance by reducing minimum MapReduce job latency and offer higher-level query interface performance.
Hadoop and SQL's integration will function more elements to secure SQL's functionality as a Company office mac 2011 product key Intelligence platform. Microsoft will include the an open database connectivity (ODBC) driver for Hive to let all Windows applications to run queries against the Hive data warehouse and an Excel Hive Add-in to permit customers to move data straight from Hive into PowerPivot or Excel.

没有评论:

发表评论