000764435 000__ 04348cam\a2200469Mi\4500 000764435 001__ 764435 000764435 005__ 20230306142416.0 000764435 006__ m\\\\\\\\d\\\\\\\\ 000764435 007__ cr\un\nnnunnun 000764435 008__ 161122s2016\\\\xxu\\\\\o\\\\\000\0\eng\d 000764435 019__ $$a963932242$$a966183365$$a966592859 000764435 020__ $$a9781484222478$$q(electronic book) 000764435 020__ $$a1484222474$$q(electronic book) 000764435 020__ $$z1484222466 000764435 020__ $$z9781484222461 000764435 035__ $$aSP(OCoLC)ocn964359292 000764435 035__ $$aSP(OCoLC)964359292$$z(OCoLC)963932242$$z(OCoLC)966183365$$z(OCoLC)966592859 000764435 040__ $$aIDEBK$$beng$$erda$$cIDEBK$$dN$T$$dEBLCP$$dGW5XE$$dYDX 000764435 049__ $$aISEA 000764435 050_4 $$aQA76.73.S67 000764435 050_4 $$aQA75.5-76.95 000764435 08204 $$a005.75/6$$223 000764435 08204 $$a004 000764435 1001_ $$aPal, Sumit. 000764435 24510 $$aSQL on big data :$$btechnology, architecture, and innovation /$$cSumit Pal. 000764435 264_1 $$a[United States] :$$bApress,$$c2016. 000764435 300__ $$a1 online resource. 000764435 336__ $$atext$$btxt$$2rdacontent 000764435 337__ $$acomputer$$bc$$2rdamedia 000764435 338__ $$aonline resource$$bcr$$2rdacarrier 000764435 5050_ $$aAt a Glance; Contents; About the Author; About the Technical Reviewer; Acknowledgments; Introduction; Chapter 1: Why SQL on Big Data?; Why SQL on Big Data?; Why RDBMS Cannot Scale; SQL-on-Big-Data Goals; SQL-on-Big-Data Landscape; Open Source Tools; Apache Drill; Apache Phoenix; Apache Presto; BlinkDB; Impala; Hadapt; Hive; Kylin; Tajo; Spark SQL; Spark SQL with Tachyon; Splice Machine; Trafodion; Commercial Tools; Actian Vector; AtScale; Citus; Greenplum; HAWQ; JethroData; SQLstream; VoltDB; Appliances and Analytic DB Engines; IBM BLU; Microsoft PolyBase; Netezza; Oracle Exadata 000764435 5058_ $$aTeradataVertica; How to Choose an SQL-on-Big-Data Solution; Summary; Chapter 2: SQL-on-Big-Data Challenges & Solutions; Types of SQL; Query Workloads; Types of Data: Structured, Semi-Structured, and Unstructured; Semi-Structured Data; Unstructured Data; How to Implement SQL Engines on Big Data; SQL Engines on Traditional Databases; How an SQL Engine Works in an Analytic Database; Why Is DML Difficult on HDFS?; Challenges to Doing Low-Latency SQL on Big Data; Approaches to Solving SQL on Big Data; Approaches to Reduce Latency on SQL Queries; File Formats; Text/CSV Files; JSON Records 000764435 5058_ $$aAvro FormatSequence Files; RC Files; ORC Files; Parquet Files; How to Choose a File Format?; Data Compression; Indexing, Partitioning, and Bucketing; Why Indexing Is Difficult; Partitioning; Advantages; Limitations; Bucketing; Recommendations; Summary; Chapter 3: Batch SQL-Architecture; Hive; Hive Architecture Deep Dive; How Hive Translates SQL into MR; Hive Query Compiler; Analytic Functions in Hive; Common Real-Life Use Cases of Analytic Functions; TopN; Clickstream Sessionization; Grouping Sets, Cube, and Rollup; ACID Support in Hive; Serialization and SerDe in Hive 000764435 5058_ $$aPerformance Improvements in HiveOptimization by Using a Broadcast Join; Pipelining the Data for Joins; Dynamically Partitioned Joins; Vectorization of Queries; Use of LLAP with Tez; CBO Optimizers; Join Order; Bushy Trees; Table Sizing; Recommendations to Speed Up Hive; Upcoming Features in Hive; Summary; Chapter 4: Interactive SQL-Architecture; Why Is Interactive SQL So Important?; SQL Engines for Interactive Workloads; Spark; Spark Stack; Spark Architecture; Spark SQL; Spark SQL Architecture; Spark SQL Optimization-Catalyst Optimizer; Spark SQL with Tachyon (Alluxio) 000764435 5058_ $$aAnalytic Query Support in Spark SQLGeneral Architecture Pattern; Impala; Impala Architecture; Impala Optimizations; HDFS Caching; File Format Selection; Recommendations to Make Impala Queries Faster; Code Generation; SQL Enhancements and Impala Shortcomings; Apache Drill; Apache Drill Architecture; Key Features; Query Execution; Vertica; Vertica with Hadoop; Hadoop MapReduce Connector; Vertica Hadoop Connector for HDFS; Jethro Data; Others; MPP vs. Batch-Comparisons; Capabilities and Characteristics to Look for in the SQL Engine; Technical Decisions; Soft Decisions; Summary 000764435 506__ $$aAccess limited to authorized users. 000764435 650_0 $$aSQL (Computer program language) 000764435 650_0 $$aBig data. 000764435 852__ $$bebk 000764435 85640 $$3SpringerLink$$uhttps://univsouthin.idm.oclc.org/login?url=http://link.springer.com/10.1007/978-1-4842-2247-8$$zOnline Access$$91397441.1 000764435 909CO $$ooai:library.usi.edu:764435$$pGLOBAL_SET 000764435 980__ $$aEBOOK 000764435 980__ $$aBIB 000764435 982__ $$aEbook 000764435 983__ $$aOnline 000764435 994__ $$a92$$bISE