Hive Impala Presto, comparison of functionality

√: Yes; ×: No; Blue: The main differences between impala and presto

hive 0.11 impala 1.1.1 presto 0.52
Implement: java c++ backend
java frontend
java
DataType:
integer
string
floating point
timestamp
uniontype × ×
DDL:
create/alter/drop table ×
create view ×
truncate table × ×
desc
create index × ×
DML:
load data ×
insert ×
explain
tablesample × ×
group by
order by
having
limit
inner/left/right/full join (no full join)
union
Sub Queries
With
Lateral View × ×
Function:
UDF × ×
Mathematical Functions √31 √30 √31
String Functions √37 √22 √15
Date and Time Functions √17 √18 √26
Regex
Type Conversion Functions ×
Conditional Functions
Aggregate Functions √19 only 5 √16
Windowing ×
Distinct
Url
Json ×

功能上,Presto和Impala有几个不同的地方,也有和Impala相同的一些缺陷:
1. Presto完全没有数据写入的功能,不能建表、建视图、导数据,只能做select查询操作。
2. Presto不支持Full Outer Join。
3. Presto和Impala一样,都不支持UDF。
4. Presto支持窗口函数,而Impala不支持,不过Impala的开发团队号称明年推出基础的rank over partition 等功能。
5. Presto支持较多的聚合函数,而Impala目前只支持5个非常基础的聚合函数:MAX(), MIN(), SUM(), AVG(), and COUNT()。


Data Warehouse For Ever原创文章,转载请注明出处

  1. 博主,请问一下DML中With是指什么操作? :?:

Leave a Comment


NOTE - You can use these HTML tags and attributes:
<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>