-
1.
1.1
1.1.1
1.1.2
1.1.3
1.2
2.
2.1
2.2
2.3
2.3.1
2.3.3
2.3.4
2.3.5
2.3.6
3.
3.1
3.1.1
3.1.2
3.1.3
3.1.4
3.1.5
3.1.6
3.1.6.1
3.1.6.2
3.1.6.3
3.1.6.4
3.2
3.2.1
3.2.2
3.2.3
3.2.4
3.2.4.1
3.2.4.2
3.2.4.3
3.2.4.4
3.2.4.5
3.2.4.6
3.2.4.7
3.2.5
3.2.6
3.2.7
3.2.8
3.2.9
3.2.10
3.2.11
3.2.11.1 Shell
3.2.11.2
3.2.11.3 Hive
3.2.11.4 SparkSQL
3.2.11.5 Python
3.2.11.6 PySpark
3.2.11.7 Spark
3.2.11.8 Presto
3.2.11.9 FlinkSQL
3.2.11.10 Flink
3.3
3.3.1
3.3.2
3.3.2.1
3.3.2.2
3.3.2.3
3.3.2.4
3.3.3
3.3.4
3.3.4.1
3.3.4.2
3.3.4.3
3.3.5
3.3.5.1
3.3.5.2
3.3.5.3
3.3.5.4
3.3.5.5
3.3.5.6
3.3.6
4.
4.1
4.2
4.3
4.3.1 (DIM)
4.3.2 (ODS)
4.3.3 (DWD)
4.3.4 (TDM)
4.3.5 (ADM)
4.3.6
4.4
4.5
4.6
4.6.1
4.6.2
4.6.3
5. (FAQ)
5.1
5.2
5.3
5.4 SQL
5.5
5.6
5.7
5.8
6.
: 2018-06-19
1.
1.1
HadoopSparkFlinkPresto
PB“"
1.1.1
1.1.2
1.1.3
1.2
(Project)
estatefinance
: shuxi_demo
HiveYarn
Hadoop
shuxi_demo(shuxi_demo_dev)
(shuxi_demo_prd)
(Flow)
(DAG)
(Task)ShellHiveSparkPrestoFlink11
(Resource)
: jartxtpython
(Function)HiveSparkPrestoFlink
Hive(User Defined FunctionUDF)
HiveSparkSQL
(Instance)
(Waiting)(Running)(Finished)
AID: T_630_20180301115903046_1
T_630_20180301120009801_1
2.
quick_start
Hive
2.1
1. (www.dtwave.com)
2.
shuxi_demo
3.
2-1-1
4.
2-1-2
1
2.2
1
2.3
2-3-1
2.3.1
2-3-2
: ID","
student_info.txtstudent_info.txt
1,,23,50
2,,25,60
3,,22,55
4,,21,50
5,,22,56
6,,23,51
2.3.3
1.
:
quick_start2-3-4
2.
2-3-4
quick_start student_info
txtstudent_info.txt
2-3-5:
2.3.4
1.
2-3-5
“+”
quick_start