, "price" -> "B.price", "state" -> "B.state").asInstanceOf[Map[Any, Any]] dwSelframe.merge(odsframe, col("A.id").equalTo(col("B.id"))).whenMatched( col("A.state") =!= col("B.state")).updateExpr(updateMap).execute()
For more details on usage, see:
https://github.com/apache/carbondata/blob/master/examples/spark/src/main/scala/org/apache/carbondata/examples/CDCExample.scala
7. Support Flink streaming write to CarbonData
Use Case: CarbonData needs to be integrated with fault-tolerant streaming dataflow engines like Apache Flink. Users can build a Flink streaming job and use a Flink sink to write data to CarbonData through the Carbon SDK. The Flink sink generates table stage files; the data in these stage files can then be inserted into the carbon table with the carbon INSERT STAGE command, which makes it visible to queries (see the sketch after the example below).
Example:
spark.sql(""" CREATE TABLE test_flink (stringField string, intField int, shortField short) STORED AS carbondata """) // create flink streaming environment StreamExecutionEnvironment environment = StreamExecutionEnvironment.getExecutionEnvironment() environment.setParallelism(1) environment.enableCheckpointing(2000L) environment.setRestartStrategy(RestartStrategies.noRestart()) DataStreamSource<OUT> stream = environment.addSource(////DataSource like Kafka/////) // create carbon sdk writer factory with LOCAL/S3/OBS builder CarbonWriterFactory factory = CarbonWriterFactory.builder("Local").build(dbName,tableName,tablePath, tableProperties,writerProperties,carbonProperties) // create stream sink and add it to stream StreamingFileSink<IN> streamSink = StreamingFileSink.forBulkFormat(new Path(ProxyFileSystem.DEFAULT_URI), factory).build() stream.addSink(streamSink) // execute flink streaming job which generate’s stage files environment.execute()
For more details on usage, see:
https://github.com/apache/carbondata/blob/master/examples/flink/src/main/scala/org/apache/carbondata/examples/FlinkExample.scala
8. Add segment