
DEbrief
211 FOLLOWERS
News and discussions about data engineering.
DEbrief
6M ago
This episode was delayed due to ongiong situation in Ukraine. Thank for understanding.
Hot updates
Pulsar is updated
Apache Kylin "Extreme OLAP Engine for Big Data"
Three main versions: 2.4, 3.0 and most recent is 4.0.1 v4 released in the autumn 2021
Brings OLAP back to data
Been around since 2015, brought to you by eBAY
Not a friend to HBase, but likes parquet
Web Interface for all data steps
Official python client
with pandas support
Ambari is killed (put in the attic)
Apache Hop 1.1
https://www.leanwithdata.com/blog/2022/02/hop-1.1.0/
At January, 18th graduated from Incubator
Apache Ho ..read more
DEbrief
6M ago
Hot updates
dbt 1.0.0 released
dbt is gaining popularity
Great instrument which solves really existing problem
RedisJSON is out for public preview https://redis.com/blog/redisjson-public-preview-performance-benchmarking/
Need to have Redis 6.x or later probably a good point to talk once again that
RedisJSON* is faster than MongoDB and ElasticSearch on direct read, write, and update workloads.
available in Redis Cloud or you can always buuild it yourself
Basically a bunch of JSON commands for "native" json experience:
JSON.SET
JSON.GET
JSON.NUMINCRBY
Client libraries for Go/Node.js/Pyt ..read more
DEbrief
6M ago
A few hot updates
Apache Geode 1.12.5
enterprise edition is known as gemfire
geodistributed storage
has native clients in Java, C#, and C++ (!)
JTA compliant transaction support
Pinot released 0.9.0
Added Segment Merge and Rollup
Rollup is a technique for tree-like groupby example: city, streets, houses
General info about pinot
Made by guys from LinkedIn and Uber has zookeeper as deps
column-oriented database
It's an OLAP tool for real-time analytics
there are BI tools focused on dashboards and reports used by analists etc
this is more for data exploration for de / ds folks
Near real ..read more