Reddit - Big Data
Everything big data from storage to predictive analytics.
36m ago
Are you leveraging open source SQL databases in your projects?
Check out the article here to see the options out there: https://www.datacoves.com/post/open-source-databases
Why consider Open Source SQL Databases?
Cost-Effectiveness: Dramatically reduce your system's total cost of ownership.
Flexibility and Customization: Tailor database software to meet your specific requirements.
Robust Community Support: Benefit from rapid updates and a wealth of community-driven enhancements.
Share your experiences or ask questions about integrating these technologies into your tech stack.
1d ago
Hi,
I'm studying big data systems a bit.
I came across this article from 2019, written by the VictoriaMetrics founder, which argues that a WAL (write-ahead log) is a broken and actually inefficient strategy. In short, he says: flush every second in an SSTable format (of your choice), and let background compaction slowly build the data up into decent-sized blocks. He says there are two systems out there using this strategy: VictoriaMetrics (VM) and ClickHouse.
Would love to hear an expert big data take on this.
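To make the idea concrete, here is a minimal sketch of the flush-then-compact pattern described above. It is a toy written for illustration, not how VictoriaMetrics or ClickHouse actually implement it: `TinyStore`, its method names, and the in-memory "parts" (standing in for on-disk SSTable files) are all assumptions. In a real system `flush()` would run on a timer (for example every second) and `compact()` in a background thread.

```python
import heapq
from typing import Dict, List, Optional, Tuple

# A "part" stands in for one immutable SSTable file: rows sorted by key.
Part = List[Tuple[str, int, str]]  # (key, sequence number, value)


class TinyStore:
    """Hypothetical toy store: buffer writes, flush sorted parts, merge later."""

    def __init__(self) -> None:
        self.buffer: Dict[str, Tuple[int, str]] = {}  # unflushed writes, no WAL
        self.parts: List[Part] = []                   # flushed, immutable parts
        self.seq = 0                                  # orders writes globally

    def put(self, key: str, value: str) -> None:
        self.seq += 1
        self.buffer[key] = (self.seq, value)

    def flush(self) -> None:
        """Turn the buffer into one small sorted part (would be a file write)."""
        if not self.buffer:
            return
        part = sorted((k, s, v) for k, (s, v) in self.buffer.items())
        self.parts.append(part)
        self.buffer.clear()

    def compact(self) -> None:
        """Merge all small parts into one larger part, keeping the newest value."""
        if len(self.parts) < 2:
            return
        merged: Part = []
        # heapq.merge streams the already-sorted parts in (key, seq) order,
        # so for a repeated key the row with the highest seq arrives last.
        for key, seq, value in heapq.merge(*self.parts):
            if merged and merged[-1][0] == key:
                merged[-1] = (key, seq, value)
            else:
                merged.append((key, seq, value))
        self.parts = [merged]

    def get(self, key: str) -> Optional[str]:
        if key in self.buffer:
            return self.buffer[key][1]
        best: Optional[Tuple[int, str]] = None
        for part in self.parts:
            for k, s, v in part:
                if k == key and (best is None or s > best[0]):
                    best = (s, v)
        return best[1] if best else None
```

The trade-off in this sketch is visible in `put()`: anything still sitting in `buffer` when the process dies is gone, so the flush interval bounds how much recent data can be lost.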
submitted by /u/amesika
1d ago
As data scientists and data analysts delve into the intricate world of data, they often encounter a common challenge: filling in gaps. Data can go missing for several reasons, for instance human error, sensor failure, or gaps in data collection. Getting the handling of missing values right is critical because, if they are not handled correctly, they can be very detrimental to machine learning models and statistical estimation.
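As a small illustration of what handling missing values can look like, here is a sketch of two common simple strategies, mean imputation and forward fill, using only the standard library. The function names and the use of `None` to mark a missing reading are assumptions made for this example; the linked article may cover different methods.

```python
from statistics import mean
from typing import List, Optional

Reading = Optional[float]  # None marks a missing value (an assumption here)


def fill_with_mean(values: List[Reading]) -> List[float]:
    """Replace each missing value with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("cannot impute: no observed values")
    average = mean(observed)
    return [average if v is None else v for v in values]


def forward_fill(values: List[Reading]) -> List[Reading]:
    """Carry the last observed value forward; leading gaps stay missing."""
    filled: List[Reading] = []
    last: Reading = None
    for v in values:
        if v is not None:
            last = v
        filled.append(last)
    return filled
```

Mean imputation suits unordered data, while forward fill is usually a better fit for time-ordered data such as sensor readings, where the previous value is a reasonable guess for the next one.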
submitted by /u/taylor-mark
1d ago
Want to easily share BigQuery insights with your external clients, partners, or vendors?
If complex BI tools or clunky CSV exports are your current solutions, it’s time for an upgrade! Softr now integrates with BigQuery, allowing you to easily connect to your BigQuery database to create dedicated dashboards and reports, without coding or complex analytics tools.
Here’s what you can do:
Data portals: Create intuitive, customized dashboards directly within Softr. No need for third parties and non-technical team members to master complex analytics software.
Secure access control: Fine-tune pe ..read more
2d ago
As data scientists and data analysts delve into the intricate world of data, they often encounter a common challenge: filling in gaps. Data can go missing for several reasons, for instance human error, sensor failure, or gaps in data collection. Getting the handling of missing values right is critical because, if they are not handled correctly, they can be very detrimental to machine learning models and statistical estimation. This article covers some data science skills and methodologies that are a must for effectively managing missing ..read more
2d ago
Hi Guys,
I hope you are well.
Free tutorials on Big Data, Hadoop, and Spark analytics projects (end to end), covering Apache Spark, Hadoop, Hive, Apache Pig, and Scala, with code and explanations.
Apache Spark Analytics Projects:
Vehicle Sales Report – Data Analysis in Apache Spark
Video Game Sales Data Analysis in Apache Spark
Slack Data Analysis in Apache Spark
Healthcare Analytics for Beginners
Marketing Analytics for Beginners
Sentiment Analysis on Demonetization in India using Apache Spark
Analytics on India census using Apache Spark
Bidding Auction Data Analytics in Apache ..read more
2d ago
ClickHouse Performance Master Class – Tools and Techniques to Speed up any ClickHouse App
We’ll discuss tools to evaluate performance including ClickHouse system tables and EXPLAIN. We’ll demonstrate how to evaluate and improve performance for common query use cases ranging from MergeTree data on block storage to Parquet files in data lakes. Join our webinar to become a master at diagnosing query bottlenecks and curing them quickly. https://hubs.la/Q02t2dtG0
submitted by /u/Altinity_CristinaM