Reddit - Big Data
0 FOLLOWERS
Everything big data from storage to predictive analytics.
Reddit - Big Data
1d ago
A cloud database is a collection of data, or information, that is specially organized for rapid search, retrieval, and management all via the internet. The guide below shows how with Blaze no-code platfrom, you can house your database with no code and store your data in one centralized place so you can easily access and update your data: Online Database - Blaze.Tech
submitted by /u/thumbsdrivesmecrazy
[visit reddit] [comments ..read more
Reddit - Big Data
5d ago
Hey r/bigdata, I've just opened up my collection of python tools aimed at automating data sorting and organization. These are personal tools I made to handle unorganized files and mass amounts of data efficiently. It's still a work in progress but I've incorporated automation in most of the tools to take the headache out of a lot of the redundant tasks. I'm hoping they can help others out there with their workflows. Dive in and try them out, and I’d love to get your feedback to make these tools even better.
https://github.com/nazpins/naztech-automated-data-sorting-tools
submitted by /u/Kilro ..read more
Reddit - Big Data
5d ago
I am planning to buy a laptop and confused which one to pick. Considering high performance, budget under 40k. Thanks in advance!
submitted by /u/Several_Ad9166
[visit reddit] [comments ..read more
Reddit - Big Data
5d ago
So I have a csv containing football data about goals where each goal has a scorer, GCA1(the player that gave assist), GCA2(the player that gave the pass to the assister)
I want to discover patterns of player positions that lead to a goal AKA buildups to a goal
Example: RB passed to a CAM which assisted a goal scored by a ST, or CB passed to a RW which assisted a goal scored by a LW
I want to find the most frequent buildups, think of it as finding frequent itemsets for a supermarket to derive discount decisions. Except my goal is to know which buildups are most common and make up coaching plan ..read more
Reddit - Big Data
1w ago
I’m an OSS developer (primarily working on Dask) and lately I’ve been talking to users about how they’re using Dask for ETL-style production workflows and this inspired me to make something myself. I wanted a simple example that met the following criteria:
- **Run locally (optionally)**. Should be easy to try out locally and easily scalable.
- **Scalable to cloud**. I didn’t want to think hard about cloud deployment.
- **Python forward**. I wanted to use tools familiar to Python users, not an ETL expert.
The resulting data pipeline uses Prefect for workflow orchestration, Dask to scale the da ..read more