DataGeek Blog
389 FOLLOWERS
DataGeek Blog expertise you in the data science quest. This blog hopes to make more collaborative and involve more guest bloggers. Discover the latest news related to data by following this feed.
DataGeek Blog
2M ago
In my early days as a Db2 DBA, it was occasionally easy to be a performance hero. In those days, on 32-bit systems, the default size of Db2’s default buffer pool (primary data cache) was 1,000 4k pages. For many databases, this was far too small.… Read the rest ..read more
DataGeek Blog
3M ago
One key to performance in Snowflake is using partition pruning to read as little data as possible. I’ve previously written about what partition pruning is in Snowflake, but there are really two different types of partition pruning.
Static Partition Pruning
Static partition pruning occurs when Snowflake can use the metadata it collects about every eligible column on every micro-partition to eliminate micro-partitions based on filtering conditions specified in the query (or in a an object definition like a view or dynamic table).… Read the rest ..read more
DataGeek Blog
3M ago
I have to say, I love Snowflake, and I think I’ve found my career home, at least for a while. I’m fairly sure I’m past the honeymoon period at this point, too. I’ve been frustrated with a couple of things, but those few things are not that bad.… Read the rest ..read more
DataGeek Blog
4M ago
It’s summer! Kids are out of school, the pools are open, and …. technical conferences are in full swing? For those who are not aware, the IDUG North America 2024 technical conference is happening next week in Charlotte, NC (June 23-27).… Read the rest ..read more
DataGeek Blog
5M ago
I often start with just a query id and a general description of a performance problem. Almost as often as I’m looking at query profiles or the equivalent in internal tools, I like to see more about the context of the query.… Read the rest ..read more
DataGeek Blog
7M ago
This was one of my favorite types of blog entries to write when I worked with IBM Db2, and I’d like to revive it for Snowflake queries. I spend a good part of my day job helping people solve SQL performance problems on Snowflake.… Read the rest ..read more
DataGeek Blog
8M ago
As someone who regularly makes up scenarios to demonstrate concepts – both professionally and in my free time, I was very used to Db2’s answer to data generation – which is basically either fun with recursive SQL, pay for the additional IBM product Optim Test Data Generation, or use another language.… Read the rest ..read more
DataGeek Blog
9M ago
This blog entry compliments a webinar on February 8th where I will be a panelist answering questions on Amazons RDS for Db2. Register for “Introducing Amazon RDS for Db2”now.
I was lucky enough to be invited to speak at IDUG Australia a few years ago.… Read the rest ..read more
DataGeek Blog
9M ago
I have answered this question so many times and in so many different ways. Conceptually, my article Db2 Basics: The Role of Databases covers the concept of a database or database management system, why we need them and how we use them.… Read the rest ..read more
DataGeek Blog
9M ago
I had the idea of recording my rolls to prove that I’m taking whatever the the SQL random function gives me. Here’s my first attempt at that:
Video showing roll for database rolls for this articleRolls
Roll Type
Roll Value
Roll Result
Platform
3
SQL Lite
Task Category
2
Defining Objects (DDL, etc)
Task
2
Create index
Chaos
1
Use GUIs in every situation where possible
Table listing each individual roll for this episode
Other than experimentation, I don’t have much experience with SQLite, and the experience I do have is mostly in combination with python in a Jupyter Notebook.… R ..read more