Databricksters
Subscribe
Sign in
Home
Notes
Chat
AI & ML
Data Engineering
Archive
About
Latest
Top
Discussions
The Hidden Price of Streaming: Cutting S3 API Calls for Massive Cloud Savings
A practical approach to cutting cloud expenses through smarter S3 API usage
May 20
•
Geethu
3
Share this post
Databricksters
The Hidden Price of Streaming: Cutting S3 API Calls for Massive Cloud Savings
Copy link
Facebook
Email
Notes
More
Databricks AI/BI Dashboard 2.0
Natural Language Interaction with Your Dashboards
May 6
•
Ambarish
3
Share this post
Databricksters
Databricks AI/BI Dashboard 2.0
Copy link
Facebook
Email
Notes
More
April 2025
Juggling a model circus: a PyFunc's tale
alternatively, how to serve multiple models on a single model serving endpoint in Databricks.
Apr 29
•
Veena
and
Debu Sinha
3
Share this post
Databricksters
Juggling a model circus: a PyFunc's tale
Copy link
Facebook
Email
Notes
More
Everything You Ever Wanted to Know about Pandas / PyArrow UDFs in Apache Spark
Vectorized UDFs, Zero-Copy Arrows & 100× Speed-Ups
Apr 25
•
Canadian Data Guy
3
Share this post
Databricksters
Everything You Ever Wanted to Know about Pandas / PyArrow UDFs in Apache Spark
Copy link
Facebook
Email
Notes
More
PyFunc it! We'll do it Live!
Real-Time Data Preprocessing for Custom Databricks Model Serving Endpoints
Apr 22
•
Austin Zaccor
8
Share this post
Databricksters
PyFunc it! We'll do it Live!
Copy link
Facebook
Email
Notes
More
Braving through the pitfalls of LLM judges
A guide on improving your LLM evaluations.
Apr 15
•
Veena
3
Share this post
Databricksters
Braving through the pitfalls of LLM judges
Copy link
Facebook
Email
Notes
More
Databricks Vector Search Similarity Scores Deep Dive
Have you noticed unexpected results from Databricks Vector Search’s similarity_search?
Apr 9
•
Joshua Eason
2
Share this post
Databricksters
Databricks Vector Search Similarity Scores Deep Dive
Copy link
Facebook
Email
Notes
More
March 2025
How to Actually Delete Data in Spark Streaming (Without Breaking Things ) 💥
What Every Data Engineer Needs to Know About GDPR-Ready Pipelines
Mar 25
•
Canadian Data Guy
3
Share this post
Databricksters
How to Actually Delete Data in Spark Streaming (Without Breaking Things ) 💥
Copy link
Facebook
Email
Notes
More
On the Topic of LLMs and Non-Determinism:
Practical Limitations in Combating the Myth of Uncertainty in Deep Learning
Mar 18
•
Austin Zaccor
2
Share this post
Databricksters
On the Topic of LLMs and Non-Determinism:
Copy link
Facebook
Email
Notes
More
Concurrent Execution and Query Throughput in Databricks SQL
Introduction
Mar 11
•
Artem Chebotko
1
Share this post
Databricksters
Concurrent Execution and Query Throughput in Databricks SQL
Copy link
Facebook
Email
Notes
More
February 2025
Archiving Data in Databricks Lakehouse: A Comprehensive Guide to Cost Optimization and Best Practices
As organizations accumulate vast amounts of data, managing storage costs becomes a critical challenge.
Feb 25
•
Yogesh Gowda
15
Share this post
Databricksters
Archiving Data in Databricks Lakehouse: A Comprehensive Guide to Cost Optimization and Best Practices
Copy link
Facebook
Email
Notes
More
12
A Beginner's Guide to MLOps Stacks on Databricks
Equipped with an almost excessive amount of diagrams!
Feb 18
•
Veena
8
Share this post
Databricksters
A Beginner's Guide to MLOps Stacks on Databricks
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts