Evaluating the Impact of Outlier Treatment in Time Series
1 1
Sensitivity Analysis, Model Validation, Feature Importance & More!
2h ago
A 6-Month Detailed Plan to Build Your Junior Data Science Portfolio
1 2
Step-by-step guide to creating, polishing, and deploying a portfolio that helps you land your first job
#python #pyladies #portfolio #datascience #officehours #careeradvice #machinelearning
12h ago
1 1
Two short anecdotes about transformations, and what it takes if you want to become ”AI-enabled”
16h ago
How Big Tech Is Exploiting Content Creators, And (Trying To) Get Away With It
1 1
Diving into current courtroom dramas around AI, copyright, and the true meaning of “fair use.”
19h ago
Successful AI Ethics & Governance at Scale: Bridging The Organizational and Implementation Gaps
1 1
The road from ethics to executives winds through lawyers and LLMs
23h ago
Can You Tell Free Python Art from Multi-Million Dollar Pieces?
0 1
Follow along for a Python generative art tutorial, inspired by Piet Mondrian and Josef Albers. From Code to Canvas, Part I.
on Thu, 1PM
Demystifying the Correlation Matrix in Data Science
0 1
Understanding the Connections Between Variables: A Comprehensive Guide to Correlation Matrices and Their Applications
on Thu, 7AM
How I Created a Data Science Project Following CRISP-DM Lifecycle
0 1
An end-to-end project using the CRISP-DM framework
on Thu, 1AM
Techniques for Exploratory Data Analysis and Interpretation of Statistical Graphs
0 2
Practical Approaches for Uncovering Insights and Patterns in Statistical Visualizations
on Wed, 5PM
Boosting Algorithms in Machine Learning, Part II: Gradient Boosting
0 1
Uncovering a simple yet powerful, award-winning machine learning algorithm
on Wed, 8AM
From Parallel Computing Principles to Programming for CPU and GPU Architectures
0 1
For early ML Engineers and Data Scientists, to understand memory fundamentals, parallel execution, and how code is written for CPU and GPU.
on Wed, 1AM
Predict Housing Price using Linear Regression in Python
0 1
A walk-through of cost computation, gradient descent, and regularization using Boston Housing dataset
on Tue, 6PM
0 1
How we use a shared Spark server to make our Spark infrastructure more efficient
on Tue, 2PM
Advanced Time Series Forecasting With sktime
0 1
Learn how to optimize model hyperparameters and even the architecture in a few lines of code
on Tue, 10AM
Tracing the Transformer in Diagrams
0 1
What exactly do you put in, what exactly do you get out, and how do you generate text with it?
on Tue, 7AM
Markov Decision Problems in Robotics
0 1
Using a real-world example to explain MDPs, the Bellman equation and value iteration
on Tue, 12AM
Random Forest, Explained: A Visual Guide with Code Examples
0 1
Making tree-mendous predictions with random trees
on Mon, 8PM
Solving the classic Betting on the World Series problem using hill climbing
0 1
A simple example of hill climbing — and solving a problem that’s difficult to solve without optimization techniques
on Mon, 12AM
Watermarking for AI Text and Synthetic Proteins: Fighting Misinformation and Bioterrorism
0 0
Understanding AI applications in bio for machine learning engineers
on Fri, 9PM
I Wasn’t Always a Data Scientist — How I Broke into the Field
0 0
8 strategies I used (and you can too) on my journey to data science
on Nov 6
Football and Geometry — Passing Networks
0 0
Analyzing Bayer Leverkusen’s Passing Networks from Last Season
on Sep 18
Lunar Crater Detection: Computer Vision in Space
0 0
One small step towards autonomous crater-based navigation
on Sep 16
How to Implement Graph RAG Using Knowledge Graphs and Vector Databases
0 0
A Step-by-Step Tutorial on Implementing Retrieval-Augmented Generation (RAG), Semantic Search, and Recommendations
on Sep 12
0 0
An intro to an especially sneaky bias that invades many regression models
on Aug 7
Building a marketing data science team from scratch
0 0
From scratch to a 6-member team: How I built Skyscanner’s marketing data science team, proving value by being focused and strong…
on Aug 4
Doping: A Technique to Test Outlier Detectors
0 0
Using well-crafted synthetic data to compare and evaluate outlier detectors
on Jul 10
LLM Apps, Crucial Data Skills, Multi-AI Agent Systems, and Other July Must-Reads
0 0
The stories that resonated the most with our community in the past month
on Jul 6
Forget Statistical Tests: A/B Testing Is All About Simulations
0 0
How simulations outperform traditional stats in that they are easier to understand, more flexible, and economically meaningful
on Jul 4
MMM: Bayesian Framework for Marketing Mix Modeling and ROAS
0 0
Bayesian framework to model media channels performance, Return on Ad Spend (ROAS), and budget allocation using PyMC
on Jun 17
Thinking, fast and slow with LLMs and PDDL
0 0
ChatGPT is never shy at pretending to perform deep thought, but — like our brain — might need better tools to reason accurately
on Jun 9
Solving a Resource Planning Problem with Mathematical Programming and Column Generation
0 0
Solving the minimum vertex coloring problem via column generation
on Jun 7
0 0
Stepping out of the “comfort zone” — part 3/3 of a deep-dive into domain adaptation approaches for LLMs
on Jun 6
The Meaning of Explainability for AI
0 0
Do we still care about how our machine learning does what it does?
on Jun 5
Deep Learning Illustrated, Part 3: Convolutional Neural Networks
0 0
An illustrated and intuitive guide on the inner workings of a CNN
on Jun 5
Jet Sweep: Route Optimization to Visit Every NFL Team at Home
0 0
Solving the Traveling Salesman Problem with Linear Programming
on Jun 4
How I Use ChatGPT As A Data Scientist
0 0
How ChatGPT improved my productivity as a data scientist
on Jun 3
Long-form video representation learning (Part 2: Video as sparse transformers)
0 0
We explore novel video representations methods that are equipped with long-form reasoning capability. This is part II focusing on sparse…
on Jun 3
How to Detect Objects in Real-Time Using OpenCV and Python
0 0
For the uninitiated, Real-Time Object Detection might sound quite a mouthful. However, with a few awesome libraries at hand, the job…
on Jun 3
What 10 Years at Uber, Meta and Startups Taught Me About Data Analytics
0 0
Advice for Data Scientists and Managers
on Jun 2
Data Science Portfolios, Speeding Up Python, KANs, and Other May Must-Reads
0 0
The stories that resonated the most with our community in the past month
on Jun 2
Deep Dive into Anthropic’s Sparse Autoencoders by Hand ✍️
0 0
Explore the concepts behind the interpretability quest for LLMs
on Jun 1
YOLO — Intuitively and Exhaustively Explained
0 0
The genesis of the most widely used object detection models.
on May 31
Build Your Own ChatGPT-like Chatbot with Java and Python
0 0
Creating a custom LLM inference infrastructure from scratch
on May 31
Self-Attention Explained with Code
0 0
How large language models create rich, contextual embeddings
on May 30
Fine-Tune Smaller Transformer Models: Text Classification
0 0
Using Microsoft’s Phi-3 to generate synthetic data
on May 29
An Introduction to Reinforcement Learning
0 0
A deep dive into the rudiments of reinforcement learning, including model-based and model-free methods
on May 28
Kolmogorov-Arnold Networks (KANs) for Time Series Forecasting
0 0
Discover the Kolmogorov-Arnold Networks (KANs) and apply them for time series forecasting using Python
on May 27