taxonomy :: Tags
api
rate
rate limiter
system design
sql
window sql
bst
data structure
traversal
tree
grpc
rest
streaming
datawarehouse
schema
data warehouse
scd
acid
database
http
polling
push
real time
service
cloud
storage
video share
photo sharing
messenger
scale
aws
discover
ElasticSearch
search
kendra
cardinality
count
HLL
Hyperloglog
kinesis
aiml
athena
sagemaker
programming
python
speed
vectorization
datalake
redshift
s3
gateway
lambda
architecture
ec2
emr
spark
budget
cost optimization
money
operation
savings
cloudfront
invalidate
data
etl
glue
cloudtrail
cloudwatch
monitoring
group
iam
permission
role
security
user
performance
reliability
ebs
efs
class
object
oop
args
kwargs
dictionary
list
tuple
availability
cap
consistency
partition
shard
hadoop
Asynchronous
Synchronous
architecture
batch
speed layer
scaling
space complexity
time complexity
cache
distributed systems
keys
zookeeper
https
authentication
authorization
cookie
request
session
SOAP
token
accuracy
activation
bagging
boosting
classification
confusion matrix
decision trees
feature
feature engineering
function
machine learning
neural network
precision
predict
random forest
recall
regression
sensitivity
specificity
supervised learning
trees
word2vec