1:48:51 Session 21: Actor Critic based Policy Gradient, Safe RL, Planning, DYNA, Curriculum Learning Mainak's PMRF Tutorials 224 views - 8 months ago
1:50:03 Session 20: Deep Neural Networks, MLP, Backpropagation, Policy Gradient, REINFORCE Mainak's PMRF Tutorials 91 views - 8 months ago
1:54:17 Session 19: Asynchronous Q learning, Classification in ML, MLE, Logistic and Softmax Regression Mainak's PMRF Tutorials 265 views - 8 months ago
1:57:08 Session 18 Synchronous Q-learning, Model-free, based, tabular, with Linear Fn. Approx., Convergence Mainak's PMRF Tutorials 53 views - 8 months ago
1:39:10 Session 17: Off-Policy Evaluation of TD0 with linear function Approximation, Emphatic TD0 Mainak's PMRF Tutorials 41 views - 9 months ago
1:42:49 Session 16 γ contraction, Banach's Fixed Point Theorem, How far is it far from the intended optimal Mainak's PMRF Tutorials 55 views - 9 months ago
1:52:56 Session 15 TD(0) convergence proof (contd), Point of Convergence of TD(0) (linear function approx.) Mainak's PMRF Tutorials 56 views - 9 months ago
1:54:39 Session 14: TD0 with linear function approximation, Glimpse at Stochastic Approximation Algorithm(1) Mainak's PMRF Tutorials 86 views - 9 months ago
1:45:21 Session 13: Function Approximation in RL, Policy Evaluation, SGD Monte Carlo, TD(0) Implementation Mainak's PMRF Tutorials 118 views - 9 months ago
1:49:15 Session 12: On Policy vs Off Policy Algorithms, Importance Sampling, Model-free Q learning, SARSA Mainak's PMRF Tutorials 129 views - 9 months ago
1:44:33 Session 11 Model Free Methods, Monte Carlo, Temporal Difference Algorithm, TD(λ) Algorithm Mainak's PMRF Tutorials 108 views - 10 months ago
1:51:52 Session 10: Stochastic Shortest Path, Bellman Operators, Proof of convergence of Policy Evaluation Mainak's PMRF Tutorials 127 views - 10 months ago
1:55:36 Session 9: Policy Iteration & Q learning code, Finite Horizon MDPs, Dynamic Program, Theory and Exmp Mainak's PMRF Tutorials 138 views - 10 months ago
1:48:24 Session 8 Bellman Equation, Optimal Policy, Iterative Policy Evaluation, Policy & Value Iteration Mainak's PMRF Tutorials 166 views - 10 months ago
1:51:33 Session 7: MDPs, Action, Value, Reward functions, Bellman Equations 1, Examples Mainak's PMRF Tutorials 180 views - 10 months ago
1:53:14 Session 6 Random Processes, Markov Chains and Stationary Distribution Mainak's PMRF Tutorials 163 views - 10 months ago
1:50:48 Session 5 ODE Interpretation in Bandits, UCB, Gradient-Based Algorithms, UCB in Python Mainak's PMRF Tutorials 163 views - 10 months ago
1:42:57 Session 4: Introduction to Reinforcement Learning, Multi-armed Bandits Algorithm and Implementation Mainak's PMRF Tutorials 379 views - 11 months ago
1:54:27 Session 3: Recap on Joint Distributions, Conditional Distributions, and Conditional Expectations Mainak's PMRF Tutorials 145 views - 11 months ago
1:48:30 Session 2: Recap - Continuous Distributions, Transformation of random variables Mainak's PMRF Tutorials 211 views - 11 months ago
1:56:11 Session 1 Recap on Random Variables, Exemplar Discrete Distributions, Expectations Mainak's PMRF Tutorials 461 views - 11 months ago
2:20:38 Session 24: Mixture Models, Expectation Maximization, GMMs, K-means is a specialized GMM Mainak's PMRF Tutorials 435 views - 1 year ago
2:07:14 Session 23: Dimensionality Reduction - Principal Component Analysis, Linear Discriminant Analysis Mainak's PMRF Tutorials 364 views - 1 year ago
2:02:29 Session 22: Unsupervised Learning, Clustering algorithms, K-means, K-medoids, and Hierarchical Mainak's PMRF Tutorials 261 views - 1 year ago
1:41:05 Session 21: Backpropagation, Dropout, Bias-variance tradeoff, Prevent overfitting or underfitting Mainak's PMRF Tutorials 371 views - 1 year ago
1:59:31 Session 20: Perceptron, Perceptron Learning Algorithm, Convergence Proof, MLPs, Forward Propagation Mainak's PMRF Tutorials 250 views - 1 year ago
2:24:19 Session 19: Kernel SVM, KKT conditions, Primal solutions, Sequential minimal optimization, SVR Mainak's PMRF Tutorials 314 views - 1 year ago
2:14:19 Session 18: Constrained Optimization Problems, Lagrangian Multipliers, Duality, SVM - Dual Form Mainak's PMRF Tutorials 268 views - 1 year ago
2:03:59 Session 17: K-Nearest Neighbours, Decision Trees, Formulating Support Vector Machines Mainak's PMRF Tutorials 450 views - 1 year ago
2:04:11 Session 16: Logistic Regression, Stochastic Gradient Descent, Softmax Regression (multiclass) Mainak's PMRF Tutorials 288 views - 1 year ago
2:22:40 Session 15: Discriminant Functions, MLE and MAP estimates, Ridge regression as a MAP Mainak's PMRF Tutorials 266 views - 1 year ago
2:22:42 Session 14: Gradient Descent works! Its Applications, Bayesian Decision Making, Bayesian Risk Mainak's PMRF Tutorials 324 views - 1 year ago
2:30:03 Session 13: What's Machine Learning? Supervised Learning - Linear and Ridge Regression (many views) Mainak's PMRF Tutorials 715 views - 1 year ago
1:57:52 Session 12 Solving difference equations - Fibonacci sequence, Singular Value Decomposition Mainak's PMRF Tutorials 138 views - 1 year ago
2:46:53 Session 11: Eigenvalues, eigenvectors, Eigen Decomposition, Multiplicity, Diagonalization Mainak's PMRF Tutorials 166 views - 1 year ago
2:10:31 Session 10: Elimination for Inverse, Solution to Lin. Equns, Projection, Orthonormal matrices Mainak's PMRF Tutorials 187 views - 1 year ago
2:43:02 Session 9: Sum of subspaces, Theory and Geometry of 4 fundamental subspaces, Rank-Nullity theorem Mainak's PMRF Tutorials 209 views - 1 year ago
2:08:55 Session 8: Introduction to Linear Algebra, Gauss-Jordan Elimination, Vector Spaces and Subspaces Mainak's PMRF Tutorials 259 views - 1 year ago
2:28:08 Session 7: Statistical tests - confidence interval, z-test, t-test - 1 & 2 sample, chi-square test Mainak's PMRF Tutorials 189 views - 1 year ago
2:01:30 Session 6: More advanced distributions, Chi-squared, Student t-distribution, and their properties Mainak's PMRF Tutorials 189 views - 1 year ago
2:33:45 Session 5: Joint Distributions, Conditional Expectation, Markov Ineq., Weak Law of Large Numbers Mainak's PMRF Tutorials 226 views - 1 year ago
2:24:36 Session 4: Memoryless Distributions, Moment Generating Functions, Central Limit Theorem Mainak's PMRF Tutorials 265 views - 1 year ago
2:01:41 Session 3: Fundamental Bridge, Continuous Random Variables, transformation of RVs Mainak's PMRF Tutorials 332 views - 1 year ago
2:11:48 Session 2: Random Variables - Discrete RVs, Expectation, Variances and their properties Mainak's PMRF Tutorials 428 views - 2 years ago
2:25:13 Session 1: Introduction to counting, Probability Space, Conditional Probability Mainak's PMRF Tutorials 2.6K views - 2 years ago
2:11:34 Session 10: Gradient descent, why it works, Linear and Logistic regression, ML estimate Mainak's PMRF Tutorials 153 views - 2 years ago
2:41:16 Session 9: Introduction to convex functions, Jensen’s, Holder’s inequality, Minkowski, Lagrangian Mainak's PMRF Tutorials 182 views - 2 years ago
2:43:07 Session 8: Inner products, vector norms, dual spaces, introduction to matrix norms Mainak's PMRF Tutorials 242 views - 2 years ago
2:43:02 Session 7: Eigenvector decomposition, unitary and normal matrices, Application: PCA and SVD Mainak's PMRF Tutorials 149 views - 2 years ago
2:52:23 Session 6: Projections, Least squares, eigenvalue-eigenvectors, Char. polynomial, similar matrices Mainak's PMRF Tutorials 169 views - 2 years ago
2:45:29 Session 5: Intro to Matrices, vector spaces, span and basis, 4-fundamental subspaces, elimination Mainak's PMRF Tutorials 234 views - 2 years ago
3:15:08 Session 4: MGFs, random vectors, joint distribution, random process, Random walks, Markov chains Mainak's PMRF Tutorials 179 views - 2 years ago
2:34:46 Session 3: continuous rvs, pmf, pdf, inequality, Uniform, Exponential, Normal, transformation of rvs Mainak's PMRF Tutorials 153 views - 2 years ago
3:02:06 Session 2: Discrete random variables, distribution, expectation, lotus, variance Mainak's PMRF Tutorials 151 views - 2 years ago
2:29:27 Session 1: Introduction to counting, RVs, and distributions Mainak's PMRF Tutorials 639 views - 2 years ago