ML-2 Note: Linear Models for Classification and GLMs

ML-2 Note: Linear Models for Classification and GLMs 1. Why Classification Uses Logistic Regression In classification problems, the target variable is discrete. Binary classification example: $$y \in \left\{ 0,1 \right\}$$ Given input features $$x \in \mathbb{R}^d$$ we want to model $$P(y=1|x)$$ Problem with Linear Regression A linear model predicts $$f(x) = w^T x$$ but $$w^T x \in (-\infty, \infty)$$ while probabilities must satisfy $$P(y=1|x) \in [0,1]$$ Thus we need a function that maps ...

March 28, 2026 · Mike

ML-1 Note: Supervised Learning; Linear Regression

ML-1 Note: Supervised Learning; Linear Regression 1. Basic Model of Linear Regression Linear Regression is one of the simplest and most fundamental models in supervised learning. The goal is to model the relationship between input features and a continuous target variable. Model Form For a dataset with feature vector \(x\): \[ y = w^T x + b \] or equivalently \[ \hat{y} = \theta^T x \] where: \(x\): input feature vector \(w\): weight vector \(b\): bias term \(\theta\): parameter vector (including bias) \(\hat{y}\): predicted value Loss Function The most common loss function for linear regression is Mean Squared Error (MSE). ...

March 22, 2026 · Mike

Learning and Practice of Single-Cell Sequencing

Abstract Single-cell RNA sequencing (scRNA-seq) has become an essential technique for studying cellular heterogeneity and complex biological systems. During this winter research training, I systematically studied the general workflow and analytical principles of scRNA-seq based on Single Cell Best Practices and related resources, covering key steps such as data preprocessing, quality control, normalization, dimensionality reduction, clustering, and cell type annotation. To consolidate the knowledge, I first reproduced a complete analysis pipeline using publicly available immune cell data to familiarize myself with standard procedures and tools, then independently applied the same workflow to a publicly available human brain infection-related single-cell dataset, identifying distinct cell populations and infection-associated transcriptional changes across cell types. Overall, this training deepened my understanding of scRNA-seq data analysis, demonstrated the adaptability of standardized workflows to diverse biological contexts, and provided preliminary insights into cellular responses in infected human brain tissue as a foundation for further studies. ...

March 9, 2026 · Mike

Hello World

March 7, 2026 · Mike