Coursera公开课笔记: 斯坦福大学机器学习第六课“逻辑回归(Logistic Regression)” - 我爱公开课



6) Advanced optimization(其他优化算法) 如果分类器用的是回归模型,并且已经训练好了一个模型,可以设置一个阈值: 如果\(h_\theta(x) < 0.5\),则预测y=0,既y属于负例; 如果是线性回归模型,对于肿瘤这个二分类问题,图形表示如下: 注: 以下引自李航博士《 统计学习方法 》1.8节关于分类问题的一点描述: 2) Hypothesis Representation 但是线性回归无法做到,这里我们引入一个函数g, 令逻辑回归的Hypothesis表示为: 这里g称为Sigmoid function或者Logistic function, 具体表达式为: Sigmoid 函数在有个很漂亮的“S"形,如下图所示(引自维基百科): 综合上述两式,我们得到逻辑回归模型的数学表达式: 其中\(\theta\)是参数。 例如,对于肿瘤(恶性/良性),如果输入变量(特征)是肿瘤的大小: 这里Hypothesis表示的是”病人的肿瘤有70%的可能是恶性的“。 数学上可以如下表示: 对于因变量y=0或1这样的二分类问题:   当\(h_\theta(x) < 0.5\)时,y = 0; 再次回顾sigmoid function的图形,也就是g(z)的图形: 当\(g(z) \geq 0.5\)时, \(z \geq 0\);

Read full article from Coursera公开课笔记: 斯坦福大学机器学习第六课“逻辑回归(Logistic Regression)” - 我爱公开课


No comments:

Post a Comment

Labels

Algorithm (219) Lucene (130) LeetCode (97) Database (36) Data Structure (33) text mining (28) Solr (27) java (27) Mathematical Algorithm (26) Difficult Algorithm (25) Logic Thinking (23) Puzzles (23) Bit Algorithms (22) Math (21) List (20) Dynamic Programming (19) Linux (19) Tree (18) Machine Learning (15) EPI (11) Queue (11) Smart Algorithm (11) Operating System (9) Java Basic (8) Recursive Algorithm (8) Stack (8) Eclipse (7) Scala (7) Tika (7) J2EE (6) Monitoring (6) Trie (6) Concurrency (5) Geometry Algorithm (5) Greedy Algorithm (5) Mahout (5) MySQL (5) xpost (5) C (4) Interview (4) Vi (4) regular expression (4) to-do (4) C++ (3) Chrome (3) Divide and Conquer (3) Graph Algorithm (3) Permutation (3) Powershell (3) Random (3) Segment Tree (3) UIMA (3) Union-Find (3) Video (3) Virtualization (3) Windows (3) XML (3) Advanced Data Structure (2) Android (2) Bash (2) Classic Algorithm (2) Debugging (2) Design Pattern (2) Google (2) Hadoop (2) Java Collections (2) Markov Chains (2) Probabilities (2) Shell (2) Site (2) Web Development (2) Workplace (2) angularjs (2) .Net (1) Amazon Interview (1) Android Studio (1) Array (1) Boilerpipe (1) Book Notes (1) ChromeOS (1) Chromebook (1) Codility (1) Desgin (1) Design (1) Divide and Conqure (1) GAE (1) Google Interview (1) Great Stuff (1) Hash (1) High Tech Companies (1) Improving (1) LifeTips (1) Maven (1) Network (1) Performance (1) Programming (1) Resources (1) Sampling (1) Sed (1) Smart Thinking (1) Sort (1) Spark (1) Stanford NLP (1) System Design (1) Trove (1) VIP (1) tools (1)

Popular Posts