Probability and Statistics

Prob­a­bil­ity the­ory is one of the more recently estab­lished fields of math­e­mat­ics, which is cur­rently a very active area of research. It has found numer­ous appli­ca­tions in math­e­mat­i­cal mod­el­ing for com­puter sci­ence, finance, sta­tis­ti­cal mechan­ics, dynam­i­cal sys­tems, bioin­for­mat­ics etc. Top­ics in prob­a­bil­ity the­ory include mar­tin­gale the­ory, sto­chas­tic processes, sto­chas­tic cal­cu­lus, ergod­ic­ity and oth­ers. Prob­a­bil­ity the­ory has drawn from many other areas of math­e­mat­ics such as mea­sure the­ory, inte­gra­tion the­ory, rep­re­sen­ta­tion the­ory, real and com­plex analy­sis and in return has pro­vided some inter­est­ing insights into alge­braic struc­tures, geom­e­try of graphs, dif­fer­en­tial geom­e­try, ergodic the­ory, PDE the­ory and sam­pling algorithms.

Prob­a­bil­ity The­ory Courses [Show]

Machine learn­ing (ML) is a sub-field of arti­fi­cial intel­li­gence that lies at the inter­sec­tion of com­puter sci­ence, sta­tis­tics, and prob­a­bil­ity the­ory. In broad terms, it con­cerns the design, imple­men­ta­tion, analy­sis, and appli­ca­tion of algo­rithms that can ‘learn from expe­ri­ence.’ In the last few decades, it has advanced extremely rapidly and found appli­ca­tions to a wide range of prob­lems includ­ing analy­sis of lan­guage (“nat­ural lan­guage pro­cess­ing,” or NLP) and lin­guis­tics, mod­el­ing of genetic vari­a­tion, quan­ti­ta­tive mod­el­ing in the social sci­ences, spam fil­ter­ing, auto­mated dis­cov­ery of iden­tity theft, and many more.

While ML is a com­plex and wide field, it can be roughly split in two parts: (1) pre­dic­tion (or, more cor­rectly, “super­vised” ML); and (2) mod­el­ing (or, more cor­rectly, “unsu­per­vised” ML). Pre­dic­tion prob­lems require that one pre­dict the value of some vari­able in the future given many obser­va­tions of it in the past; fore­cast­ing the value of a stock given its value (and mar­ket con­di­tions) over the course of a year is an exam­ple. One way to think about mod­el­ing prob­lems is as pre­dic­tion prob­lems in which the vari­able to be pre­dicted has not been observed at all; fore­cast­ing the value of a stock just from mar­ket con­di­tions, with­out ever see­ing its pre­vi­ous val­ues, would be an exam­ple of a mod­el­ing prob­lem. In gen­eral, unsu­per­vised prob­lems con­cern the dis­cov­ery and exploita­tion of hid­den pat­terns in data: the canon­i­cal exam­ple is the prob­lem of col­lect­ing together sim­i­lar images in a large data­base (“clustering”).

From a math major’s per­spec­tive, either or both of the the­o­ret­i­cal and applications-oriented sides of ML might be very inter­est­ing. The­o­ret­i­cal prob­lems can involve the analy­sis of spe­cific pre­dic­tion algo­rithms, either from a com­puter sci­ence or sta­tis­ti­cal per­spec­tive; the analy­sis and design of meth­ods of “infer­ence,” the key algo­rith­mic step in mod­el­ing that esti­mates impor­tant unob­served quan­ti­ties from observed data; or con­nec­tion of machine learn­ing prob­lems with ideas from infor­ma­tion the­ory, prob­a­bil­ity the­ory, or other theoretical—and some­times application—domains. Applications-oriented prob­lems, a catch-all cat­e­gory that includes any prob­lem that ulti­mately con­cerns the analy­sis of real data, show a sim­i­lar degree of vari­ety. They can involve the deploy­ment of exist­ing meth­ods to inter­est­ing data analy­sis prob­lems; the design of novel method­olo­gies that allow for new kinds of analy­ses; the use of unsu­per­vised meth­ods to solve tra­di­tion­ally super­vised prob­lems; and a range of other research activ­i­ties. It should be noted that ML research enjoys a rich and fruit­ful inter­play between the­ory and prac­tice, so that the­o­ret­i­cal projects can often include an empir­i­cal com­po­nent and applied projects often require under­stand­ing rel­e­vant theory—and some­times extend­ing it!

Machine Learn­ing at Prince­ton [Show]

Machine Learn­ing Courses [Show]

Con­tacts [Show]

Leave a Reply