| 0:00:07 | my name is tone each of are i mean assistant professor computer science i been | 
|---|
| 0:00:11 | at columbia for a little over five years now and might area of research is | 
|---|
| 0:00:15 | machine learning i direct the company machine learning lab and you have a large group | 
|---|
| 0:00:20 | of students doing several really exciting projects the machine learning lab is really about this | 
|---|
| 0:00:26 | velocity of combining complication computer science | 
|---|
| 0:00:30 | and marrying into statistics because there is so much data out there where we have | 
|---|
| 0:00:35 | not just an information age of an information overload age and the real hopes to | 
|---|
| 0:00:41 | use computers to help us make sense of the data automatically | 
|---|
| 0:00:45 | we want them to learn much with people learn and also work with the types | 
|---|
| 0:00:49 | of data that we care about so the text we read the images we see | 
|---|
| 0:00:53 | we want computers to be able understand that you to that we're generating everyday at | 
|---|
| 0:00:58 | a faster and faster rate | 
|---|
| 0:01:00 | in biology for example there's millions of variables extends of thousand genes and it's almost | 
|---|
| 0:01:06 | impossible to have someone look at this data and come up with a theory about | 
|---|
| 0:01:10 | how the biology works | 
|---|
| 0:01:12 | so increasingly side to succumbing to computer scientists and machine order saying | 
|---|
| 0:01:17 | we've made all these measurements there's just too many variables and machine learning is one | 
|---|
| 0:01:21 | of the few tools that really can work with this type of data machine learning | 
|---|
| 0:01:25 | can provide us with a network description of visualisation clustering production and so on which | 
|---|
| 0:01:32 | sciences finding very valuable these days | 
|---|
| 0:01:35 | another thing we've been working with its social network analysis and | 
|---|
| 0:01:39 | biological networks is another natural counterpart | 
|---|
| 0:01:42 | looking at networks of proteins figure out how they interact with the proteins functions are | 
|---|
| 0:01:47 | how expression levels vary over time i think machine learning is one of these | 
|---|
| 0:01:52 | unusually lucky feels in that | 
|---|
| 0:01:55 | the foundations it's working from are useful to many other disciplines | 
|---|
| 0:02:01 | are particular research is machine learning applied to really complicated problems and datasets | 
|---|
| 0:02:08 | where there is some additional structure that space so images transform in various ways if | 
|---|
| 0:02:14 | you see picture someone it rotates the latter the right or you move around somebody's | 
|---|
| 0:02:18 | face in an image you still recognizing and so we're trying to incorporate that same | 
|---|
| 0:02:23 | type of | 
|---|
| 0:02:25 | structure into all are machine learning algorithms we also design algorithms and machine learning models | 
|---|
| 0:02:31 | at work on sequences so then we can handle things like a string of text | 
|---|
| 0:02:35 | we've been able to do that very successful using machine learning by modeling the sequence | 
|---|
| 0:02:40 | structure of the text | 
|---|
| 0:02:43 | give me two documents are not tell you they were uttered by the same person | 
|---|
| 0:02:47 | or there are some subtle stylistic things between those two documents that say that this | 
|---|
| 0:02:51 | is in the same person | 
|---|
| 0:02:53 | in my mind that's the really exciting future direction from machine learning one of the | 
|---|
| 0:02:57 | areas we concentrate on my group which is how to incorporate some of this invariance | 
|---|
| 0:03:02 | we know exist special you picture a short you tell to you still recognise that | 
|---|
| 0:03:07 | actually see later on | 
|---|
| 0:03:09 | and that's kind of the key i think too many real-world problems that | 
|---|
| 0:03:15 | invariance for that particular problem | 
|---|