0:00:16 So the title may sound a bit too grand; what it means is that we are going to talk about how we mine causal knowledge from text.
0:00:32 We come from industry — an IT-support-providing industry — which has a lot of data, and a lot of it is text. We are in the process of trying to exploit those texts; that is what we keep trying to do: exploit that data to extract the relevant information.
0:01:00 So my aim in presenting here would be to show what has been done already, and which easy or difficult questions still remain to be addressed.
0:01:13 First, an introduction to the problem — what kind of information extraction we are talking about. There are various relationships, as we all know, that can hold between two portions of a sentence, or between multiple portions of sentences. We are interested in one specific kind of relationship, which is causal: we extract cause and effect relationships from text.
0:01:48 The slide here is trying to support what I said — that this is important to industry. These are causal relations extracted from text in different domains: for example, a recall of tyres that burst due to a fault, or the recall of cars by a company that has been all over the news.
0:02:16 Why this is important is that when these kinds of ripples happen in one industry or one particular organization, other organizations, from past experience, will know that here comes an event which can potentially affect them — which can be potentially difficult for them. That is the kind of predictive system we are talking about building for industry: driven not only by the data from which the rules for demand forecasting et cetera are derived, but also using the large amount of information that is there in text.
0:03:02 The second one is actually an example which comes from utilities and plants, who are always bothered about safety regulations. There are occupational safety agencies which give alerts, at this point, about any kind of safety incident that has happened — at, say, a manufacturing plant, a construction site, a mine, and so on. Each one of these reports actually gives a broad outline, for the regulated agencies, of what kind of issues occurred, what led to what, and what kind of problems arose from what kind of human activities. If we could have all the cause-effect pairs in these collections automatically extracted into that kind of knowledge base, the whole corpus of these reports would become very important for the future.
0:04:05 And third, very prominently, we have a lot of reports that keep coming on adverse drug effects. For example: "serious adverse effect was observed in patients with heart disease due to high dosage of" such-and-such a drug class — I don't recall the actual name. These are the kinds of things that get reported in the formal literature, with its careful language; but adverse drug effects are also reported on social media, which is noisy text, and so on. All of these have serious implications, because there are regulatory agencies keeping track of all the issues that are reported — or that people post there — and they have to get into investigating them, checking whether they are really attributable to the drug or not, or to one of the other factors, and so on.
0:05:00 So I was just giving some examples to motivate why this actually became a problem for us to work on. We are interested in detecting such causal relations for analytical and predictive applications, as I said. A separate application which I would like you to think about is the building of early warning systems: as such causal statements are automatically detected, one can keep track of them against the causal knowledge base that exists in that domain, incrementally update that partial knowledge base, and actually generate the warning signals from it.
0:05:51 Okay, so with that, let us get into the complexity — why it is a hard problem. There are different kinds of causal relations; we saw some a moment ago, and here are a few more. "Company X files for bankruptcy for mounting financial troubles": this is an explicit causal relation, where the effect is on the left-hand side — a company files for bankruptcy — and the cause is on the right: mounting financial problems.
0:06:24 Then, right here, there is a case where the causality is not explicitly stated. There is no causal connective over here, but we know that if you do not drive cautiously over bad roads, it can lead to a particular effect. So there is a road hazard, and there is an accident; we will only be able to tie that accident to the hazard through a kind of inference and reasoning, which may also have to be done.
0:06:54 An explicit cause can also appear on its own: the sentence here once again says that something has been caused by something, but where is the effect? There is an entity which is important, but it is not mentioned in the sentence over here — "the investigation has found that the part was faulty", or something like that. Only the cause is mentioned over here, and the effect has to be read from elsewhere.
0:07:21 The more complicated ones are also there: there can be multiple causal chains in a sentence. Take this data point: a model misbehaved and about one hundred and thirty thousand vehicles were recalled; the fault that motivated the fix also leads to engine stalling and starting trouble. So here there is a chain: there is one issue which is the cause of the engine stalling and the starting trouble; the engine starting issues reported are in turn the cause of the recall of the model; and of course the recall has financial implications.
0:08:04 Okay. So let us get into the kind of work that has been done. Most often, rule-based kinds of approaches have been applied. Work on the adverse drug effect dataset, for example, has been there for quite some time, and a lot of it is rule-based, which of course has its own problems. As for machine learning approaches, a lot of people have started using them in many situations; the problem there, of course, is the lack of training data. And the sentences can be mighty complex, so rule-based approaches do not always give us a hundred percent correct results. So, for our task — because of these problems, with data coming from multiple domains and the datasets not working well with rules — we wanted to have annotated assets from whatever we could get from multiple domains.
0:09:08 For this task we have proposed a linguistically informed bidirectional LSTM baseline. We annotated all the sentences, where each word of a sentence is finally labeled as either part of a cause, part of an effect, a causal connective, or none. Then the consecutive portions that are marked as cause, and the consecutive portions that are marked as effect, are put together, and for a given domain we then build causal sub-graphs; over the cause and effect portions of the graphs we have applied clustering. For the first step the annotation did not exist, so we did the annotation ourselves; then we went on to the second part, the classification, and then to building the causal graph.
0:10:12 Okay, so these are the resources — five datasets in total that we created. Some of them were already available — I will just talk about that — but we changed the annotations a bit. The first one consists of reports of the kind I was talking about: recall reports, financial news and information about companies, et cetera. We picked up about four thousand five hundred sentences from many such reports; the average sentence length is quite high in this case.
0:10:44 Then it was necessary to have, in particular, the SemEval dataset, which was already available — about thirteen hundred sentences — and which we actually re-annotated. Why did we re-annotate it? Because in SemEval only single words are marked as the cause and the effect, whereas when we checked it we saw that a single word as the cause is often not right. So we did our own annotation of it, which we validated, taking the whole phrase that forms the cause or the effect.
0:11:28 There is a collection from BBC news also, which has fewer sentences, but the average length of the sentences there is quite high. Then there is the ADE dataset, which is noisy text from Twitter and social media; it has about three thousand messages on adverse drug reactions. And then we created one more, of drug-related adverse events which are reported in news articles rather than in formal reports.
0:12:02 Okay, so the same annotation mechanism was followed for each. First, because the sentences could be complex and there could be causal chains, we used OpenIE — the system from the University of Washington — to actually break a sentence down into its multiple clauses, and then we set three annotators on it. Each annotated the data, and they were asked to mark the portions of the sentences, as you can see over here, as either cause, effect, or the causal connective which joins them. So here C is of course the cause and E is the effect. And when multiple causes come from the same sentence — from the multiple clauses into which OpenIE has broken it — they are numbered as well: for one clause of the sentence, a portion would be C with subscript one, and for another clause, C with subscript two.
0:13:07 These are some examples. Once again, when you have a very complex sentence like this, OpenIE breaks it into two clauses, and then each of them is marked: here it is C1 and E1, and here it is C2 and E2, and so on. Similarly, a single sentence can contain multiple causal chains originally; in that case also you will see C1, E1, C2, E2, and so on.
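To make the tagging scheme concrete, here is a minimal Python sketch of how one clause might look after OpenIE splitting and annotation; the sentence, the tag names, and the data layout are illustrative assumptions, not the speakers' actual annotation files.

# Minimal sketch of the per-token annotation described above.
# Tags: C = cause, E = effect, CC = causal connective, O = none;
# the digit numbers the clause produced by OpenIE (C1/E1, C2/E2, ...).
# The sentence below is a hypothetical example, not from the datasets.
clause_1 = [
    ("Mounting", "C1"), ("financial", "C1"), ("troubles", "C1"),
    ("led", "CC"), ("to", "CC"),
    ("the", "E1"), ("bankruptcy", "E1"), ("filing", "E1"),
]

cause  = " ".join(tok for tok, tag in clause_1 if tag == "C1")
effect = " ".join(tok for tok, tag in clause_1 if tag == "E1")
print(f"cause: {cause!r} -> effect: {effect!r}")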
0:13:46 So this is our model. Based on the annotations given by the annotators, we now train a deep learning model. We call it linguistically informed because we also use a lot of linguistic information in training it — it is not just the word vectors. Of course the word vectors are there, learned from the original corpus. On top of that we have a rich feature space, which we build using a lot of information that comes from standard linguistic tools: the part-of-speech tags; the universal dependency relations between the words — including, for a particular word, which head word it attaches to through that dependency, and whether it is at the beginning, inside, or end of a phrase; and we have taken the verb and noun positions from the parse-tree structures. We have also utilized the WordNet hierarchy, especially because in many situations, as is evident, even nouns that are not named entities carry the relationship. For the head words we have taken into account over here whether the word is an entity, whether it is a group, whether it is a phenomenon, and so on — the hypernyms of the original word, and of its synonyms as well. That is how we make this a very informed linguistic feature space, and each one of these features is a one-hot encoding.
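As a sketch of what such a per-token input might look like: a dense word vector concatenated with one-hot encodings of the linguistic cues. The vocabularies, dimensions, and helper names below are illustrative assumptions, not the reported configuration.

import numpy as np

# Illustrative (hypothetical) vocabularies for the one-hot features.
POS      = ["NOUN", "VERB", "ADJ", "ADP", "OTHER"]
DEPREL   = ["nsubj", "obj", "obl", "root", "OTHER"]
POSITION = ["B", "I", "E"]                         # beginning/inside/end of phrase
WN_CLASS = ["entity", "group", "phenomenon", "OTHER"]

def one_hot(value, vocab):
    vec = np.zeros(len(vocab))
    vec[vocab.index(value if value in vocab else "OTHER")] = 1.0
    return vec

def token_features(word_vec, pos, deprel, position, wn_class):
    # Dense embedding + sparse linguistic cues, as described in the talk.
    return np.concatenate([
        word_vec,
        one_hot(pos, POS),
        one_hot(deprel, DEPREL),
        one_hot(position, POSITION),
        one_hot(wn_class, WN_CLASS),
    ])

emb = np.random.randn(300)                         # stand-in for a trained word vector
x = token_features(emb, "NOUN", "nsubj", "B", "phenomenon")
print(x.shape)                                     # (300 + 5 + 5 + 3 + 4,) = (317,)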
0:15:37 Finally, all this information is fed into a bidirectional LSTM. As we saw, cause-effect relationships do not follow one standard structure where the effect always follows the cause: the cause can sit well before or well after, anywhere in the sentence, and that is why we use a bidirectional LSTM. To finally get the labeling of a particular word — whether it scores as a cause, an effect, a causal connective, or non-causal — we pass the output through a set of hidden layers and finally a softmax layer, taking the label with the highest probability.
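A minimal PyTorch sketch of a tagger with this shape — a bidirectional LSTM over the per-token feature vectors, a hidden layer, and a softmax over the four labels. The layer sizes are assumptions for illustration, not the configuration used in the talk.

import torch
import torch.nn as nn

class CausalTagger(nn.Module):
    def __init__(self, feat_dim=317, hidden=128, n_labels=4):
        super().__init__()
        # Reads the sentence in both directions, as motivated above.
        self.bilstm = nn.LSTM(feat_dim, hidden, batch_first=True,
                              bidirectional=True)
        self.ff = nn.Sequential(nn.Linear(2 * hidden, hidden), nn.ReLU(),
                                nn.Linear(hidden, n_labels))

    def forward(self, x):                   # x: (batch, seq_len, feat_dim)
        h, _ = self.bilstm(x)               # per-token contextual states
        return self.ff(h).softmax(dim=-1)   # probabilities over C/E/CC/none

model = CausalTagger()
dummy = torch.randn(2, 20, 317)             # 2 sentences of 20 tokens each
print(model(dummy).shape)                   # torch.Size([2, 20, 4])

At prediction time the argmax over the last dimension gives the per-word tag.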
0:16:23 Okay, this is a sample output for a sentence: one portion of the sentence is marked as the effect, another portion is marked as the cause. Sometimes we get only the cause, or only the effect — we do not always get the cause, the effect, and the connective all correctly. And sometimes the connective may simply not be there, as we saw, because the causality can be implicit, and so on.
0:16:51 But now to our second problem. As I mentioned, extracting the causal relations was just the first part of the task; we want to use these relations to build a causal graph for industrial applications. And this is where one of the problems comes in: we have the same information expressed in different ways, in different reports, by different companies, and so on. So first we cluster — we group all the causes and all the effects. In order to build our causal graph, we could not possibly keep a separate node for every cause and effect phrase; that would give a very complex graph in which essentially the same relationship sits in the background many times over.
0:17:45 So here are some examples. You can see that all of these could potentially be grouped into what might be called a fuel ignition problem. They are all expressed differently in different reports by different users, each in their own language. They may be the same event, or actually different events — one could be for one model of a car, another for a model of another car, and so on — but they all refer to the same type of problem, the fuel ignition problem, and that is what we saw manifested in the car domain: a fuel ignition problem can also make the car stall at random, and so on — all things that happen with the car. Similarly, over here, these are all ignition switch problems, which people have expressed in multiple different ways. And these are fire risks: as I was mentioning, if there is some kind of an ignition problem, it could lead to stalling, or it could lead to the engine dying, and it could also lead to a fire. In fact, these are all from real data which has been reported.
0:19:05 For all of these causes and effects, what we wanted to do was to cluster the similar ones together. Once again we exploit the same word vectors that we have, but there were some more issues to be taken care of, so we proposed utilizing unigram and bigram word vectors — word vectors for bigrams as well. We used those to compare the unique words coming from two different cause phrases, or from two different effect phrases. And then we do standard clustering — k-means clustering — where k was determined by looking at a cluster-validity check, to see whether a particular cause or effect phrase would be better moved to another cluster. And that does help: for the particular domain that I was discussing, the recall reports, that domain came down to about twenty-one clusters over all the words and phrases.
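A small sketch of this grouping step: each cause or effect phrase is reduced to one vector (here a plain average of per-word vectors) and fed to k-means. The phrases, the random stand-in vectors, and the way k is scanned are illustrative assumptions; the talk does not pin down the exact embeddings or the validity criterion.

import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
word_vec = {}                                # stand-in word-vector lookup

def phrase_vector(phrase, dim=50):
    # Average the (here randomly initialized) vectors of the phrase's words.
    return np.mean([word_vec.setdefault(w, rng.standard_normal(dim))
                    for w in phrase.split()], axis=0)

phrases = ["fuel ignition problem", "ignition switch failure",
           "engine stalling", "car stalls at random", "fire risk"]
X = np.stack([phrase_vector(p) for p in phrases])

# Scan a few values of k and inspect a validity signal such as inertia;
# the speakers mention k was chosen with a cluster-validity check.
for k in (2, 3, 4):
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    print(k, round(km.inertia_, 2), km.labels_)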
0:20:18 And finally, when we fit all of these extracted mentions into the clusters, we get the causal graph — part of it is shown here, built over the sample phrases that I was showing. So there is an edge like: a wiring failure leads to difficulty in starting — expressed differently as "difficult to start". But we have also seen that an engine turn-off defect can lead to a fire; a wiring problem also leads to fire; a fuel ignition defect may lead to fire; and so on. So this is the graph we got from the information after clustering. And in fact there is one more element.
0:21:03 For each causal relation seen in the dataset — a cause belonging to one cluster and an effect belonging to another cluster — we give the corresponding link a weight. For the time being, a very simple reliability mechanism is what we have assigned to it; there needs to be more work to actually compute the probability properly. It simply observes how many times this cause and this effect come together, over the number of times that the cause was observed in total — that is what the edge weights represent.
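This simple reliability score — the co-occurrence count of a cause cluster and an effect cluster divided by the total count of the cause cluster — can be sketched directly; the cluster labels below are hypothetical.

from collections import Counter

# Each pair is one extracted (cause-cluster, effect-cluster) observation.
pairs = [("ignition_problem", "stalling"),
         ("ignition_problem", "fire"),
         ("ignition_problem", "stalling"),
         ("wiring_failure", "fire")]

pair_count  = Counter(pairs)
cause_count = Counter(c for c, _ in pairs)

for (c, e), n in pair_count.items():
    print(f"{c} -> {e}: reliability = {n}/{cause_count[c]}"
          f" = {n / cause_count[c]:.2f}")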
0:21:43 Based on this — as I mentioned, we had five different datasets which we had annotated — we did two types of experiments. In one, we combined all five datasets together, and whatever number of sentences we got, we divided into training, validation, and testing data, with five-fold cross-validation. In the other experiment, we trained the model with one dataset held out, and tried to see how it performed on each dataset separately.
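The two protocols can be sketched as follows; the dataset names and the train/evaluate placeholders are assumptions for illustration.

from sklearn.model_selection import KFold

# Hypothetical handles to the five annotated datasets.
datasets = {"recall": [], "semeval": [], "bbc": [],
            "ade_social": [], "drug_news": []}

# (a) Pool everything and run 5-fold cross-validation.
pooled = [s for sents in datasets.values() for s in sents]
if len(pooled) >= 5:
    for train_idx, test_idx in KFold(n_splits=5, shuffle=True,
                                     random_state=0).split(pooled):
        pass  # train on the train folds, evaluate on the held-out fold

# (b) Hold one dataset out entirely; train on the other four.
for held_out in datasets:
    train = [s for name, sents in datasets.items()
             if name != held_out for s in sents]
    test = datasets[held_out]
    # train on `train`, report scores separately on `test`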
0:22:22 So here are the results for the first setting, where we mixed up all the data. As you can see — the recall reports, BBC, et cetera — these are the performances per dataset, and these are the baselines we used: simple rules, CRFs, the plain bidirectional LSTM, and the linguistically informed bidirectional LSTM. In some cases the CRFs give better results; in most of the cases the linguistically informed model does better. The reason is fairly obvious: a CRF takes good care of named entities — the drug names and so on — and the positional features in a CRF are very good, which is what actually gives it the better performance there, over our word and semantic features. A similar thing is observed for the effect portion. Note also what happens with the connectives: the causal discourse connectives are all standard English words, which have nothing to do with drug names, domain-specific features, et cetera, so here our model gives the better performance in retrieving the causal connectives irrespective of the domain.
0:23:39 Very similar things are observed here, where, as we mentioned, one dataset is held out and the rest are used for training, and we measure performance on the held-out one. The best performance happens on the SemEval data, which has fairly clean, well-formed sentences, and BBC is also good, because BBC has good English, so most of the clean patterns carry over. But, as usual, performance is poorer on the drug data, probably because, again, there is a lot of domain specificity involved there which the model cannot learn when that dataset is not part of the training.
0:24:32 So this is what we have here for the conclusion. The first three points are what we have done, which I covered over here, and the last one is the future work: a richer characterization of events. What we see in doing this is that knowing merely that an event happened — that a car has a defect, et cetera — is just not enough to act on a particular event; for early warning, you need more of the context to actually apply it to a real scenario — who is affected, what the prior conditions were, and so on. So we are working towards a more complex categorization of events, and also on composite events: most of the time there are composite event chains, and how we characterize those in the graph is what we are trying to do. Thank you.
0:26:19 Okay.
0:26:35 Okay — so in the labeling we do not consider these issues, because the labeling is done within a sentence; it is at that level that a portion is a cause or an effect. All these issues come in when trying to build the causal graph, because what is an effect in one sentence is the cause in another.
0:27:05 Yes — okay, so that is why, if it is a complex sentence, we use OpenIE: the sentence would first be broken into its clauses, and the relations are then read out of those — E1 in one clause, C1 in another.
0:27:23 Yes.
0:28:05 I mean, definitely that is where we would like to go. Here we were looking specifically at cause and effect — the argument structure was built around cause and effect — and we were trying to keep it simpler. But we do see that, definitely, the first step would be a semantically richer set of annotations, and then building on it.
0:28:56 I think that is a fair point.
0:29:23 Yes.
0:29:31 Definitely, we need to do that. The only issue was, you know, that we wanted to restrict ourselves to a very small set of relations that were in focus here; but definitely there is so much more that can be done.