0:00:15before the final discussion i just want to give an update
0:00:18on the wochat shared
0:00:20task so we have been collecting a lot of chat-oriented dialogues collectively using
0:00:28many chat bots
0:00:29and i'd like to
0:00:32share with you some of the results we've got
0:00:36so the objectives of the shared task
0:00:39is to collect chat-oriented dialogue data that can be made available for research purposes
0:00:45so it includes human chat bot data and also human human dialogue sessions and covering
0:00:51a wide variety of chat bot technologies and approaches
0:00:54and also languages and cultural backgrounds i'm offering some japanese chat data and there are some
0:00:59other people offering chinese and also english dialogues
0:01:03and another objective is to develop
0:01:06a framework
0:01:08for the automatic evaluation of chat-oriented dialogue systems so we perform subjective evaluation of
0:01:15the chat dialogue sessions at turn level
0:01:17and we also crowdsource multiple annotations for the same utterance because it is very
0:01:22subjective
0:01:24and we are also applying
0:01:25machine learning approaches to reproduce the human annotations that is human
0:01:30subjective evaluations
0:01:31and there are
0:01:33three
0:01:34activities related to the shared task so the first one is chat data collection so we
0:01:40have both human chat bot and human human chat-oriented dialogue sessions
0:01:43and the second one is subjective evaluation so manual scoring and annotation at the turn level
0:01:50of the collected dialogue sessions
0:01:52and the third activity is targeted metrics so we use machine learning techniques to generate
0:01:58models that are
0:02:00able to automatically generate the scoring given by human
0:02:04annotators
0:02:05and our main efforts are currently still focusing on tasks one and two
0:02:12and there are four roles
0:02:14in our activity
0:02:16so you can participate in one of these roles or in more than one role so
0:02:22you can be a chat bot provider so the participant owns a chat bot engine and
0:02:27wants to provide access to it
0:02:28either by distributing a standalone version or by web access
0:02:33and the second role would be data generator the participant is someone who is willing to
0:02:39use one of these chat bots and to generate dialogues
0:02:43and the third role is data provider so you know in the industry
0:02:48it is difficult to
0:02:50sort of give access to everything so
0:02:53the participant owns
0:02:55or has access to a chat bot but they cannot provide access to it
0:02:59but
0:03:00she or he can generate data within the company or within
0:03:04the institution and provide the generated
0:03:07dialogue
0:03:08dialogues
0:03:09and the last role is data annotator so the participant is
0:03:12willing to annotate
0:03:14some of the generated and provided dialogue sessions so there are four roles
0:03:18and
0:03:19in
0:03:19so we are recruiting people
0:03:22for participating in one of those
0:03:24these roles
0:03:26and we currently have a few chat bots ready to be
0:03:30used by anyone so there are six
0:03:34chat bots joker
0:03:35iris
0:03:36pyeliza sara ticktock and so i mean
0:03:41and we have been doing some annotation
0:03:44so we are using several annotation schemes
0:03:47so this is the first one and this is used quite often by the community
0:03:51it is the appropriateness score so we have valid acceptable and invalid
0:03:58so this is a three-way annotation scheme
0:04:01and also another annotation scheme we are using is the breakdown labels scheme which is
0:04:07more focusing on the violation
0:04:09by the chat bot
0:04:10and it is a breakdown a possible breakdown or not a breakdown and in
0:04:15this annotation scheme we are using quite a lot of
0:04:19human annotators for this data for example we are using about twenty four
0:04:24to thirty people to annotate a single utterance so that we can know
0:04:30i mean we can know the distribution
0:04:33of these labels and this is because we are reflecting the subjective nature
0:04:38of
0:04:39chat-oriented dialogue systems
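the label distribution described here can be sketched in a few lines of python. this is only a minimal sketch, assuming each utterance comes with a plain list of annotator labels; the function name `label_distribution` and the example votes are hypothetical and are not taken from the shared task data or tools.

```python
from collections import Counter

# The three breakdown labels named in the talk.
LABELS = ("breakdown", "possible breakdown", "not a breakdown")


def label_distribution(annotations):
    """Return the fraction of annotators who chose each label for one utterance."""
    if not annotations:
        raise ValueError("need at least one annotation")
    unknown = set(annotations) - set(LABELS)
    if unknown:
        raise ValueError(f"unknown labels: {sorted(unknown)}")
    counts = Counter(annotations)
    total = len(annotations)
    # Labels nobody chose get a fraction of 0.0 (Counter returns 0 for missing keys).
    return {label: counts[label] / total for label in LABELS}


# Hypothetical votes from 24 annotators for a single utterance (made-up numbers).
votes = (
    ["breakdown"] * 6
    + ["possible breakdown"] * 12
    + ["not a breakdown"] * 6
)
dist = label_distribution(votes)
print(dist)
```

keeping the whole distribution rather than a single majority label is one way to preserve the disagreement between annotators that the talk attributes to the subjective nature of chat.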
0:04:41and we are also adding additional types
0:04:45as the how annotation like positive and negative
0:04:48or offensive tags
0:04:50to utterances
0:04:51those thus we are language or
0:04:53is machine
0:04:55annotations to this data
0:04:58to briefly give you the
0:05:01size
0:05:03so we have collected
0:05:05but we are still not very much succeeding in collecting a large number of dialogues
0:05:10we are having about
0:05:12six hundred dialogues so far with a
0:05:15two twenty thousand a turns and the
0:05:19and then we have over ten thousand annotations as well
0:05:24but still we are making progress
0:05:28and i'll just show you some results from
0:05:32our annotations so this is the appropriateness score distributions
0:05:36so humans are doing good but there are some invalid utterances from humans
0:05:41as well and we are not disclosing
0:05:45what the bots are but
0:05:47bots have several different tendencies and some are good and some are bad and analysing them would
0:05:52be a very interesting
0:05:53thing to do
0:05:56and also so dialogue sessions are being collected
0:06:02using iris and ticktock
0:06:03and also using this data we are
0:06:07organizing a dialogue breakdown
0:06:08detection challenge at the dialogue state tracking challenge six
0:06:12and we are also annotating other additional dialogue sessions
0:06:17in the data of iris ticktock and joker
0:06:21and there would be an appropriateness score prediction task at the next wochat workshop
0:06:27so the next
0:06:29steps
0:06:30so we want to continue promoting the shared task activities because we haven't got enough
0:06:34data and we want to improve the current chat bot it can system and we
0:06:39want to hold the
0:06:41next workshop editions and other events so
0:06:45the next
0:06:45workshop would be
0:06:47at the next iwsds
0:06:51and we have also submitted a proposal
0:06:53so that we can have many dialogues and annotations during the summer okay
0:07:00so that's about
0:07:01the
0:07:04task update and if you have any questions please ask
0:07:06now and if not we can go to the next discussion