good morning but often don't what if they make some friends

so they only overseas candidate and of all this advantage and he what shall

if i pass the mic to zero that's the civil use of course this thing

every month a like to make a couple analysis

regarding the logistic

thus

the what the right form of you have that the both the and all the

when well

this the channel will be remains one sre possible

second

the recordings of all live sessions

will be available on the button that phones in a couple of days

finally

it then they that it's a little advisable for all parties then

next week from what

this check it out

no cost of my tools you know this essay the data that it posses

okay and can you can e

yes okay

and been wanting in be not really but it

the oldest code is you know that rumpled luigi capable concluded that

i among the college of these speak at what he can really box

no i in the peace not

possible at night is from each color used incomplete you dysfunctional

and a big challenge to all what is it constantly

you also the two keynote speaker

you're one i don't of the expectation

and three on the on the original in smoking

and when i four point nine at uni marks

in of course

and he's

or the present

and all the on fifty

you all but you have three into like walks

well i'm using a program you know speech by professor i don't usually

i think you know

he on the counter abusive speaker recognition

i agree you look at the beginning recently you can't it equals okay in

at all

do you recall doesn't

we used in a preparation forty function of modern for you have

i do not logistic being that some problem

problem will be not all along so you know

non-target are used

a difficult you really do you

and you can be uncomfortable

and then you can be just you for only

and you have been doing

in a technique at most of the vocabulary

and unfortunately corners change in

in you of on the box

could be sold

people hoping to all

all email

then we give up and change you could be one nine bucks

i should have found that the local u w

i want to institute of technology

but i can't

okay

it should have been pretty for you

but you can provide you know

could you

nothing in japan

i don't know non of what was

each meeting you each of about clark

and the could not used a screen

and then we stick together unique look

i don't the kind

the and just like you don't caleb recording

and they are open it

use and you used in the original from time

i data you have many are not used in

buddies

maybe you have already a number but in

the last presentation time doing this past three

and that we have applied your presentations i

we you don't

i have not and it

apart

what is the voxel several and i mean will be impressed

i the warm and cosy on the mel-scale community

okay you to use the screen and the united together to make a low

and b

in this corner right

strings knowable

if you want to be able to

one example for the only big next year

here we used a buyer institute

you already there

i'm two hundred and one tuning

and pretty enjoyed recruiting

and you

i guess a lecture of ssm about

no like to ask them like to really gmm fishy

so it's all about the of what

the interface

it can be also can see my site

only ten year

see my site

i guess

so this is seen in ages from and i just tons and on behalf

speaker policy look at local when i think comedy

i want to mentions a few of was

i don't you have already enjoying

it's mixture and then i wonder why zero this paper so

and we have also select it to

this paper award

the moment sitting around the means of this paper would

another nine so just got fees the student a powered

so maybe

explain fast and how we chose those i was

so this fast reset acoustic source state best based on the view at school a

also recommendations from a bus

then six point it was they faced papers presentations

and then provide a school for each candidate

so as to be seen be computed it of at school

selection bananas in "'cause" a resurgent hands and unsummed

and prophesied sienna

fantastic for my that pronunciation and meeting

jean francois clustering

question of thus and the myself

i did ms four

so i valuable time so on

then

this is ugly stops a candidate for bass eight a lot

then it all expect that now we need to estimation it back test set for

into a speaker recognitions it and by

then you "'cause" the middle

big so and a battery

paso a use t matrix sign based on a speaker verification but can't

it invites on the

the unseen by shell late again

gene done ten

so you don't mind that initiation

and that in da and d you know vad in what is for speaker verification

is

it and by she s analogy

past and christian us should i'm gonna f c

and then the bad from phonetic i of at inc what representations and so that

is the and

ninety combinations and shows you named julie and so that use out of the and

thirty cutting each of

keyhole sure so i can i couldn't pronounce quality

and the last one is using monte solution fusion ups feast competition you know in

advance and three s p it and i change and one i mean g

and for me

question

and

and it's time to announce to this fun

chosen by something else

so this paper i lost all do nineteen it all expected and i mean age

estimation it but classical set fourteen "'cause" theocratic mentions it can bite then you "'cause"

male quick same and i do not be

from john hopkins university united states make a

conversations and

i four

so all assess a whole stack was that is

if so and nine want to have a few us from then it is possible

only

"'cause" you usually

yes you great how this is a daniel garcia rimmer

thanks a very much to well the comedian for select in our work i don't

with great grand island are the ugliness

but actually a based on just of there is application to jar

also one or even though this one this is actually

has had a using phones in my career and a testing data

the reason one and here it has a look too quickly so everyone are really

good is these are working

so we want to me

and thanks for the was so i had on a on and are still interacting

with other people in this like

i have to a unique opportunity really "'cause" i cannot given that we can all

the common to use you

so it's been actually

so you very much

eight and two minutes or seconds

and that we also and so maybe have his

so we decided to make that

a thank you yes this is that i i'm anyone to second one then used

to having that's great session project yesterday's

okay process is a greater than the

generated so thank you are very much

thank you much and then and i want to pass a microphones to me

and then joel to announce that this dude and i would i mean and that

you

thank you very much

i'll start by us saying if you variance about

the ward off

so yesterday i was really happy to see so many people

at and the jack godfrey and that are of evaluation benchmarking session

the tell me canyon and i curvature

with a panel of six

distinguished colleagues and ryan godfrey representing the

jack godfrey family and unashamed a hack announcing the establishment of this

ward

this

has a significant meaning to all of us

checks passions in life

included science and understanding

not only just trying to achieve the old men performance

but understanding how systems work why they work and the scientific basis for making them

perform better

so that's is passion and of course where is that all begin with student

and jack loved working with students interacting with students

i you know i had the pleasure of well working with jack for a decade

working for him for a decade in working with them for two and a half

decades

and every interaction i had with jack that involve students where there was always something

remarkable

usually we all walked away learning something new in different it might sometimes be about

cultural language

and in the students cases often lead system inspiration which we heard from teachers the

panelists yesterday and just now from a daniel garcia romano and the team at the

johns hopkins h l t c or you where john jack godfrey we used to

work and

with great passion

so and fortunately the recording of the session is available and encased anybody miss that

i encourage you to watch it just see the context for this award and now

let me handed over to tell me canyon

okay so are plans for is a group or more or two

to do this system and a little used in every room is present a real

room from

conversely some of the convolutional will remember from source to

so actually the performance will remember performance was miserable so it doesn't really underscores emotional

armour from everyone used as a front on a proposal on of on one produced

from there is serious i'm not so the military imprisonment this is very differently

personally on a remote we also a good as cousins one thousand two will be

the this is not a scientific principle this remote some of them with remotes

could be able to probably on a list of learning more from there is one

very from here on a remote

so it's really one or several that's over there is a process a three presents

the number of everyone are four hundred and one i'm also a not particularly good

remote simmons so i'm trying to because i think you two remote controls

so we are to go members your be or from using different paper on the

one room b alluded paper

then we are rows one on a similar recall removal and round robin the most

popular "'cause" we're going on one remote but underlings possibly remember system

a and b o one is meaningless from the was

one of these are drawn to the three gram model and language native speaker notices

and using speaker role improvements on

okay sorry we're and no one of the remainder of propose a good proposed remote

this the well known as remember me probably more generally almost improvement is on a

reporter no never remembered hundred and possibly the roses and

so we can probably on if the authors on recently or maybe they can serve

reversals

you

yes

okay

thank you and lee

i nice to operate yes

we are going to be the mean like i think or you another one because

organisers to their nonweighted maker so they don't similar like or not

okay so that it so that

and that simply

and

i very you read on

and congratulations on

i was you or something and joy

and

and then to everybody and thank you also to jack three

yes thank you

and then this is that and of the old fast in many

can you can move onto the next time was decisions

sat they still only

thank you

right

okay the dataset one is you would like to design a certificate of recognition to

a loaded or speakers

so i with this and this one not in any of that

so clear that this kinda stuff t and they ist japan

to those speakers of the topics

the what's their opinion on the shin speech interface the that is and t and

this and was taking

the value of terrible with the results of that

judges filters

and the topics neural speech recognition

miss the set at time either

where is that japan to toss features on the topic was speech recognition

of the civil

which is the most informative japan to their speakers and the topics

we will statistical parametric speech synthesis

the that massively journal of this for your conference to their speakers on the topics

and these fools the in automatic speaker recognition

last i don't is

note that you're and within well known as the of technology to their speakers on

the topics

as speaker recognition why when and how to do

okay now once again by and idea

to give us a summary

for the obviously didn't the option

like this

i everybody conventional slice

okay so capable over some of the highlights all these work your

so overlappers okay and organisers or the one amazing you know given the sequence of

all the whole weight and you got have to do that whilst at the beginning

named and you have proposed a new data to get a chance completely the impression

and make it remotely so really what one and about my heart and everybody here

we did you have been successful

with emotion

so this is this motion vectors these tutorials that was not are only in the

final step earlier estimate role give us some really nice learning and speech and bindings

have given by flavour of that

let's i believe that really less than one and of course of calibration you know

and everybody to death so that's what i don't remember that

we also have the menu or something was privacy issues of training

we should but this what we see mentioned it so i'm sorry of the five

are also one but i would probably do what don and women

you know

consists in a remote just fine you are not at all

so far as we have about eight to seven based within decision obviously duration itself

is the main over and plastic a language based on an easy life are able

we have corpus regularization

most moving in other measures in we have a special session was a lot lately

voice conversion set is this

you know there are nice this and evaluation benchmark in you know only a nist

is used in video audio and video so work about it you know there are

also one about speech applications you know the overall you know isr annotations

and also have a nice class yesterday memorial of jack three on the one friend

and the more in this

so maybe you can imagine working

so first of all you know for this little sister this is a very quickly

so the precision at all possible future the best are war by danielle and an

n-gram so congratulation

so this paper is about how to learn you know mismatch duration by nor and

its car i don't like at the end of it was there you know

inside and it was actually complementary colours also results are so we go back to

what one and one video

we also have proposed a extractor on a which you know

which

one of the best free best and or more so on their solution and you

know that this paper because and are already some uncertainty in your statistically conditions not

like just one year some as in the around how much you're gmm estimation

this for source on that are not a clustering of them or something and variance

you know we also have some supervised training well model the baffle you know and

they show that the future that extracts from this networks or you know about integer

linear programming and of this is more speaker and then used a list you see

what else that's is you know whether a speaker recognition and there's over the of

you know

that's thing different embedding is willing did not improve and robustness of duration you know

that's important so even for the analysis you know can that and you know people

are really trying to so it is the prior to the problem of duration mismatch

and in the bn nn framework

class was the least we now that is able to for the nn so is

still there so far were explored in this

this is one of the feature for this new error vad learning can then be

seen speaker recognition language

score fusion and success housing in extraposed a cinematic space not in the feature really

i don't cts mismatch between training and about how you that

i don't know in addition to that can sell you know the you know classifier

itself does also the topic

would have that it diarization you know the suspicion diarization and one of the domain

mismatch a nice feature going back to channel

and can see that is still problem is how to and whitening the there is

this is how far

and so even when you use the state-of-the-art speaker embedding you know and a speaker

model and you learning is the problem is you don't want it on a mishmash

so

so be released to look into that improving clustering you know you know and we

have little or paper that you know that are sensitive to parameters in validation by

one or whatever was in dover algorithm which perform effective wadding across various additional business

i think was andreas is d

are in this is so

that was to i was information to this basic iteration and working with limited vision

be were there are more

by the you know minimum cost someone their own name

so you know there was information alarm area under different wainwright unit somewhere it happens

people say you know it's to me and mine english you know so that's a

special one so i think is permissible then with a single curve routine that there

is you can also how to separate considering the same tree

that was a good thing is the always this is kind of you know information

a compromise or you know that it is the first one there is a lot

of you know and as a state-of-the-art on nist through to that selecting challenge you

know frequency masking locking also "'cause" i lost motivation for this task also seem to

be helpful

refers to learn on them automatic speech recognition of the matter whether and how diverse

on automatic speaker recognition using feedback control was conversation and finland water

so there are very the ml based you know that you know i'm here is

giving rise to go and one of the meeting you missed and that's the beauty

imponderable we're gonna was done by and you know used and then i and watch

speakers of than going to move you know

so a very nice thought a system that the voice twenty two which was especially

association there was a feature to do speaker recognition in far field you know sixteen

and to focus lost

you can everyone

but you know there

i was literally microphone channel you know i'm so that was also a this is

one of the speaker really in the future so we have to learn how to

do it is as we find that are involved in unison everything but if you

don't of the progress there

we made use progress and now this is a new area where we need one

and how humans or a is the thing

voice processes that is you know and within a very of papers about using spectral

what the variance along rather well against an all us i guess for singing voice

conversion a nice interesting to see he's on your own voice test you know

and i will assume how can sing beautifully you know

so please "'cause" this paper is

yes so this is a

evaluation benchmarking its colour

features the nist speaker recognition evaluation and here s the you know it shows that

the nist two thousand and eighteen a lie detector machine was a very good success

with other features you audio and visual a visual and visualise at or images

and we well as to combine two and here is that we show under different

paper presented their you know combine still that how long as those though and also

all have already spoofing you can measure you know make the design is just and

right column

that was really good session and but personally i was really one global people don't

you know it's features along case you know you know what is mainly

then post it is given versus those different model more robust models are explore you

know and you know basic task was to have a lot of the goal in

mismatched condition like language you know like that so this is due to these features

only shift for a year

another you know statistical thing on the matter you know and here is to think

i like you know and careful you know carefully if you know

if you have the notation so we have to be careful how the segmentation because

one can have or physical access to not have access

so to have one a carefully designed for a human a result and carefully designed

to assess to require in order to buy something recognition so that those design are

actually one that's longer than there is the one going

so you nist speaker recognition you know calibration in the in the back end issues

with an additional across various condition where reducing or side-information and then there's gender dependent

condition integration stage where side information is learned jointly what the rest of the model

all z

an operating base where into that you are we expect or a speaker locations us

to do re there was just considerably also interesting

you see maybe too optimistic irrigation work well over multiple application scenarios you know and

over different or operating point it is optimized parameter of a square mile manhattan distance

metric to maximize the parallel versus area under the raw

quicker or interesting rest of a false positive rate does not ignore the biggest or

weighted sort of this paper from all

okay

and use it for a set of training from of to learn speaker and i

think that could disconnect speaker but it's invariant to environment you know something really you

know how diverse a training is coming over complete having lots of this new image

processing

well one

in this a new problem we should maybe more

and if you is just as in the audience doesn't last long as are you

can be efficiently get should work on that

so other you know so this is specification on no and over a lot of

combined model a baseline forensic speaker comparison assessing shot communications speaker detection the while you're

all alone

combining speaker or and that in and one of the information no this also nist

all home and no not try to combine you know in the same embedding it

is sometimes useful

you know that this is another speaker recognition

no decision and we have personal see that some interesting that's gonna period well i'm

not wrong from the chime challenge you know are defined as this is you know

about the speaker maybe a four pairs and how many conditions of the speaker and

speaker

and so we have speech data with expression and how i-vectors speaker recognition not telephone

number one or back while they're number one and in the expansion

by a sufficient has to

and still the lid system in a single there on the nn framework using a

question that will also good as you know the far-field task maybe shares

analysis is of the for unit is just a that's also missing them in the

second one is

that's so i would like to minimize to this as a null and j and

have you of all this together and you know

so i didn't ask whistle

thirty s and enjoy the

well this animal of this you all animals used in the next one easy for

me

i say that it for the precise and to the for a summary which compress

but i comparison rate was able to the ten minutes

thank you

okay so now i think well as it were quite nicely

of this a ninety then doing so now we have on the to have

what is the one function wireless e

we will bring us to use or what

the see what will happen as in of this a and b and it'll

so of this along with

it is too much

one and two well known you to harness t and we were or whether a

onto other to present something about but unlike the only thing to

and you don't into and it would be having made in china

and this will be job of annihilation fitting problem to one university shot anyway you

using an annuity a simple heuristic

and this is i of our strongly indian problem for what was this

and so this is a slight problem was that are correctly

a remote i looking at different screen

no

that this sharing because i v

the screen for

we have all universities can you see

no we have is that the first six

is doing the first page

i think i think we mentioned a different screen

the presentation more and

so how the writer

yes now we have a state k

okay i don't know used for screenwriter

so this is that okay

okay cool

so this is of course this like target and this is the over a very

indian

and here sure will have a union paging the

time will be at

was so small

is the u

is it but we haven't drawing a went you clustering to and people is you

mean round tuesday into writing

and the two in this study that no in between but you okay

and in most of time as jenny the it will where will disappear at a

higher so

paging the utterance of reasoning where ut are that a lot of well writings that

you can really try to a or a low wall that we know the problem

we probably will range so that a volunteer is to have transportation

and well basically no to that but probably not decided yet o one is just

a it says that you one investing and study will be willing to the us

quite spacious or a percentage but it is that what would

and another one is

it's a all morning is called my it is there a but besides discrete well

but it sort of you don't well problem of the c but with you can

read a asked to be there but we will use that with where go

and proposed seconds one of the hair cells of trying to

try to make some other social activities and this is really a this is quite

that a patient style of china the cost the we are there are is quite

and also

this is the two of us the will be included about something interesting places are

but you see here

it's o a work okay ageing of this you to twenty two that much

thank you thank you for so long data for what to us

for now

so

of all this that and

this

this feedback allow writing papers

and okay we're research thus by davis

okay i'll

okay

okay so no i would like to

that the my two so my only

okay the we the

the sound a physician those tools on sign

the noise okay a wider this is only around this is because there was no

problem

and a local almost university

like to thank you for what was accomplished

it's also

can be casting committees and a score and looking

a secular match and all

okay i have been working a file together with

scoring can work with a single that conditional that

my main task one memorable asks for u s a e

i s and the

it's policy so let me talk about these

so hostile a me mission our sponsors

that's a really is my face that we can we could have a these scores

into one is a very

lc highness final set condition

small a very we have we could the whole far was almost corresponds to a

free the free registration and the

and some possible a very special fee all also as can be funny to a

saul's

because of a classic sponsors an easy one c i think

google can extend okay sequences

so we would like to express some a signal x two source also

somewhere ones

and then we also show the say thanks to our local team members

okay we have to have a big change is due to cope with nineteen

our movements this local team members

well we basically and

a no matter what is a or assignment and the have also be how the

crystal

so mm a rematch okay everyone a few minutes

and okay thank you very much to all the participants

and the

an ask possible command was the a squad steering committees

a whole we can see okay home base to us to use a

thank you

okay

how about you by all the local can be given about the penalties and

so if there was a good idea and i'm not sure how who k u

then we began phone i

okay this will use the reset button

i think about this to the logan

okay this is

i

a somewhat so i can say that it's a

okay as it got this

corpus on all so

some people have left

when there you have all this

i

where n is

so costs on and basically so the

this six phone videos if you

hello services and

no

also presents

in a

well

right

for

i think the l off

cool

usefully

it's l

but i guess online for the

well can i think you

now we will starting from we has at all

the at enough that we get a b

and

with that of the solution this is physical conference i would say that is then

matthew nice

well it's transformed into average of all

so

to have a

all of you

it'll

still with us

before you

it was i in the ones

well bands

and allow things

so there is a much

can some useful once

okay a model

a daily that of things the about this that and all those as well

and this leads us

the doors because there is because

says that

voice

wife and all those we have a bus the may have anything

possible

it just so much

i'll a seal

okay the reference the open

but i was i was able to yesterday the

painting

in obviously the detected

i guess unless

okay

what we

so i mean and thirty minutes maybe i should say that before you say

i think i this is phase okay

we

he's a half period was given this could be made

i receive a lot emails and everything i is a is this to this is

all value so that a

well it's nice it's nice to

the right you know

it newspaper selection

so psyched of image thinking

s

exactly z

okay bye

the i

is this is done using a than any other ideas

and my family of the search string

they still

a u s

thank you and

thanks for the wonderful work i couldn't ask for a better team to work with

and

all these you know last minute changes you will handle them with such extreme politeness

in great i'm so grateful thank you

thank you

yes the right number is and us

so it is

so you fool

cool

okay sufficiently close

yes that's the thing

okay different amendments eight hundred and of is that

okay