|
|||
|
|
|
|
|
|||
|
00:00 |
(Beginning of video)
|
|
|
|||
|
00:01 |
Hi everyone, I'm going to talk about the sixth and final challenge.
|
|
|
|||
|
00:08 |
We have looked at many of the technical challenges
|
|
|
|||
|
00:11 |
In detail.
|
|
|
|||
|
00:12 |
Quantification revisits all of these previous challenges.
|
|
|
|||
|
00:18 |
It aims to perform empirical and theoretical study to better understand them.
|
|
|
|||
|
00:26 |
This challenge should be studied in the context of all the other challenges.
|
|
|
|||
|
00:36 |
And this case the boys want to fight all of them.
|
|
|
|||
|
00:47 |
Recall from the introduction
|
|
|
|||
|
00:51 |
the dimensions of heterogeneity between modalities: structure, representation space, information content, noise topologies, and relevance to the task.
|
|
|
|||
|
01:01 |
The goal is to gain a deeper understanding of each dimension of heterogeneity in context.
|
|
|
|||
|
01:08 |
So this is Brianna study topic.
|
|
|
|||
|
01:14 |
How to say hello in Canadian multimodal.
|
|
|
|||
|
01:18 |
So one big part is that of modality biases.
|
|
|
|||
|
01:21 |
When training with multimodal data, sometimes one modality comes to dominate.
|
|
|
|||
|
01:32 |
This is related to the fact that different modalities have different information content
|
|
|
|||
|
01:37 |
and different relevance to the task.
|
|
|
|||
|
01:40 |
One example of this is with typical questions collected from the internet.
|
|
|
|||
|
01:48 |
Take a question like "what color is the banana?"
|
|
|
|||
|
01:54 |
Because of dataset bias, suppose 80%
|
|
|
|||
|
01:57 |
of banana images are yellow. You are then going to see that 80% of these questions are answered "yellow".
|
|
|
|||
|
02:06 |
When the model then sees a real image with a green banana, it is still going to answer "yellow".
|
|
|
|||
|
02:12 |
Biases.
|
|
|
|||
|
02:16 |
There are quite a few approaches for tackling this; this discussion is by no means comprehensive.
|
|
|
|||
|
02:22 |
One approach is pretty simple: balance the data.
|
|
|
|||
|
02:29 |
If the question is "is the umbrella upside down?" and most of the time the answer is "yes", you can also add more examples
|
|
|
|||
|
02:35 |
where the umbrella is not upside down.
|
|
|
|||
|
02:38 |
That is balancing the dataset and the distribution of the other possible answers.
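
One common way to approximate this kind of balancing without collecting new examples is to reweight the existing ones. The sketch below is only an illustration under that assumption: the function name `balanced_sample_weights` and the toy answer list are hypothetical and do not come from the lecture.

```python
from collections import Counter


def balanced_sample_weights(answers):
    """Return one weight per example so each distinct answer has equal total weight."""
    counts = Counter(answers)
    num_answers = len(counts)
    total = len(answers)
    # Inverse-frequency weighting: rare answers get proportionally larger weights.
    return [total / (num_answers * counts[a]) for a in answers]


# Toy data mirroring the banana example: 80% "yellow", 20% "green".
answers = ["yellow"] * 8 + ["green"] * 2
weights = balanced_sample_weights(answers)
```

With these weights, the "yellow" and "green" examples contribute the same total amount to a weighted loss, so always answering the majority color is no longer rewarded.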
|
|
|
|||
|
02:51 |
Another way is to balance training.
|
|
|
|||
|
02:54 |
There are many ways of balancing the training objective so that the model is not relying simply on biases.
|
|
|
|||
|
03:07 |
So far, this just gets the colors of bananas wrong sometimes.
|
|
|
|||
|
03:15 |
But sometimes this can also cause more dire consequences, for example when it comes to gender bias.
|
|
|
|||
|
03:23 |
Modality biases.
|
|
|
|||
|
03:25 |
One example is when you are trying to do image captioning.
|
|
|
|||
|
03:29 |
You see this image of a woman along with a computer.
|
|
|
|||
|
03:33 |
Some of the image caption.
|
|
|
|||
|
03:42 |
Attaching gender bias.
|
|
|
|||
|
03:44 |
Between image image.
|
|
|
|||
|
03:49 |
Ideally, a model should be right for the right reasons when predicting "a woman sitting in front of a computer".
|
|
|
|||
|
03:58 |
There are many of these biases that have been found, but this is still not comprehensive.
|
|
|
|||
|
04:08 |
There are also biases that come from unimodal models.
|
|
|
|||
|
04:15 |
So this is basically.
|
|
|
|||
|
04:17 |
If you look at this image-and-text task, you can ask what the person is carrying.
|
|
|
|||
|
04:23 |
The model looks at the image, identifies that it is a woman, and starts to assign a higher probability to carrying a purse instead of a briefcase.
|
|
|
|||
|
04:31 |
I want to park it is founded.
|
|
|
|||
|
04:33 |
You can measure these biases with just the language: "the person is carrying ...".
|
|
|
|||
|
04:38 |
We can also measure these biases in Aquatica..
|
|
|
|||
|
04:44 |
And you find that having the image makes it worse: adding visual information to these social biases actually makes the model more confident in reinforcing these stereotypes.
|
|
|
|||
|
04:55 |
These are biases that result from
|
|
|
|||
|
04:57 |
cross-modal interactions.
|
|
|
|||
|
05:00 |
So that covers modality biases and social biases, as part of having a better understanding of each dimension of heterogeneity.
|
|
|
|||
|
05:12 |
Another dimension of heterogeneity is the fact that these modalities can each have different noise.
|
|
|
|||
|
05:20 |
So then how can we better measure whether our models are robust to these noise topologies and imperfections?
|
|
|
|||
|
05:28 |
The one contribution.
|
|
|
|||
|
05:31 |
was to first identify different ways that the data can be imperfect.
|
|
|
|||
|
05:36 |
So far he's a greedy looking at the modality specific than domain specific ways in which noise.
|
|
|
|||
|
05:49 |
Contiguous.
|
|
|
|||
|
05:56 |
Another type of imperfection can occur
|
|
|
|||
|
06:05 |
if you have coordinated modalities across the time dimension:
|
|
|
|||
|
06:08 |
having all modalities missing across that chunk of time.
|
|
|
|||
|
06:13 |
So this paper essentially contributed all of these tests for checking whether models are robust in the presence of these modality-specific or multimodal imperfections.
|
|
|
|||
|
06:24 |
The key takeaway, and it shouldn't be surprising, is that there is a very strong tradeoff
|
|
|
|||
|
06:36 |
in the presence of these imperfections.
|
|
|
|||
|
06:40 |
Looking at this, there are some categories of approaches for making sure that your models are more robust.
|
|
|
|||
|
06:46 |
One general way is to introduce
|
|
|
|||
|
06:49 |
these imperfections during training. Normally, I would train with my modalities assuming they are perfect.
|
|
|
|||
|
06:58 |
What if I also drop modalities individually and make sure the model is still robust in its predictions?
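
Dropping modalities during training can be sketched as a small augmentation step. This is a minimal sketch under stated assumptions, not the speaker's implementation: the function name `drop_modalities`, the `drop_prob` parameter, and the dictionary layout of a sample are all hypothetical.

```python
import random


def drop_modalities(sample, drop_prob=0.3, rng=random):
    """Randomly replace whole modalities with None, always keeping at least one.

    `sample` maps a modality name to its features (hypothetical layout).
    """
    names = list(sample)
    kept = [name for name in names if rng.random() >= drop_prob]
    if not kept:
        # Never drop everything: keep one modality chosen at random.
        kept = [rng.choice(names)]
    return {name: (sample[name] if name in kept else None) for name in names}


rng = random.Random(0)
sample = {"audio": [0.1, 0.2], "video": [0.3], "text": ["hello"]}
augmented = drop_modalities(sample, drop_prob=0.5, rng=rng)
```

A training loop would call this on each example before the forward pass, so the model is repeatedly exposed to missing inputs and has to keep predicting from whatever remains.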
|
|
|
|||
|
07:10 |
This falls under the category of robust data and training: the more exposure these models have to these imperfections during training, the more robust they become.
|
|
|
|||
|
07:21 |
Another category of methods infers the missing information,
|
|
|
|||
|
07:25 |
for example by learning a translation model or by learning a joint probabilistic model.
|
|
|
|||
|
07:37 |
And again, I'm not going into detail here; these are references you can look at
|
|
|
|||
|
07:42 |
for cases in robustness.
|
|
|
|||
|
07:49 |
So that's just a glimpse of different types of heterogeneity.
|
|
|
|||
|
08:04 |
Another big sub-challenge is to get a better understanding of the cross-modal interactions.
|
|
|
|||
|
08:10 |
Recall cross-modal interactions and the concept of connections.
|
|
|
|||
|
08:17 |
Connections include correspondences, referring to the same concept, and dependencies between concepts.
|
|
|
|||
|
08:24 |
Cross-modal interactions look at
|
|
|
|||
|
08:26 |
how the interactions result from the connections between these elements.
|
|
|
|||
|
08:32 |
There are many, many definitions of cross-modal interactions.
|
|
|
|||
|
08:35 |
Again, how can we take a multimodal dataset
|
|
|
|||
|
08:39 |
and quantify the presence and type of cross-modal interactions
|
|
|
|||
|
08:44 |
that are present?
|
|
|
|||
|
08:48 |
This is a challenge; here are some glimpses into possible approaches.
|
|
|
|||
|
08:54 |
One big area of work builds upon the idea
|
|
|
|||
|
09:01 |
that a function with cross-modal interactions cannot be decomposed into a sum of unimodal functions acting on the individual features.
|
|
|
|||
|
09:11 |
To give a glimpse of how this can help: we can take an arbitrary function f, perhaps a learned one, decompose it into unimodal sub-functions, and look at the residual. The residual μ essentially measures the amount of cross-modal interactions that exist in this model.
|
|
|
|||
|
09:35 |
So if the additive model is equal to the nonlinear fusion model,
|
|
|
|||
|
09:41 |
then non-additive interactions are not modeled; otherwise, μ captures the overall quantity of the cross-modal interactions.
|
|
|
|||
|
09:49 |
August 20th 3 tomorrow.
|
|
|
|||
|
09:52 |
This is a good sanity check: it gives you a sense of the overall quantity of the cross-modal interactions.
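
The additive sanity check can be sketched on a small grid of inputs. This is a minimal illustration, assuming scalar features and a uniform grid over all pairings; the function name `additive_projection_residual` is hypothetical, and the projection used here is the standard least-squares additive fit (row mean plus column mean minus grand mean), which may differ in detail from the method shown in the lecture.

```python
def additive_projection_residual(f, xs_a, xs_b):
    """Mean squared gap between f and its best additive approximation on a grid."""
    table = [[f(a, b) for b in xs_b] for a in xs_a]
    n_a, n_b = len(xs_a), len(xs_b)
    grand = sum(sum(row) for row in table) / (n_a * n_b)
    # Unimodal parts: average out the other modality.
    row_mean = [sum(row) / n_b for row in table]
    col_mean = [sum(table[i][j] for i in range(n_a)) / n_a for j in range(n_b)]
    squared_gap = 0.0
    for i in range(n_a):
        for j in range(n_b):
            additive = row_mean[i] + col_mean[j] - grand
            squared_gap += (table[i][j] - additive) ** 2
    return squared_gap / (n_a * n_b)


grid = [-1.0, 0.0, 1.0]
additive_score = additive_projection_residual(lambda a, b: 2 * a + 3 * b, grid, grid)
interaction_score = additive_projection_residual(lambda a, b: a * b, grid, grid)
```

A purely additive function leaves essentially no residual, while the multiplicative function leaves a clearly positive one, which is the signal that something beyond unimodal contributions is being modeled.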
|
|
|
|||
|
10:10 |
What is going to go with you.
|
|
|
|||
|
10:12 |
Instead of just the overall presence, can we go further and look at the individual cross-modal interactions?
|
|
|
|||
|
10:19 |
The one insight that we have is that if
|
|
|
|||
|
10:22 |
a function captures interactions
|
|
|
|||
|
10:25 |
beyond the additive interactions, then the second derivative with respect to x, your first feature, and your second feature is nonzero.
|
|
|
|||
|
10:36 |
That is a non-additive interaction.
|
|
|
|||
|
10:38 |
So given this definition,
|
|
|
|||
|
10:40 |
it inspires a natural second-order extension of gradient-based approaches.
|
|
|
|||
|
10:46 |
If I have a task such as CLEVR,
|
|
|
|||
|
10:52 |
I can take the output of my model,
|
|
|
|||
|
10:54 |
take a first derivative with respect to some parts of
|
|
|
|||
|
10:58 |
the question,
|
|
|
|||
|
11:00 |
and then take a second derivative with respect to the second modality.
|
|
|
|||
|
11:03 |
So that should then tell me the parts of the image that correspond with those parts of the question.
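
The second-derivative test can be illustrated on toy functions of two scalar features. This sketch uses central finite differences as a stand-in for automatic differentiation through a real model; the function name `mixed_partial`, the step size `h`, and the two toy functions are assumptions chosen for illustration.

```python
import math


def mixed_partial(f, x, y, h=1e-3):
    """Approximate the mixed second derivative of f with respect to x and y."""
    return (
        f(x + h, y + h) - f(x + h, y - h) - f(x - h, y + h) + f(x - h, y - h)
    ) / (4.0 * h * h)


def additive(x, y):
    # Each feature contributes on its own: no interaction between x and y.
    return math.sin(x) + y ** 2


def multiplicative(x, y):
    # The effect of x depends on y: a non-additive interaction.
    return x * y


no_interaction = mixed_partial(additive, 0.5, 2.0)
interaction = mixed_partial(multiplicative, 0.5, 2.0)
```

The additive function gives a mixed derivative of about zero, while the product gives about one. In the multimodal setting, x would be a part of the question and y a part of the image, and large values mark the pairs that the model treats jointly.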
|
|
|
|||
|
11:10 |
And this method works pretty well: you can see that the CLEVR model does attend to the tiny yellow shiny object
|
|
|
|||
|
11:20 |
as the correspondence for the words "tiny yellow triangle".
|
|
|
|||
|
11:25 |
And likewise you can do that for other examples.
|
|
|
|||
|
11:27 |
In another example, the model attends to the word "birds" as a second-order interaction, and likewise for "three small".
|
|
|
|||
|
11:35 |
These are ways of identifying these corresponding cross-modal interactions at a very specific level.
|
|
|
|||
|
11:48 |
One interesting thing is that you can also do this for modalities that don't have corresponding but rather complementary interactions.
|
|
|
|||
|
12:01 |
You can see that if you take the first derivative with respect to the words,
|
|
|
|||
|
12:06 |
this can also capture the second-order interaction with the movement of the speaker when saying those things.
|
|
|
|||
|
12:15 |
This can also help us identify certain complementary cross-modal interactions.
|
|
|
|||
|
12:25 |
There are other ways of capturing these cross-modal interactions beyond this one. So this paper, called
|
|
|
|||
|
12:35 |
She said she taking them all.
|
|
|
|||
|
12:37 |
I'm taking a message to Joshua tomorrow in Port Arthur.
|
|
|
|||
|
12:46 |
List any time using any.
|
|
|
|||
|
12:48 |
Una Moto interpretation Mass Effect.
|
|
|
|||
|
12:53 |
I think a sergeant Gonzalez is on these cross-border interactions.
|
|
|
|||
|
12:56 |
Is there a site was that.
|
|
|
|||
|
12:58 |
Based on these influences: if the influence of one modality is much larger than the other,
|
|
|
|||
|
13:06 |
that is dominance.
|
|
|
|||
|
13:09 |
If the magnitudes of both are similar and they point in the same direction, that is a complementary interaction.
|
|
|
|||
|
13:16 |
And if the magnitudes are similar but they point in opposite directions, that is a conflict.
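
The three cases just described can be written as a small rule over two signed influence scores. This is only a sketch of the rule as stated in the talk: the function name `categorize_interaction`, the `ratio` threshold of 3, and the label strings are hypothetical choices, not values taken from the paper.

```python
def categorize_interaction(influence_a, influence_b, ratio=3.0):
    """Label a pair of signed unimodal influences on the prediction."""
    mag_a, mag_b = abs(influence_a), abs(influence_b)
    if mag_a == 0.0 and mag_b == 0.0:
        return "none"  # neither modality influences the prediction
    # One influence is much larger than the other.
    if max(mag_a, mag_b) > ratio * min(mag_a, mag_b):
        return "dominance"
    # Similar magnitudes: the direction decides the category.
    if influence_a * influence_b > 0:
        return "complement"
    return "conflict"


labels = [
    categorize_interaction(0.9, 0.1),
    categorize_interaction(0.5, 0.4),
    categorize_interaction(0.5, -0.4),
]
```

How "much larger" and "similar" are defined is a design choice; the single `ratio` threshold here is just the simplest way to separate the two regimes.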
|
|
|
|||
|
13:22 |
That's one way of categorizing these cross-modal interactions proposed by this approach.
|
|
|
|||
|
13:28 |
And why do you think things are found.
|
|
|
|||
|
13:31 |
Facts about the language.
|
|
|
|||
|
13:44 |
Buddy Chester.
|
|
|
|||
|
13:49 |
They also have a very cool visualization website.
|
|
|
|||
|
13:52 |
It is publicly available and allows you to look at videos and the interactions that exist in these videos.
|
|
|
|||
|
14:06 |
These interpretation approaches are mostly modality-specific and model-specific.
|
|
|
|||
|
14:15 |
If you have a multimodal Transformer, this recent paper also proposes an interpretation method for multimodal vision-language Transformers.
|
|
|
|||
|
14:27 |
Essentially, you can take the self-attention across your concatenated data, and that can be broken down.
|
|
|
|||
|
14:37 |
There is vision-to-vision attention, which shows you unimodal image importance,
|
|
|
|||
|
14:40 |
and language-to-language attention, which shows you importance in your text.
|
|
|
|||
|
14:46 |
Legacy that is indeed just a little.
|
|
|
|||
|
14:51 |
For text importance, "surfer" and "riding a wave" are highlighted in the text.
|
|
|
|||
|
14:57 |
You can also go deeper and look at the other two options, such as language-to-vision attention.
|
|
|
|||
|
15:08 |
So this highlights the correspondence and complementary interactions across image and text.
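
Breaking a joint attention map into its unimodal and cross-modal parts can be sketched as plain slicing. The sketch assumes that text tokens come first in the concatenated sequence and that rows are queries and columns are keys; both are assumptions for illustration, as are the function name `split_attention` and the toy matrix.

```python
def split_attention(attn, num_text):
    """Split a square attention matrix over [text tokens, image tokens] into four blocks."""
    text = range(num_text)
    image = range(num_text, len(attn))

    def block(rows, cols):
        return [[attn[i][j] for j in cols] for i in rows]

    return {
        "language_to_language": block(text, text),
        "language_to_vision": block(text, image),
        "vision_to_language": block(image, text),
        "vision_to_vision": block(image, image),
    }


# Toy attention over 2 text tokens followed by 1 image patch.
attn = [
    [0.6, 0.3, 0.1],
    [0.2, 0.5, 0.3],
    [0.4, 0.4, 0.2],
]
blocks = split_attention(attn, num_text=2)
```

The two diagonal blocks give the unimodal importance views, and the two off-diagonal blocks are where the cross-modal correspondences show up.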
|
|
|
|||
|
15:18 |
So again, this is a cool interactive website.
|
|
|
|||
|
15:21 |
You can check it out if you want to play around with these models and interpretation methods.
|
|
|
|||
|
15:31 |
So one very big challenge: these are just a glimpse of the approaches for interpreting these multimodal models,
|
|
|
|||
|
15:45 |
but we typically don't have a ground truth
|
|
|
|||
|
15:48 |
for the interpretation.
|
|
|
|||
|
15:50 |
One simple idea is model simulation.
|
|
|
|||
|
15:58 |
You run the interpretation and then give it to a human.
|
|
|
|||
|
16:00 |
Given this evidence, can a human then judge what the model predicts,
|
|
|
|||
|
16:04 |
and would this be in line with what the model actually predicts?
|
|
|
|||
|
16:08 |
On one hand, if the model was fully black box and you didn't do any interpretation,
|
|
|
|||
|
16:14 |
you could hardly recover what the model predicts.
|
|
|
|||
|
16:17 |
But if your interpretation approach is completely faithful to the model, then you should be able to recover it.
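
Model simulation can be scored as the fraction of examples where the human's guess matches the model's actual prediction. This is a minimal sketch with made-up labels; the function name `simulation_accuracy` is hypothetical.

```python
def simulation_accuracy(human_guesses, model_predictions):
    """Fraction of examples where a human correctly guessed the model's prediction."""
    if len(human_guesses) != len(model_predictions):
        raise ValueError("guesses and predictions must have the same length")
    matches = sum(h == m for h, m in zip(human_guesses, model_predictions))
    return matches / len(model_predictions)


# Guesses made with and without access to the interpretation (toy data).
model_predictions = ["yes", "no", "yes", "yes"]
without_explanations = ["no", "no", "no", "yes"]
with_explanations = ["yes", "no", "yes", "no"]

baseline = simulation_accuracy(without_explanations, model_predictions)
explained = simulation_accuracy(with_explanations, model_predictions)
```

Comparing the two scores gives a rough signal of whether the interpretation helps a human recover the model's behavior. Note that the target is the model's prediction, not the correct label.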
|
|
|
|||
|
16:30 |
Another idea is to be more practical and look at useful ways to modify the model.
|
|
|
|||
|
16:34 |
Again, can you take a black-box model,
|
|
|
|||
|
16:36 |
run some interpretations, and obtain some evidence that the model is using to learn?
|
|
|
|||
|
16:41 |
Give that to a human.
|
|
|
|||
|
16:52 |
The human can then use it to fix bugs and eventually improve measures of performance.
|
|
|
|||
|
16:59 |
But again, there are many open challenges here.
|
|
|
|||
|
17:04 |
Is the explanation actually faithful to the model?
|
|
|
|||
|
17:07 |
Is the explanation actually useful? Does this explanation help humans?
|
|
|
|||
|
17:12 |
Hypersexuality.
|
|
|
|||
|
17:17 |
One of the issues is that there are also disagreements: you can run the same model with different
|
|
|
|||
|
17:21 |
interpretation methods and get different explanations, so how do you actually resolve the disagreement?
|
|
|
|||
|
17:29 |
All of this should be studied in the context of how we evaluate these interpretations.
|
|
|
|||
|
17:40 |
Another big challenge is that of the multimodal learning process.
|
|
|
|||
|
17:44 |
We have seen many types of heterogeneity issues.
|
|
|
|||
|
17:50 |
When we bring modalities together, are there any learning and optimization challenges?
|
|
|
|||
|
17:57 |
And yesterday.
|
|
|
|||
|
18:00 |
The animal modalities.
|
|
|
|||
|
18:03 |
They were trying to do classification from RGB video.
|
|
|
|||
|
18:16 |
And one core finding they have is that
|
|
|
|||
|
18:19 |
all of these modalities
|
|
|
|||
|
18:21 |
learn at different rates: some are easier to learn from and more quickly overfit.
|
|
|
|||
|
18:33 |
Others take longer, so you don't actually learn from them.
|
|
|
|||
|
18:36 |
And again, this is one disadvantage inherited from these different modalities being heterogeneous.
|
|
|
|||
|
18:48 |
To address this problem, what they proposed was to actually keep track of how much each modality is learning
|
|
|
|||
|
18:59 |
and better balance them.
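
Keeping track of how much each modality is learning, and rebalancing accordingly, can be sketched with per-modality training and validation losses at two checkpoints. The heuristic below (reward validation improvement, penalize growth of the train/validation gap) is an illustrative assumption and should not be read as the exact method from the paper discussed in the talk; the function name `blend_weights` and the loss values are hypothetical.

```python
def blend_weights(history, eps=1e-8):
    """Compute normalized per-modality loss weights.

    `history` maps a modality name to (train_before, val_before, train_after, val_after).
    """
    raw = {}
    for name, (train_0, val_0, train_1, val_1) in history.items():
        gain = max(val_0 - val_1, 0.0)  # how much validation loss dropped
        overfit = max((val_1 - train_1) - (val_0 - train_0), 0.0)  # gap growth
        raw[name] = gain / (overfit ** 2 + eps)
    total = sum(raw.values())
    if total == 0.0:
        # No modality improved: fall back to uniform weights.
        return {name: 1.0 / len(raw) for name in raw}
    return {name: value / total for name, value in raw.items()}


history = {
    "audio": (1.0, 1.0, 0.2, 0.9),  # fits training data fast, generalizes little
    "video": (1.0, 1.0, 0.6, 0.7),  # learns slower, generalizes better
}
weights = blend_weights(history)
```

In this toy case the audio branch overfits quickly, so most of the weight goes to the video branch. The weights would typically scale the per-modality loss terms and be recomputed periodically during training.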
|
|
|
|||
|
19:06 |
One big issue here is that instead of just training one model,
|
|
|
|||
|
19:11 |
Without a PC or data.
|
|
|
|||
|
19:20 |
I got you a bit slower but going rates across modalities.
|
|
|
|||
|
19:29 |
What's the monument open.
|
|
|
|||
|
19:37 |
We have just barely scratched the surface.
|
|
|
|||
|
19:43 |
Understanding the presence and type of cross-modal interactions:
|
|
|
|||
|
19:49 |
there are many challenges.
|
|
|
|||
|
19:53 |
Quantification is an overarching technical challenge that touches each of these previous five challenges: in learning representations, how can we understand interactions; in the reasoning process; and how can we understand the learning challenges in training multimodal models?
|
|
|
|||
|
20:12 |
I would say that almost all of these are open questions.
|
|
|
|||
|
20:16 |
And if this light overwhelms you the idea is that is true.
|
|
|
|||
|
20:20 |
Quantification is this overarching challenge that looks across all of them.
|
|
|
|||
|
20:31 |
There are a lot of open challenges.
|
|
|
|||
|
20:39 |
To summarize the challenges in multimodal learning, we start with representation and alignment.
|
|
|
|||
|
20:45 |
Representation is the idea of capturing heterogeneity.
|
|
|
|||
|
20:52 |
Alignment is the idea of capturing the connections and dependencies between elements.
|
|
|
|||
|
21:06 |
Once you have captured representation and alignment,
|
|
|
|||
|
21:09 |
we can use that to better inform our reasoning process: structured inference, ideally with interpretable concepts.
|
|
|
|||
|
21:23 |
Sometimes you might do explicit reasoning.
|
|
|
|||
|
21:25 |
To get a prediction.
|
|
|
|||
|
21:27 |
I want to do accessories need to better.
|
|
|
|||
|
21:31 |
or to better transfer knowledge from one modality to another.
|
|
|
|||
|
21:36 |
Of course, we don't do reasoning all the time. It would be great if we did, but sometimes we might just skip from representation and alignment to whatever end task there is, whether prediction, generation, or transference.
|
|
|
|||
|
21:49 |
And finally, quantification is this overarching challenge that revisits all these previous challenges and tries to get a better understanding of the learning and optimization challenges in these multimodal methods.
|
|
|
|||
|
22:04 |
Again, this is just a small glimpse into the whole spectrum of multimodal research.
|
|
|
|||
|
22:09 |
Coming soon to arXiv, we are going to release this brand-new survey paper that goes more in depth into this taxonomy, captures more sub-challenges, and also surveys more than 600 papers
|
|
|
|||
|
22:21 |
that are relevant in this area.
|
|
|
|||
|
22:24 |
If we need papers isn't you think you still have this phone.
|
|
|
|||
|
22:29 |
with course content publicly available alongside lecture videos.
|
|
|
|||
|
22:43 |
Set the alarm.
|
|
|
|||
|
22:46 |
And if you want to take away one thing, it is the fact that multimodal is the science of heterogeneous and interconnected data.
|
|
|
|||
|
22:52 |
Thank you everyone.
|
|
|
|||
|
22:55 |
(End of video)
|
|
|