WEBVTT
Kind: captions
Language: en

00:00:00.160 --> 00:00:01.360
Katie Naum:
Next, we have&nbsp;&nbsp;

00:00:02.240 --> 00:00:07.200
Michela Pazzani and Albert Hsiao of the&nbsp;
University of California at San Diego&nbsp;&nbsp;

00:00:08.000 --> 00:00:13.040
who will be telling us about explaining machine&nbsp;
management for COVID-19 management, so I’ll let&nbsp;&nbsp;

00:00:13.040 --> 00:00:17.280
you guys take it away. Thank you.
Michael Pazzani:&nbsp;

00:00:17.280 --> 00:00:22.320
Okay so first I'd like to thank NSF, particularly&nbsp;
information intelligence systems, for funding&nbsp;&nbsp;

00:00:22.320 --> 00:00:28.160
this, and this is a collaboration between myself,&nbsp;
a computer scientist and Albert, a radiologist&nbsp;&nbsp;

00:00:28.160 --> 00:00:34.080
with a deep background in deep learning, and we're&nbsp;
using machine learning methods, particularly deep&nbsp;&nbsp;

00:00:34.080 --> 00:00:41.280
learning, to analyze CTs or x-rays to manage&nbsp;
COVID-19. While a diagnosis is important, we're&nbsp;&nbsp;

00:00:41.280 --> 00:00:47.600
also concerned with understanding the severity&nbsp;
of this disease from the imaging, and finally&nbsp;&nbsp;

00:00:47.600 --> 00:00:54.800
it's not sufficient just to say you have a 96%&nbsp;
chance of having COVID but, or even to highlight&nbsp;&nbsp;

00:00:54.800 --> 00:01:00.320
a portion of the lower lung, but we aspire to&nbsp;
label the images with things like, there's ground&nbsp;&nbsp;

00:01:00.320 --> 00:01:06.960
glass in the lower left lung. We're evaluating&nbsp;
a variety of existing approaches for both&nbsp;&nbsp;

00:01:06.960 --> 00:01:13.360
classification and explanation and starting to&nbsp;
develop new ones as well. The ultimate goal is for&nbsp;&nbsp;

00:01:13.360 --> 00:01:19.120
machine learning to acquire diagnostic signs that&nbsp;
can be communicated to people, such as a clinician&nbsp;&nbsp;

00:01:19.120 --> 00:01:24.560
when they're doing diagnosis, or perhaps to&nbsp;
teach peers or residents without even [inaudible]&nbsp;&nbsp;

00:01:24.560 --> 00:01:32.720
computers marking up the images. On the next&nbsp;
slide we show the process of deep learning-&nbsp;

00:01:34.800 --> 00:01:40.800
-and essentially we have a database, an existing&nbsp;
database of images as well as new images that&nbsp;&nbsp;

00:01:40.800 --> 00:01:47.440
unfortunately we've collected at UCSD over the&nbsp;
past few months of patients with COVID, and the&nbsp;&nbsp;

00:01:47.440 --> 00:01:55.360
goal is to take images of normal patients, those&nbsp;
with COVID-19, those with other conditions and&nbsp;&nbsp;

00:01:55.360 --> 00:02:02.240
come up with a learning method that distinguishes&nbsp;
them, and also provides the explanation. We're&nbsp;&nbsp;

00:02:02.240 --> 00:02:06.800
quite fortunate in that we get to leverage a lot&nbsp;
of existing infrastructure that was already in&nbsp;&nbsp;

00:02:06.800 --> 00:02:14.080
place of doing imaging, sending it to the cloud&nbsp;
for analysis, and then sending it to the clinic&nbsp;&nbsp;

00:02:14.080 --> 00:02:19.600
where the physician end user can observe it.&nbsp;
All of this was in place already by Albert at&nbsp;&nbsp;

00:02:19.600 --> 00:02:27.760
UCSD Health and we've just had to modify the cloud&nbsp;
diagnostic procedures with new data from COVID-19.

00:02:27.760 --> 00:02:28.960
And from here I'll let&nbsp;&nbsp;

00:02:28.960 --> 00:02:32.080
Albert take over and explain a little&nbsp;
bit more about how we're doing this.

00:02:32.080 --> 00:02:36.000
Albert Hsiao:
Thanks Mike, thank you again for giving us the&nbsp;&nbsp;

00:02:36.000 --> 00:02:42.960
opportunity to present this work. It's definitely&nbsp;
very technical in nature but also very clinical&nbsp;&nbsp;

00:02:42.960 --> 00:02:51.200
and impactful immediately as we're already using&nbsp;
it in our clinic. The primary concept is really to&nbsp;&nbsp;

00:02:51.200 --> 00:02:58.720
develop AI algorithms that allow us to localize&nbsp;
pneumonia. This is work that we started even&nbsp;&nbsp;

00:02:58.720 --> 00:03:07.840
before the COVID-19 pandemic began, but has&nbsp;
been accelerated a lot because of the need.&nbsp;&nbsp;

00:03:08.960 --> 00:03:15.920
One really key important aspect of COVID-19 is&nbsp;
that not every patient develops pneumonia, some&nbsp;&nbsp;

00:03:15.920 --> 00:03:20.800
patients do and some patients don't, some become&nbsp;
asymptomatic, of course, but those that do,&nbsp;&nbsp;

00:03:21.440 --> 00:03:29.840
the severity of pneumonia on x-ray or CT provides&nbsp;
us very good prognostic information and a lot of&nbsp;&nbsp;

00:03:29.840 --> 00:03:35.280
data is starting to come out with that. We've&nbsp;
taken a very different strategy towards this&nbsp;&nbsp;

00:03:35.280 --> 00:03:40.960
U-net type segmentation approach as opposed to a&nbsp;
lot of classification approaches that have been&nbsp;&nbsp;

00:03:40.960 --> 00:03:48.320
previously used, although both are feasible, and&nbsp;
you can generate these kinds of probability maps&nbsp;&nbsp;

00:03:48.320 --> 00:03:53.680
and what-not - activation maps from classification&nbsp;
approaches which we'll be exploring as well.&nbsp;&nbsp;

00:03:55.040 --> 00:04:02.080
The important aspect is quantifying the severity&nbsp;
of illness, I guess, essentially gives us&nbsp;&nbsp;

00:04:02.080 --> 00:04:07.440
prognostic information, because ultimately we want&nbsp;
to know which patients require hospitalization,&nbsp;&nbsp;

00:04:07.440 --> 00:04:12.880
which ones can stay at home, which ones require&nbsp;
mechanical ventilation and which ones are likely&nbsp;&nbsp;

00:04:12.880 --> 00:04:20.080
to survive or not. And some of our initial data&nbsp;
here is showing us that those patients with&nbsp;&nbsp;

00:04:20.080 --> 00:04:26.400
high likelihood predicted by the - by the&nbsp;
algorithm are also the ones that tend to&nbsp;&nbsp;

00:04:27.280 --> 00:04:34.720
not survive and are the ones that tend to require&nbsp;
intubation. So, this will give us really good&nbsp;&nbsp;

00:04:34.720 --> 00:04:39.440
data for how to best manage these patients&nbsp;
so that's a really critical element of how-&nbsp;

00:04:39.440 --> 00:04:43.680
-how this will all come into play in our&nbsp;
clinic and hopefully many others through&nbsp;&nbsp;

00:04:43.680 --> 00:04:49.280
our collaborations. Our current results in&nbsp;
COVID-19, this is one example of a patient&nbsp;&nbsp;

00:04:49.280 --> 00:04:56.160
with COVID-19 who presented to our clinic. Our&nbsp;
AI algorithm produced our initial AI algorithm&nbsp;&nbsp;

00:04:56.160 --> 00:05:01.040
produced this result, very subtle that doesn't&nbsp;
really highlight the areas of pneumonia that well&nbsp;&nbsp;

00:05:02.160 --> 00:05:09.040
as it was trained on initially only public data&nbsp;
before COVID-19 and we came up with a strategy&nbsp;&nbsp;

00:05:09.040 --> 00:05:13.840
that uses active learning transfer learning&nbsp;
to specifically identify good cases for us to&nbsp;&nbsp;

00:05:14.400 --> 00:05:21.520
train on applying transfer learning to that neural&nbsp;
network. Using also concurrently performed CTs&nbsp;&nbsp;

00:05:21.520 --> 00:05:27.840
that were done to give us a better ground truth&nbsp;
and that that's given us a higher performance&nbsp;&nbsp;

00:05:27.840 --> 00:05:31.520
both on the internal and external data set&nbsp;
and really highlights the pneumonia better so&nbsp;&nbsp;

00:05:31.520 --> 00:05:36.320
our initial algorithms that were in place&nbsp;
we're replacing with these updated algorithms-&nbsp;

00:05:37.040 --> 00:05:45.760
-thanks to the support of the NSF in this project&nbsp;
we've been deploying this it's in our clinic and&nbsp;&nbsp;

00:05:45.760 --> 00:05:52.880
there's certainly articles online about it as well&nbsp;
as a peer-reviewed publication that we brought out&nbsp;&nbsp;

00:05:52.880 --> 00:05:58.240
at around the time that we were investigating&nbsp;
this initially. And our next steps are&nbsp;&nbsp;

00:05:58.240 --> 00:06:03.840
really to aggregate large data sets across&nbsp;
multiple institutions, have multiple readers&nbsp;&nbsp;

00:06:03.840 --> 00:06:10.640
annotate markup the areas of pneumonia to sort of&nbsp;
solidify the ground truth a little bit, use the CT&nbsp;&nbsp;

00:06:10.640 --> 00:06:17.680
as well and develop this comparable algorithm for&nbsp;
CT in in the process. And ultimately, we want this&nbsp;&nbsp;

00:06:17.680 --> 00:06:22.640
algorithm to be explainable, is really critical&nbsp;
for us to be able to use it clinically is to&nbsp;&nbsp;

00:06:23.760 --> 00:06:28.880
be certain that we're relying on features that&nbsp;
really matter not, not sort of accessory features&nbsp;&nbsp;

00:06:28.880 --> 00:06:34.240
that the neural network sort of coincidentally&nbsp;
saw associated with COVID but actually things&nbsp;&nbsp;

00:06:34.240 --> 00:06:40.320
are related to it and then. And then assess its&nbsp;
clinical utility both in the detection of disease,&nbsp;&nbsp;

00:06:41.120 --> 00:06:45.840
distinguishing between other diseases that are&nbsp;
quite similar like pulmonary edema, and then give&nbsp;&nbsp;

00:06:45.840 --> 00:06:51.360
us best management practices for these patients,&nbsp;
so that's kind of where we're going. Thank you.

