As available now, databases of electronic health records are diverse and massive, but they are also messy and heterogeneous. There’s a lot of noise,” said Jimeng Sun, associate professor at Georgia Tech’s School of Computational Science and Engineering. “Our charge is to find ways to make the information more robust and easier to read, thus leading to meaningful clinical concepts without extensive labor and time.”