skip navigation
student.bmj.com

Can Google help you to diagnose patients' problems?

This month Trish Groves takes you through a cross sectional study that tried to find out whether the internet search engine Google is useful as a diagnostic tool

This month's paper is "Googling for a diagnosis-use of Google as a diagnostic aid: internet based study" by H Tang and J H K Ng (BMJ 2006;333:1143-5). You can read it by going to studentbmj.com and clicking on the link. Don't miss the rapid responses to the article on bmj.com at http://bmj.com/cgi/eletters/333/7579/1143. In the first couple of weeks after online publication the article prompted more than 30 responses, many of them quite critical.


Abstract

Objective-To determine how often searching with Google (the most popular search engine on the world wide web) leads doctors to the correct diagnosis.

Design-Internet based study using Google to search for diagnoses; researchers were blind to the correct diagnoses.

Setting-One year's (2005) diagnostic cases published in the case records of the New England Journal of Medicine.

Cases-26 cases from the New England Journal of Medicine; management cases were excluded.

Main outcome measure-Percentage of correct diagnoses from Google searches (compared with the diagnoses as published in the New England Journal of Medicine).

Results-Google searches revealed the correct diagnosis in 15 (58%, 95% confidence interval 38% to 77%) cases.

Conclusion-As internet access becomes more readily available in outpatient clinics and hospital wards, the web is rapidly becoming an important clinical tool for doctors. The use of web based searching may help doctors to diagnose difficult cases.


csiro
Dr Google will see you now


Why did the authors do the study?

One of the doctors who did this study saw a teenager whose father had used Google to find a diagnosis: "After evaluating a 16 year old water polo player who presented with acute subclavian vein thrombosis, one of us (HT) started to explain that the cause of the thrombosis was uncertain when the patient's father blurted out, "But of course he has Paget-von Schrötter syndrome.'"

The father had used Google to search the web for information and, on the strength of this, gave the doctor a mini-tutorial on the pathophysiology (hypertrophy of the neck muscles leading to dynamic compression of the axillary vein at the thoracic inlet-leading to thrombosis) and treatment of the syndrome. This made the doctor ask, "How good is Google in helping doctors to reach the correct diagnosis?"

The answers to this question could be "potentially confusing" or "overwhelming," given that, when I put the term "acute subclavian vein thrombosis" into Google's search box on 29 November, the search yielded 103 000 results. How can a doctor, let alone a patient without medical training, evaluate so much information? Tang and Ng set out to tell us.


What did the researchers do?

The authors decided to use previously published case reports to test Google's ability to find diagnostic information. They looked at reports published in one journal, the New England Journal of Medicine, in 2005 and chose all 26 reports that focused on diagnosing rather than managing illnesses. These were mostly "mystery illnesses"-complex diseases with non-specific symptoms or common diseases with rare presentations.

From each report they picked the three to five main words or phrases that most specifically described the patient's symptoms and signs and entered them on to a data sheet (the words and phrases the authors chose are given in a table on bmj.com).They didn't read the reports further to learn the final diagnoses.

The authors used these words and phrases as search terms on Google and, for each case report, noted the three most prominent and appropriate diagnoses that Google found. To save time-and, perhaps, knowing that Google yields the best websites first (by ranking web pages in order of the number of links to them: a reasonably good proxy for quality)-they looked only at diagnoses mentioned in the first 30 search results.

When the authors compared these diagnoses with articles reported in the New England Journal of Medicine their internet searches had found the correct diagnosis in 15 out of 26 cases (58%). For example, from a case report of a 48 year old man with loss of consciousness while jogging, Tang and Ng chose the search terms "cardiac arrest", "exercise", "young", and Google found articles indicating a diagnosis of hypertrophic obstructive cardiomyopathy, which turned out to be correct for that case. On the other hand, for a report of a 17 year old tsunami survivor with respiratory distress and paralysis (hemiplegia) on the right, the Google search terms were "pulmonary infiltrates" and "CNS (central nervous system) lesion": the internet came up with a diagnosis of aspergillus (fungal) infection while the patient had actually had pneumonia and a brain abscess from breathing in water contaminated by several types of bacteria.


Did this study show whether Google was a useful diagnostic tool?

Google found the right diagnosis in 58% of cases. Tang and Ng reported that this result had a 95% confidence interval of 38% to 77%. To put it another way, the true percentage had a 95% chance of falling somewhere between these two percentages if someone did the same study many times with another set of 26 case reports from the same journal in another year (if all other things stayed the same, for instance, the journal's criteria for accepting case reports and the authors' criteria for selecting search terms and picking the best diagnoses). This is a wide interval, and one that left some BMJ readers unimpressed. Of the doctors who sent a rapid response to bmj.com, Oltunde A Ashaolu, said "Tang and Ng seem to be comfortable with a correctly predicted diagnosis of 58% (with a lower confidence interval limit of 38%) whereby a lot of my colleagues may feel much less so The researchers have expertly selected relevant symptoms to input data into Google and a yield of 58% is obtained. A junior doctor may input a slightly different set of symptoms and the array of available information in fact becomes a quagmire of confusion rather than being an educative medium."


Was this a good study?

The study used a simple cross sectional design, looking at and describing a clearly defined sample (of case reports) in two different settings-firstly in a journal and then, after reducing the case reports to just a few key words, in the context of the web. Cross sectional studies are most often used in medical research for surveying a population or sample of patients at one point in time to see what health problems or risk factors they have. They cannot tell you anything meaningful about the causes of an illness or the effects of an intervention or experiment.

It would be going too far to describe Google as a diagnostic test in this study; at best it's a tool. The equivalent diagnostic test would be: the whole package of search terms chosen by doctors with some clinical expertise plus the use of the search engine plus the contents of the web pages the search found plus the doctors' criteria for selecting the best pages. This test, therefore, relies a lot on the doctors' expertise and knowledge. And to assess properly the performance and value of such a diagnostic test the authors would have had to compare their Google method with the best diagnostic test currently available-the "gold standard" and calculated both of the tests' sensitivity (ability to correctly find the right diagnoses) and specificity (ability to spot the wrong diagnoses). Comparing Google with another search engine would have been a good start, and it would have been most useful and valid to compare Google with the widely used academic search engine PubMed.

Indeed, one rapid responder to the BMJ, Reinhard Wentz, did just that and found that a PubMed search found the right diagnosis 88% of the time, for the same case reports Tang and Ng used and with the same search terms. It took him about three minutes per case report. Dr Wentz also said, "Is Google perhaps the modern version of 10 000 monkeys sitting at 10 000 typewriters, who, given time, will eventually produce a true copy of Hamlet? Among the one correct version there will of course be thousands with just one or two typos. So, can Mark Twain's remark 'Be careful about reading health books. You may die of a misprint' be applied to Google?"

If Tang and Ng had wanted to assess the validity of their diagnostic test, an even better gold standard would comprise all of the best current diagnostic methods for each of the conditions described in the 26 case reports. This would have been much more clinically meaningful and scientifically strong. Another rapid responder, Catherine A Smith, also pointed out that the study would have been much more useful if the researchers had given us information about the sources of the diagnostic answers-both the correct and the incorrect ones.

If Google had found the right diagnoses more often, with less need for expert medical knowledge on the part of the researchers, the article might have prompted even more debate on bmj.com about whether patients really need doctors any more.


How much does this study matter?

I would give Tang and Ng's study six out of 10 for clinical relevance and scientific strength because it was only cross sectional and did not compare Google with any other way of finding diagnoses. But the journalistic merits of the work, its topicality, and its success in prompting debate have bumped up my overall score to eight out of 10.

Tang and Ng wanted to assess, in a reasonably objective way, something that many patients and doctors already do. For instance, they say that 25 million people in the United Kingdom alone had access to the web by 2001, and that searching for health information is one of the commonest uses of the web. Given that doctors are expected to remember about two million facts, this small and preliminary study was worth doing and is worth reading.



Trish Groves, senior editor, studentBMJ
Email: tgroves@bmj.com


studentBMJ 2007;15:1-44 January ISSN 0966-6494



Return to top   
Printer friendly page    Download article PDF    Email this article to a friend