Blinded Knee Doctors

    Introduction

    Faced with a need for surgery, the first concern of both patient and doctor is that a surgical procedure is a success. Though not as important, the desire for a speedy recovery and a limited amount of discomfort warrants research to address these concerns. For this reason many medical studies are conducted to investigate the effects of procedures, drugs and post-operative therapy on the amount of pain and time of recovery of surgical patients. In one such example, researchers Nelson, Henderson, Almekinders, et al. (1993), performed a double-blinded study on the effects of a nonsteroidal anti-inflammatory drugInformationNonsteroidal Anti-Inflammatory Drug:A drug designed to reduce inflammation without the use of steroids. given before and after arthroscopic knee surgery.

     

     

     

     

Synopsis

Abstract
This story discusses a double-blinded experiment to test the effects of a nonsteroidal anti-inflammatory drug on pain. The experiment examines effects of giving the treatment both before and after surgery or after surgery only.

Extensions
Table of results.

6 Questions
Blinding, bias, validity and reliability, sample size effects.
Basic: Q1-6

Protocol

Information Eighty-three patients between the ages of 18 and 55 undergoing elective knee arthroscopyInformationArthroscopy:A surgical procedure using a small scope to examine and repair damage on the interior of a joint. were randomly divided into three groups. Group A received the nonsteroidal anti-inflammatory drug (NSAID) both 3 to 5 days prior to surgery and 7 days after surgery. Group B received a placebo before surgery and the NSAID after surgery. Group C received the placebo both before and after surgery. Post-operatively all patients were given prescriptions for codeineInformationCodeine:A narcotic drug used as a pain reliever and cough remedy. which could be taken every 4 to 6 hours as needed.

Data were collected from four sources: patient questionnaires, knee examinations, surgery records, and physical therapy sessions. The patient questionnaire, given twice before surgery and four times after surgery, provided information on medication use, crutch use, possible side effects, activity level, and level of pain. Four standardized knee examinations, conducted by orthopedic physicians once preoperatively and three times after surgery (at 1, 3, and 8 weeks), assessed range of motion, tenderness, instability, and other measures of the physical condition of the knee. The surgery records indicated the type of anesthesia, arthroscopic procedure(s), duration of procedure(s), and the surgeons' assessment of the severity of the patient's condition. Physical therapy sessions tested muscle strength in the quadricepsInformationQuadriceps:Muscles in the thigh. and hamstringsInformationHamstrings:Tendons in the upper thigh. once before surgery and at the three postoperative exams. For the duration of the study, all of the patients, physicians, and therapists were blinded as to whether the patient was in Group A, B, or C.

 

 

 


Results

Sixteen of the 83 patients were excluded from analysis because of postponement of surgery (3 patients), intolerance of preoperative medication (1 patient), failure to follow medical instructions (4 patients), failure to return 1 week after surgery for exams (6 patients), or expansion of the surgical procedure (2 patients). Conversely seven patients who had at least one follow-up exam but did not return after the 1-week exam and four patients who did not return after the 3-week exam were included in the analysis.

Based on the 67 patients used in the analysis the researchers reported that, at one day after surgery, the pain scores for Group A were lower (P-value = 0.02) than Group C and lower, but not significantly lower, than Group B. Differences in pain scores at all other times were not found to be significant (P-values > 0.1). In addition, Group C took more codeine pills in the first 24 and 72 hours following surgery (P-value < 0.02) than Groups A and B. Differences between the groups in the measures of physical condition and speed of recovery were not found to be statistically significant at any of the assessment stages in the study.

A table of the mean pain scores by group and examination time is given below:


Questions

  • Question 1
  • Question 2
  • Question 3
  • Question 4
  • Question 5
  • Question 6
Question 1

An experiment is called a double-blinded study when both the subjects and the researchers are unaware who is receiving the treatment and who is receiving the placebo.

a) Why is it important for this study to be double-blinded?

b) Describe a treatment vs. control study in which blinding is not possible for the subject.

c) Is it possible for the researcher to be blinded in your example for Part b? If so describe how this could be accomplished. If not explain why it would not be possible.

 

Learning Objectives
  • Understand the advantages of a comparative experiment. Be able to identify the basic features of a randomized comparative experiment.
Solution

a) Several of the measurements (for example, level of pain) by both patient and doctor are subjective and may be influenced by knowledge of whether or not the patient was a member of the treatment group.

b) One example would be a study of the effectiveness of a physical therapy or exercise program such as swimming or the use of an exercise bike. In this case the patients would be aware of the treatment and therefore any variables measured requiring their input may be affected by their knowledge of their participation in the treatment.

c) For the examples given in Part b one could blind the researcher who performs the assessment by having someone else perform the therapy or surgical procedure. This would eliminate possible bias in the evaluation of the treatment.




Question 2

The researchers reported that one patient was removed from the study due to "intolerance of the preoperative study medication (1 patient who was receiving a placebo)" and "slightly more patients who took a placebo reported side effects potentially caused by the study medication" than did patients who received the NSAID. Do these results say anything about the importance or effectiveness of the blinding in this study?

Learning Objectives
  • Understand the advantages of a comparative experiment. Be able to identify the basic features of a randomized comparative experiment.
Solution

The fact that reported side effects were more common in the control group may indicate that the procedure for blinding was effective and/or the NSAID may in fact limit possible side effects due to other factors present in the story (e.g. codeine pills and the anesthesia used during the operation).



Question 3

The researchers state:

“The fact that all patients enrolled in the study were not followed the full 8 weeks could potentially bias the results if subjects were lost to followup, depending on which study medication was taken, the NSAID or the placebo.”

Do you agree with this statement? If so, how might this affect the results? If not, why would this fact not bias the results?

Learning Objectives
  • Be able to identify problems with poorly designed experiments.

Solution

One possible reason for not participating in the follow-up exams might be that the patients felt the knee had healed enough that further examination was unnecessary. If the majority of these patients came from the control group their absence might exaggerate negative results for the remaining control group members. Conversely, if a majority of these patients were from the treatment groups it might lessen the positive results of the remaining treatment subjects.





Question 4

The patients were asked to assess the level of pain they were experiencing at each of the exam times. A pain score was based on the patients' marks on a 10-cm line with a range from 0 (no discomfort) to 100 (agonizing pain).Is the pain score a valid measurement of the amount of pain felt by the patients? Explain your answer.

 

Learning Objectives
  • Be able to identify potential sources of bias.
Solution

The validity of the scale is questionable since pain is a highly subjective quantity and it would be difficult to assume that similar scores for different people would indicate the same amount of pain. However, repeated measurements over time could serve as a valid indicator of increases or decreases in the level of pain for all of the subjects.





Question 5

Examine the table of mean pain scores.

a) At which exam time(s) would it be desirable for there to be little or no difference between the three groups? Does this appear to be the case? What does this say about the randomization used to determine the three groups?

b) For one exam time and group use the information provided to describe the likely distribution of the pain scores. Is it plausible to assume the data is normally distributed? Explain your answer.

 

Learning Objectives
  • Understand three basic techniques fundamental to well-designed experiments: randomization, replication, and blocking. Understand that the first is used to decrease bias and that the last two help to increase precision.
  • Be able to assess important features of the shape of a distribution by looking at a histogram (single or multiple peaks, symmetric or skewed, the existence of outliers).

 

Solution

a) It is important that there is little or no difference between the three groups at the start of the study to eliminate some of the possibility that a factor other than the treatment is responsible for differences observed later in the study. In this case we see that Group C had slightly higher pain scores than Groups A and B at the time of enrollment but there is considerable overlap of all three groups within one standard deviation. If the pain score was the only variable of interest for the study this would suggest the randomization was successful in creating three similar groups of patients.

b) As one example consider Group B on the day of surgery where the mean is 41, the SD is 27, and the range is from 5–89. Comparing these numbers it is evident that all of the observations lie within two standard deviations of the mean, which is near the middle of the range. This implies that the distribution is somewhat symmetric and has few outliers.

As a second example consider Group B at one week after surgery, where the mean is 26, the SD is 23, and the range is from 2–84. This time the distribution does not appear to be symmetric due to the presence of extreme values near the high end of the scale.

Although neither description exactly describes a normal distribution one might consider the data to be from a truncated (tails removed) normal distribution due to the limits imposed by the use of the pain scale. A histogram of the individual scores would help us determine if this is a reasonable assumption.

Question 6

Suppose you only had information on the pain scores at the exam times listed in the table. For each research question below choose one of the suggested relative comparisons and explain why it may be the best choice for addressing the question.

a) Research Question: Is the NSAID beneficial as a preoperative treatment?

i. Comparison of the pain scores at day of surgery among Groups A, B, and C.
ii. Comparison of the change in pain scores from day of enrollment to day of surgery among Groups A, B, and C.
iii. Comparison of the pain scores at day of surgery between Group A and the combined grouping of Groups B and C.
iv. Comparison of the change in pain scores from day of enrollment to day of surgery between Group A and the combined grouping of Groups B and C.

b) Research Question: Does NSAID aid in the speed of recovery immediately after surgery?

i. Comparison of the pain scores one week after surgery among Groups A, B, and C.
ii. Comparison of the change in pain scores from one day after surgery to one week after surgery among Groups A, B, and C.
iii. Comparison of the pain scores one week after surgery between Group C and the combined grouping of Groups A and B.
iv. Comparison of the change in pain scores from one day after surgery to one week after surgery between Group C and the combined grouping of Groups A and B.

Learning Objectives
  • Be able to conduct a formal hypothesis test for

a) a population mean,
b) a population proportion,
c) association between two categorical variables.

Be able to state the relevant hypotheses, compute the test statistics, and compare the value of the test statistic to the appropriate distribution. Be able to draw a formal conclusion in the language of the problem.

Solution

a) One way to assess the benefit of the treatment would be to see if there was a difference in the reduction in pain from time of enrollment up to the time of surgery. Also, since both Groups B and C received the same treatment (i.e. a placebo) up to this point we are not concerned about any differences in these two groups. Therefore the comparison given in (iv) may be the best choice of those provided to address this research question.

b) The property of interest, speed of recovery, may be addressed using the differences in pain scores from one time to another. This time both Groups A and B received the same treatment (i.e. the NSAID) following surgery. However, their treatment differences before surgery may be confounded in the results if we combine these two groups. Therefore the comparison given in (ii) may be the best choice of those provided to address this research question.

 

 

References

Nelson, W. E., Henderson, R. C., Almekinders, L. C., DeMasi, R. A., and Taft, T. N. (1994)

Credits

This story was prepared by Greg Elfring and last modified on 3/12/95.