Document Type : Original Article

Authors

1 Road Traffic Injury Research Center, Tabriz University of Medical Sciences, Tabriz, Iran

2 Injury Epidemiology and Prevention Research Group, Turku Brain Injury Center, Turku University Hospital and University of Turku, Turku, Finland

3 Cabrini Research, Cabrini Health, Melbourne, VIC, 3144, Australia

4 School of Public Health and Preventative Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, VIC, 3800, Australia

5 Research Center of Psychiatry and Behavioural Sciences, Tabriz University of Medical Sciences, Tabriz, Iran

6 College of Medicine, Charles Drew University of Medicine and Science, Los Angeles, California, USA

7 Department of Medical Informatics, School of Management and Medical Informatics, Tabriz University of Medical Sciences, Tabriz, Iran

Abstract

Objective: To assess psychometric properties of the European Quality of Life 5-Dimension 3-Level Version
(EQ-5D-3L) commonly used tool for measuring road traffic injury (RTI) patients’ quality of life.
Methods: The psychometric study assessed the reliability and applicability of EQ-5D-3L through phone
surveys, based on a national cohort platform. Data of 150 RTI patients recruited from the cohort study were
included as 50 patients per each follow-up phase (one, six, and twelve months after discharge). A 12- day-time
span was between test and retest. We measured psychometric properties (internal consistency reliability and
stability reliability) and agreement using Kappa coefficients and percentages of agreement and Bland-Altman
method. Data were analysed using software STATA statistical package.
Results: The majority of patients were men (80%) with mean age (SD) of 41(14.7%), employed (78%) and
educated (86%). The Persian version represented high internal consistency reliability at total level (Cronbach’s
α=0.81) and moderate to good reliability at phase levels (0.62-0.87). The stability reliability was excellent at
total (ICC=0.98, 95% CI: 0.97, 0.98) and phase levels (0.97-0.98. The kappa agreement coefficients were valued
moderate to perfect (0.6-0.8, p>0.0001). The Bland-altman plot illustrated high agreement between test and
retest scores. No floor and ceiling effects were found.
Conclusion: The study revealed that EQ-5D-3L was highly reliable and responsive to be applied through phone
interviews at three different times post injury and discharge, as no previous study considered its psychometric
properties at various phone follow-ups after RTIs.

Keywords

Introduction

 

Road Traffic Crashes (RTCs) annually leave 20 to 50 million people with non-fatal injuries, with many suffering a health problem, and injuries account for 41.2 million years of healthy life lost [1-3]. Moreover, over 2600 person years of life lost are attributed to road traffic injuries (RTIs) [4], which causes the second most deaths in Iran [5, 6]. The RTCs should be of a specific concern in Iran, where only in 2007, more than 27000 people lost their lives and approximately 277000 were injured [7]. To draw a surprising contrast, Nordic countries are universally in the front position on injury prevention, while Iran has globally been prominent for its top road traffic injury and death rates [8], and the first in the Eastern Mediterranean region [9]. Low and middle income countries (LMICs) are the place for 90% of increasing trauma-related deaths and disability-adjusted life years, which decrease national incomes [10]. Hospitalized patients due to RTCs often suffer severe injuries which may lead to functioning problems in the near future or in a long run [11]. In terms of their physical and mental health, the patients’ quality of life is required to be measured using an easy-to-understand and precise tool post injury. The most commonly-used tool to assess health related quality of life (HRQoL) is the EQ-5D in the world. It has presented good validity and responsiveness in a variety of populations [12-16]. Its original version has three response levels for its 5 dimensions, called EQ-5D-3L (European Quality of Life 5-Dimension 3-Level Version). Considering the heavy burden of RTIs in Iran, it is substantial to evaluate the psychometric properties of EQ-5D-3L for accurately assessing RTIs patients’ quality-adjusted life years (QALYs).

The way applied to collect data such as phone surveys comparing with in-person interviews may affect an instrument’s psychometric properties [11]. Therefore, it is essential to assess a tool’s psychometric properties for various data collections. As a major concern, very sparse research study were done through phone survey on health status of injured patients post-crash and post discharge. For such assessing, there is a rare opportunity to have in-person visits, in some settings particularly in low and middle income countries, and the resources do not support in-person interviews. However, we had a great opportunity of a national cohort study. This setting helps to apply the tool and assess its psychometric properties through national cohort platform of Post-crash PERSIAN (Prospective Epidemiological Research Study in IrAN) Traffic Safety and Health Cohort Study (PTC) [17]. Then, the key action was to select and apply reliable and valid tools in measure health-related quality of life after RTIs. In order to assess HRQoL, generic instruments include the Health Utility Index, the EQ-5D-3L, or the Short Form 6D, creating utilities that produce a quantity that combines both the length and quality of life [18]. The cohort study requires a complete tool, short enough to use for phone surveys, such as EQ-5D-3L with great sensitivity. As the brief form of this globally approved tool takes short time, and includes five different health dimensions, we psychometrically assessed it through phone interviews on three follow-up phases (one, six, and 12 months after injury). To our knowledge, the paucity of such all-inclusive tool on creating the health profile of people involved in RTCs is sensible reason to examine its internal consistency and stability reliability among study population through phone surveys at three different time periods post injury. Then, it would pave the study way of the health status in a cost-effective manner.

 

Materials and Methods

 

Assessment of Reliability

The psychometric properties of EQ-5D-3L was evaluated at three times after crash through phone interviews. This study was conducted in a clinical population. The source population were RTIs survivors enrolled from post-crash PTC [17]. All participants hospitalized in one of two referral trauma centres, were followed up through the PTC at one-, six-, and twelve- months follow-up post injury. All hospitalized patients have been recorded in the database of integrated road traffic injury registry system (IRTIRS) since 2019 [19]. The minimum number of sample required was 50 to detect the value of 0.4 for Intra-class correlation coefficient (ICC), with alpha and power defined at 0.05 and lower than 90%, respectively [20]. Patients were randomly recruited from those who must be called via phone to complete follow-ups forms. We included 50 patients per each follow-up. The subjects of each follow-up was different from others, then, we had 150 patients in total who participated in test and re-test phases between May-June 2020. In the participants’ selection process for each follow-up phase, we regarded both gender, four age groups (18-24; 25-44; 45-64; 65 and above), and educational levels (illiterate, school education, academic education) to be included in the sample. A number of ten persons rejected to participate (six patients) or died (one patient) or did not answer the call (three patients). They were substituted by the same sex, age group, and education level. The average time between two measurements was 12 days (10 to 14 days) [21]. They did not have any treatment or changes in their medication during that period. As inclusion criteria, participants were hospitalized due to RTIs for at least 24 hours, one or six or twelve months prior to their interview date dependant on their follow-up phase, registered, aged 18 or over, and informed consent was required. Specific exclusion criteria included having any disorders like psychiatric problems affect the validity of the responses or an event experienced during the test-retest interval significantly increases the level of disability (e.g. disease, car crash, fall etc.). The data collection was performed by a trained interviewer with broad experience (≥4years) [22]. The completeness of tools was mostly guaranteed since the interviewer filled them out. The selected participants were replaced by the same sex and age if they refused to participate.

 

Measures

The European Quality of Life 5-Dimension 3-Level Version

The EQ-5D-3L has been developed since 1980s by a network of global multidisciplinary researchers, “EuroQol group”. It is a broadly applied generic tool for measuring respondents’ diverse aspects of health outcome on the day of the survey [21] by five dimensions of Mobility, Self-care, Usual Activities, Pain/ Discomfort, and Anxiety/ Depression. Each dimension has three levels (no/some/extreme problems), characterised 1–3 [23]. This tool defines 243 (53) unique health states. We calculated the health status scores for test and re-test data at total (n=150) and follow-up (n=50) levels.

 

Socio-demographic Data

Basic socio-demographic and crash-related information such as used/ counterpart vehicle, mechanism, and injured person’s role were collected.

 

Statistical Analysis

Initial analyses of the tool and baseline characteristics data were performed using descriptive statistics. Distribution of demographic variables were tested based on the value of skewness and kurtosis. Kruskal-Wallis test and Wilcoxon rank-sum (Mann-Whitney) test were applied to test the hypothesis that there are not statistically significant differences between groups of independent demographic variables including sex, age, education levels, and marital status, on the dependent variable of quality of life score on the basis of test phase data. To measure the psychometric properties of the tool, we assessed internal consistency reliability and stability reliability. We also evaluated the agreement of test and re-test scores using kappa agreement coefficient and Bland-Altman Plot. Data were analysed using STATA statistical package version 15 (StataCorp LLC, Texas) and a significance level of 0.05 was adopted.

 

Scale Score Reliability

Internal Consistency Reliability

Internal consistency indicates the extent to which the items in a subscale are correlated. The homogeneity was evaluated using Cronbach’s Alpha statistic [24]. A value between 0.5 and 0.75, 0.75 and 0.9, and greater than 0.90 specify moderate, good, and excellent reliability, respectively [25].

 

Test–retest Reliability

Another measurement used to directly assess the scale score reliability of the tool was repeatability assessment using the test-retest method. Comparing the results of both measurements examined the consistency over time. The test-retest correlation was measured through tau Kendal b and Spearman- Brown correlation coefficient. Spearman- Brown coefficient of greater than 0.7 is considered as acceptable [26]. A common metric, ICC derived from a two way mixed model with an absolute agreement definition was calculated between the two phases to quantify the test-retest reliability of disability measurements at three follow-ups and total level [27]. Scores of test-retest were analysed using the “single-rater type (k = 1), absolute-agreement, 2-way mixed-effects model to produce the intra-class correlation with 95% confidence intervals [28]. An ICC coefficient of >=0.75 was considered as evidence of measurement stability. ICC between 0.4 and 0.75 indicates fair to good reliability, and ICC <0.4 indicates poor reliability [29].

 

Agreement between Test and Retest Data

The Kappa coefficients and percentages of agreement were calculated to measure the agreement between test and re-test responses for EQ-5D-3L. To measure the amount of agreement on the ordinal variable between two phases, we applied the Kappa coefficients [30]. Kappa value above 0.81 specifies a nearly perfect agreement, 0.61-0.80 substantial agreement, 0.41–0.60 moderate agreement, 0.21–0.40 fair agreement, below 0.20 indicates a slight agreement [31]. Besides, regarding restrictions of ICC and correlation coefficient in assessing the agreement between the tool’s test and retest data [32], Bland-Altman method was applied to examine their agreement. The result is a scatter plot XY, in which the Y axis displays the difference between the two paired measurements (A-B) and the X axis indicates the average of these values ((A+B)/2) [33]. Thus, its vertical and horizontal axes indicate difference between and the average of test and retest scores, respectively. The limits of agreement were also estimated as mean difference ±1.96 standard deviations of the difference. A good agreement between would be represented by a near to zero mean difference and with a maximum of 5% or less points residing outside the limits of agreement [34].

 

Floor and Ceiling Effects

The floor and ceiling effects were derived from the percentage of respondents with the lowest and highest possible scale scores for individual items. A ceiling effect is considered by 15% or more participants with the highest score [34]. Ceiling and floor effects were considered moderate up to 25% and substantial when higher than 25% [35].

 

Results

 

Sample Characteristics

Our sample composed of 120 men participants (80%). The total mean age (SD) was 41 (14.7) years old. The majority of participants were adults between 25-64 years old (46%), or married (72%), or educated (86%), or employed (78%) (Table 1). At each follow-up phase, the most common crash mechanism was vehicle-vehicle collision (50%-56%), followed by Vehicle-pedestrian collision (20%-30%). The majority of subjects were motorcycle and car drivers (44%-52%) and Passenger/ pillion passenger (24%-32%).

 

Table 1. Basic information at three follow-up phases, separately (N= 50) and overall (N= 150).

Variables

Categories

F-up 1

(N=50)

N (%)

F-up 2

(N=50)

N (%)

F-up 3

(N=50)

N (%)

Total test

(N=150)

N (%)

Sex

Male

43 (86)

42 (84)

35 (70)

120 (80)

Female

7 (14)

8 (16)

15 (30)

30 (20)

Age group

18-24yrs

5 (10)

9 (18)

8 (16)

22 (14.7)

25-44yrs

25 (50)

24 (48)

21 (42)

70 (46.7)

45-64yrs

17 (34)

14 (28)

15 (30)

46 (30.6)

65yrs and above

3 (6)

3 (6)

6 (12)

12 (8)

Marital status

Never married

10 (21.7)

10 (20)

14 (29)

34 (23.4)

Married

34 (74)

38 (76)

33 (67)

105 (72.4)

Others (Divorced/ widowed)

2 (4.3)

2 (4)

2 (4)

6 (4.2)

Education level

Illiterate

4 (8)

6 (12)

10 (20)

20 (14)

School education

42 (84)

38 (76)

36 (72)

116 (77)

Academic education

4 (8)

6 (12)

4 (8)

14 (9)

Employment

Employed

42 (84)

42 (84)

33 (66)

117 (78)

Unemployed

8 (16)

8 (16)

17 (34)

33 (22)

 

As represented in Table 2, the whole patients’ quality of life showed that there are not statistically significant differences between groups of independent demographic variables including sex, age, education levels, and marital status, on the dependent variable of quality of life score on the basis of test phase data.

 

Table 2. Differences between groups of demographic variables on quality of life (N=150).

Variables

Categories

Test

p valuec

Sexa

Male

0.393

0.69

Female

Age groupsb

18-24yrs

3.504

0.32

25-44yrs

45-64yrs

65yrs and above

Marital statusb

Never married

2.781

0.42

Married

Others (Divorced/ widowed)

Education levelsb

Illiterate

2.636

0.26

School education

Academic education

a Wilcoxon rank-sum (Mann-Whitney) test; bKruskal Walis Test; cNo statistically significant differences between groups

 

Scale Score Reliability

Floor and Ceiling Effects

The proportion of patients (N=150) with a full health state (11111) and worst health state (33333) were 2.7% and 8%, respectively. There was reported no floor and ceiling effects in each follow-up (floor and ceiling effects in F-up1: 0, 16; F-up2: 2, 4; F-up3: 6, 4, respectively).

 

Internal Consistency Reliability

Cronbach’s Alpha Coefficient

For the total test data (N=150), the overall Cronbach’s α coefficient resulted in 0.81 showing excellent internal consistency. The Cronbach’s α coefficients were estimated for each follow-up phase (N=50) as 0.62 at one month after crash, 0.69 at six months after crash, and 0.87 at 12 months after crash.

 

Stability Reliability

Testing the test-retest reliability, the Spearman-brown correlation coefficients were above 0.9 at one, six, and 12 months after crash (0.97, 0.98, and 0.97, respectively) as well as at total score level (0.98).

The reliability of the test-retest method for the EQ-5D-3L was confirmed using the ICC as described in the materials and methods. The ICC showed the absolute agreement between test and retest data at three post-crash follow-ups. For total data, it was excellent at both total and item levels. The coefficients suggested good to excellent reliability and stability (Figure 1). The minimum ICC at item levels was 0.85 for Anxiety.

Fig. 1. Test-retest reliability of the European Quality of Life 5-Dimension 3-Level Version (EQ-5D-3L) at three follow-up phases in road terrific injuries (RTI) patients.ICC: Intraclass Correlation Coefficient

 

Testing the correlation between total test and retest data (N=150), Kendall’s tau-b showed 0.92. At item level, it ranged from 0.81 for Anxiety to 0.97 for Mobility, with p>0.0001 (Table 3). The kappa agreement coefficients were estimated moderate to perfect ranged from 0.6 at F-up 3 to 0.8 at F-up 2, with p>0.0001. The percentage of agreement was high enough (Table 3) [36].

 

Table 3. Test-retest correlation and agreement of the European Quality of Life 5-Dimension 3-Level Version (EQ-5D-3L) at three phone-based follow-ups.

Items

F-up 1 (N=50)

F-up 2 (N=50)

F-up 3 (N=50)

Total data (N=150)

Mobility

Kendall’s tau-ba

0.96

1.0

0.93

0.97

Self-care

0.82

0.94

0.94

0.93

Activities

0.84

0.84

0.85

0.88

Pain

0.80

0.91

0.91

0.89

Anxiety

0.89

0.85

0.84

0.84

Total data

Kendall’s tau-ba

0.89

0.92

0.91

0.92

Kappa

0.75

0.80

0.57

0.71

Agreement

80%

84%

62%

75%

aNote: each domain has three levels (0: no problems, 1: some problems, 2: extreme problems); All coefficients are at significance level of 0.0001.

 

Bland-Altman Plot

To assess agreement between results of test and retest data, we used Bland-Altman plot. In this graph, the majority of the points with higher score fall within the range of ±2SD of the mean differences and represent high agreement between the tool’s test and retest scores. As the score increases (health status gets worse) more points gather around the mean difference (-0.073), with 95 % limits of agreement (-1.21, 1.06; SD=0.58). We have similar number of outlier differences distributed above the high limit and below the low limit (Figure 2).

Fig 2. Bland-Altman plot (agreement between test and retest scores). Y axis, difference between test and retest scores; X axis, average of test and retest scores

 

Discussion

 

This study assessed the psychometric properties of available Persian version of EQ-5D-3L through phone surveys at three various times after crash as a special feature of the paper. This questionnaire was translated in different languages even in Persian [15, 37] but neither specifically through phone survey nor in RTIs patients. Then, we assessed and revealed that Persian EQ-5D-3L is psychometrically reliable to the great extent in phone-based assessment. Adequate internal consistency discloses that the tool measures health status in five dimensions and delivers high reliable score at item level and total score in our representative sample. The instrument is robust to changes in the same subjects over time and less prone to measurement error as time passes by, as ICC showed excellent reliability and the Bland-Altman Plot showed high agreement. The stability reliability of the tool was approved in phone surveys at three different times after crash. This research showed better results than studies in different populations [15, 38] and in in-person interviews [31, 39]. Patients with acute conditions such as road traffic injuries could report different health status from those with chronic diseases. Moreover, time span between test and re-test and different sample size could affect the results.

To collect valid data and prevent from underestimating the burden of disease [40], it was generally preferred to apply not only reliable and valid, but also short and simple instruments in the setting of the national PTC along with other tests. Therefore, it was a proper choice to use EQ-5D-3L in the current study, to examine the patients’ quality of life, regarding its better reproducibility than its five-level version [38] and a good agreement between both versions [16]. The elderly, illiterate patients or those with lower education levels had more troubles in discriminating the five levels and remembering them with their corresponding number [41, 42]. As the majority of RTI victims have only school education or none, and 8% are the elderly [43], the 3-level version is preferable. Moreover, less time-consuming technique of data collection like phone interviews is critical in long-lasting projects, because frequent connections with patients are needed to complete defined tools. Phone surveys enable data collection without unnecessary travel of patients to the study site/ hospital or travel of interviewers to the place of patients’ residence, due to patients’ movement restrictions. Simultaneously, it prevents from exposure to pathogens in the Pandemic COVID-19. Likewise, in-person interview is not financially an easily-conducted method for every researcher/ participant. Additionally, phone interviews require interviewers with special communication skills to continue the communication. However, elderly patients and those with hearing problems may have difficulties with phone survey. Therefore, it is recommended to reassess the tool in this specific target population.

The study found no ceiling and floor effects at any follow-ups and better results compared with studies in China, Greece and Thailand [31, 34, 38]. Contrarily, a research in rural Chinese patients published a high-ceiling effect [16]. Different diseases in various severity could result in different ceiling effects. The inter-countries comparisons may be affected by cross-cultural response pattern, too.

The agreement represented that responses are identical. Satisfactory agreement was revealed between test and re-test results at all three times post injury similar to a study in Thai diabetes [38] in which only the retest data were collected via telephone. However, lower agreement was reported by a Korean study in breast cancer patients for self-administered EQ-5D-3L [44]. Quality of life measured using EQ-5D-3L would be affected by types of injuries.

This study examined the psychometric properties of EQ-5D-3L in patients at one, six, 12 months post injury via phone surveys. We realized that the tool is highly reliable and suitable to apply in phone interviews during follow-ups of injured patients above 18 years. Applying it in the traffic cohort study, we would decline the costs of research since its data could be collected remotely. Also, it would be applied by patients to self-rate their health status.

Regarding the limitations, because the pattern of answers may vary by disease features [38], the finding extrapolating to other groups of patients as well as under 18 RTI patients should be made with caution. It is also recommended to re-assess the tool in specific target population such as elderly people through phone survey due to their possible hearing problems.

 

Declaration

 

Ethics approval and consent to participate: The study respected ethical precepts in research and was performed consistent with the principles of the Declaration of Helsinki. It was evaluated and approved by the Ethics Committee of the Road Traffic Injury Research Center, Tabriz University of Medical Sciences (cohort ethics code: IR.TBZMED.REC.1398.543). An informed consent was required from all participants.

 

Consent for publication: All authors read and approved the final manuscript to be published, and agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.

 

Conflict of interest: None declared.

 

Funding: This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

 

Authors’ contributions: HSB conceived and designed the study. HSB and NS conducted the analyses, interpreted the results, and drafted the manuscript. MAJ, MF, SBH, AR, MG, and FP contributed in the critical drafting and revising the manuscript for important intellectual content.

 

Acknowledgements: The authors would like to thank the participants in the study.

 

 

  1.  

    1. Palmera-Suárez R, López-Cuadrado T, Almazán-Isla J, Fernández-Cuenca R, Alcalde-Cabero E, Galán I. Disability related to road traffic crashes among adults in Spain. Gaceta sanitaria. 2015;29:43-8.
    2. World Health Organization. Global Status Report on Road Safety: time for action. Geneva; 2009.
    3. Sadeghi-Bazargani H, Hasanzadeh K, Salarilak S, Amiri S, Golestani M, Shahedifar N. Evaluating the relationship between adult attention-deficit/hyperactivity disorder and riding behavior of motorcyclists. J Inj Violence Res. 2019;11(1):45-52.
    4. Saadat S, Yousefifard M, Asady H, Moghadas Jafari A, Fayaz M, Hosseini M. The Most Important Causes of Death in Iranian Population; a Retrospective Cohort Study. Emerg (Tehran). 2015;3(1):16-21.
    5. Azami-Aghdash S, Abolghasem Gorji H, Derakhshani N, Sadeghi-Bazargani H. Barriers to and Facilitators of Road Traffic Injuries Prevention in Iran; A Qualitative Study. Bull Emerg Trauma. 2019;7(4):390-8.
    6. Bazargan-Hejazi S, Ahmadi A, Shirazi A, Ainy E, Djalalinia S, Fereshtehnejad SM, et al. The Burden of Road Traffic Injuries in Iran and 15 Surrounding Countries: 1990-2016. Archives of Iranian medicine. 2018;21(12):556-65.
    7. Samadirad B, Khodadoost M, Sadeghi-Bazargani H, Shahedifar N, Heydari ST, Gholamzadeh S. Epidemiology of Fatal Traffic Injuries Registered in East Azerbaijan and Fars Forensic Medicine Organizations: The Research Protocol. Journal of Clinical Research. 2016;5.
    8. Sadeghi-Bazargani H, Samadirad B, Shahedifar N, Golestani M. Epidemiology of Road Traffic Injury Fatalities among Car Users; A Study Based on Forensic Medicine Data in East Azerbaijan of Iran. Bull Emerg Trauma. 2018;6(2):146-154.
    9. Homayoun SB, Bahram S, Mina G, Nasrin S, Milad J. Analysis of Provincial Mortalities Among Bus/Minibus Users Over Twelve Years, East Azerbaijan, Iran. J Med Life. 2018;11(4):312-319.
    10. Mousazadeh Y, Sadeghi-Bazargani H, Janati A, Poustchi H, Zakeri R, Shafiee-Kandjani AR, et al. Functional Consequences of Road Traffic Injuries: Preliminary Results from PERSIAN Traffic Cohort (PTC). Trauma Monthly. 2021;26(6):294-304.
    11. Shahedifar N, Sadeghi-Bazargani H, Asghari-Jafarabadi M, Farahbakhsh M, Bazargan-Hejazi S. Psychometric properties of the 12-item WHODAS applied through phone survey: an experience in PERSIAN Traffic Cohort. Health Qual Life Outcomes. 2022;20(1):106.
    12. Rencz F, Gulácsi L, Drummond M, Golicki D, Prevolnik Rupel V, Simon J, et al. EQ-5D in Central and Eastern Europe: 2000-2015. Qual Life Res. 2016;25(11):2693-2710.
    13. Christell H, Birch S, Horner K, Lindh C, Rohlin M. Economic evaluation of diagnostic methods used in dentistry. A systematic review. Journal of dentistry. 2014;42(11):1361-71.
    14. Rowen D, Azzabi Zouraq I, Chevrou-Severac H, van Hout B. International Regulations and Recommendations for Utility Data for Health Technology Assessment. Pharmacoeconomics. 2017;35(Suppl 1):11-19.
    15. Dastourani A, Mansour Sohani S, Shah Ali S, Nurizadeh dehkordi S. Reliability and Validity of the Persian Version of the European Quality of life Questionnaire (EQ-5D-3L) in Patients with Meniscus and Knee Ligaments Injury. Journal of Paramedical Sciences & Rehabilitation. 2018;7(4):73-82.
    16. Jiang J, Hong Y, Zhang T, Yang Z, Lin T, Liang Z, et al. Comparing the measurement properties of the EQ-5D-5L and the EQ-5D-3L in hypertensive patients living in rural China. Qual Life Res. 2021;30(7):2045-2060.
    17. Sadeghi-Bazargani H, Shahedifar N, Somi MH, Poustchi H, Bazargan-Hejazi S, Asghari Jafarabadi M, et al. PERSIAN Traffic Safety and Health Cohort: a study protocol on postcrash mental and physical health consequences. Inj Prev. 2022;28(3):269-279.
    18. Kim SH, Jo MW, Lee JW, Lee HJ, Kim JK. Validity and reliability of EQ-5D-3L for breast cancer patients in Korea. Health Qual Life Outcomes. 2015;13:203.
    19. Sadeghi-Bazargani H, Sadeghpour A, Lowery Wilson M, Ala A, Rahmani F. Developing a National Integrated Road Traffic Injury Registry System: A Conceptual Model for a Multidisciplinary Setting. J Multidiscip Healthc. 2020;13:983-996.
    20. Bujang MA, Baharum N. A simplified guide to determination of sample size requirements for estimating the value of intraclass correlation coefficient: a review. Archives of Orofacial Science. 2017;12(1).
    21. Pattanaphesaj J, Mahidon M, Phēsatchasāt MMK. Health-related Quality of Life Measure (EQ-5D-5L): Measurement Property Testing and Its Preference-based Score in Thai Population: Mahidol University; 2014.
    22. Marin S, Sadeghi-Bazargani H, Farahbakhsh M, Ala A, Poustchi H, Pourasghar F. The protocol for validating phone interview tools on post-discharge consequences of road traffic injuries. J Inj Violence Res. 2020;12(3):45–51.
    23. In: EuroQol Research Foundation. EQ-5D-3L User Guide. 2018. Available from: https://euroqol.org/publications/user-guides.
    24. Andresen EM. Criteria for assessing the tools of disability outcomes research. Arch Phys Med Rehabil. 2000;81(12 Suppl 2):S15-20.
    25. Koo TK, Li MY. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J Chiropr Med. 2016;15(2):155-63.
    26. Abedzadeh-Kalahroudi M, Razi E, Sehat M, Asadi-Lari M. Psychometric properties of the world health organization disability assessment schedule II -12 Item (WHODAS II) in trauma patients. Injury. 2016;47(5):1104-8.
    27. McGraw KO, Wong SP. Forming inferences about some intraclass correlation coefficients. Psychological Methods. 1996;1:30-46.
    28. de Raadt A, Warrens MJ, Bosker RJ, Kiers HAL. A Comparison of Reliability Coefficients for Ordinal Rating Scales. Journal of Classification. 2021;38(3):519-43.
    29. Cicchetti DV. Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychological assessment. 1994;6(4):284.
    30. Bizhan Shabankhani. Assessing the inter-rater reliability for nominal, categorical and ordinal data in medical sciences. Archives of Pharmacy Practice. 2020;11(4):144-8.
    31. Fang H, Farooq U, Wang D, Yu F, Younus MI, Guo X. Reliability and validity of the EQ-5D-3L for Kashin-Beck disease in China. Springerplus. 2016;5(1):1924.
    32. Sadeghi-Bazargani H, Aboubakri O, Asghari-Jafarabadi M, Alizadeh-Aghdam MB, Imani A, Tabrizi JS, et al. Psychometric properties of the short and ultra-short versions of socioeconomic status assessment tool for health studies in Iran (SES-Iran). Journal of Clinical Research. 2016;4.
    33. Giavarina D. Understanding Bland Altman analysis. Biochem Med (Zagreb). 2015;25(2):141-51.
    34. Yfantopoulos JN, Chantzaras AE. Validation and comparison of the psychometric properties of the EQ-5D-3L and EQ-5D-5L instruments in Greece. Eur J Health Econ. 2017;18(4):519-531.
    35. Anjos DB, Rodrigues RC, Padilha KM, Pedrosa RB, Gallani MC. Reliability and construct validity of the Instrument to Measure the Impact of Valve Heart Disease on the Patient’s Daily Life. Rev Lat Am Enfermagem. 2016;24:e2730.
    36. McHugh ML. Interrater reliability: the kappa statistic. Biochem Med (Zagreb). 2012;22(3):276-82.
    37. Emrani Z, Akbari Sari A, Zeraati H, Olyaeemanesh A, Daroudi R. Health-related quality of life measured using the EQ-5D-5 L: population norms for the capital of Iran. Health Qual Life Outcomes. 2020;18(1):108.
    38. Pattanaphesaj J, Thavorncharoensap M. Measurement properties of the EQ-5D-5L compared to EQ-5D-3L in the Thai diabetes patients. Health Qual Life Outcomes. 2015;13:14.
    39. Zhou W, Shen A, Yang Z, Wang P, Wu B, Herdman M, et al. Patient-caregiver agreement and test-retest reliability of the EQ-5D-Y-3L and EQ-5D-Y-5L in paediatric patients with haematological malignancies. Eur J Health Econ. 2021;22(7):1103-1113.
    40. Rezaei S, Woldemichael A, Ahmadi S, Mohamadi Bolbanabad A, Zahir Abdullah F, Piroozi B. Comparing the properties of the EQ-5D-5L and EQ-5D-3L in general population in Iran. Int J Health Plann Manage. 2021;36(5):1613-1625.
    41. Liang Z, Zhang T, Lin T, Liu L, Wang B, Fu AZ, et al. Health-related quality of life among rural men and women with hypertension: assessment by the EQ-5D-5L in Jiangsu, China. Qual Life Res. 2019;28(8):2069-2080.
    42. Yang F, Jiang S, He XN, Li HC, Wu HY, Zhang TT, et al. Do Rural Residents in China Understand EQ-5D-5L as Intended? Evidence From a Qualitative Study. Pharmacoecon Open. 2021;5(1):101-109.
    43. Sadeghi-Bazargani H, Samadirad B, Shahedifar N, Golestani M. Epidemiology of Road Traffic Injury Fatalities among Car Users; A Study Based on Forensic Medicine Data in East Azerbaijan of Iran. Bull Emerg Trauma. 2018;6(2):146-54.
    44. Kim SH, Jo MW, Lee JW, Lee HJ, Kim JK. Validity and reliability of EQ-5D-3L for breast cancer patients in Korea. Health Qual Life Outcomes. 2015;13:203.