Open Access Article
Xu Zhang†
*a,
Kaiyan Hu†bcd,
Jing Xue†bcd,
Zhe Wangbcd,
Yanbiao Jiange,
Zhenyu Zouh,
Jinwei Yangf,
Mingyue Jiaog,
Shuo Yang‡
h,
Yingmo Shenh,
Yusha Liua,
Xingzhi Lii,
Wenbo Liu
*a,
Bin Ma
*bcdjk and
Jie Chen‡*h
aCenter for Medical Device Evaluation, National Medical Products Administration, Beijing 100871, P. R. China. E-mail: bobole@live.cn
bEvidence-Based Medicine Center, School of Basic Medical Sciences, Lanzhou University, Lanzhou, 730000, P. R. China. E-mail: kittymb2017@163.com
cResearch Center for Medical Device Regulatory Science, Lanzhou University, Lanzhou, 730000, P.R. China
dIndustry Center for Evidence-Based Research and Evaluation Standards in Medical Devices, Lanzhou, Gansu Province 730000, P.R. China
eGansu Provincial Hospital, Lanzhou, 730000, P. R. China
fCenter for Reproductive Medicine, Gansu Provincial Maternal and Child Health Hospital (Gansu Provincial Central Hospital), Lanzhou, Gansu 730050, P. R. China
gSchool of Life Sciences, Tsinghua University, Beijing 100084, P. R. China
hDepartment of Hernia and Abdominal Wall Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing 100043, P. R. China. E-mail: chenjiejoe@vip.sina.com
iSchool of Basic Medical, Xinxiang Medical University, Xinxiang 453000, P. R. China
jResearch Unit of Evidence-Based Evaluation and Guidelines, Chinese Academy of Medical Sciences (2021RU017), School of Basic Medical Sciences, Lanzhou University, Lanzhou 730000, P. R. China
kKey Laboratory of Evidence Based Medicine of Gansu Province, Lanzhou 730000, P. R. China
First published on 5th November 2025
This review investigates the variability in the experimental methodology used in animal studies exploring the optimal composite mesh for incisional hernia (IH) repair. Eight databases were searched from inception to April 1, 2023. Animal studies conducted to evaluate the anti-adhesion effect of the composite mesh were included. Standardized forms were used to extract the experimental design characteristics. The extracted data were presented in tabular format and summarized using frequency analysis. The inherent risk of bias in the included studies was assessed using SYRCLE's risk of bias tool for animal studies. The results showed that 71 studies were included in the final analysis. Rats represented the most common animal (65%) used for studies. Conventional models (92%), high-adhesion models (4%), and abdominal cavity pollution models (4%) were reported in the included studies. The sample size of animals varied between studies (2–31/group). A variety of quantitative (calculation of adhesion area or testing of adhesion strength) and qualitative (45 assessment systems) adhesion assessment methods were reported. One month (41%) and 1 week (30%) were the most common time points used to evaluate the adhesion. The results of the risk of bias assessment showed that, of the 71 animal studies included, only one was a randomized controlled study, and only two studies reported that animal breeders and investigators were blinded. In conclusion, a large number of animal studies have been conducted to explore the ideal intraperitoneal anti-adhesive composite mesh for IH repair. However, these animal studies have significant differences in animal models, implantation procedures, control selection, and adhesion assessment. These differences directly affect the comparability between studies and the reproducibility of the studies.
000.9
Although recent years have seen a growing number of review articles on incisional hernia management, most have concentrated on clinical treatment strategies and mesh applications in surgical practice. For example, Quiroga-Centeno et al. conducted a scoping review of emergency incisional hernia repair, outlining current clinical approaches including timing of intervention, surgical techniques, and postoperative management. They highlighted the lack of standardized terminology and high-quality evidence in this area.10 Similarly, Najm et al. reviewed currently available abdominal wall meshes and emerging materials, focusing on their biological properties, postoperative complications, and performance in clinical trials.11 However, neither review addressed the preclinical evaluation of anti-adhesion composite meshes in animal models. Given the critical role of animal studies in the preliminary assessment of mesh safety and efficacy, systematically examining experimental design variability, scoring systems, and observation durations is essential for facilitating clinical translation.
Identifying an ideal mesh for IH has been a hot topic of tremendous interest for general surgeons and medical device companies for decades. Composite meshes represent the ideal design for IPOM applications in recent years,12 as these are comprised of a permanent synthetic mesh material on the parietal side and an adhesion barrier layer on the visceral side. The mesh side is intended to promote tissue ingrowth and anchor the prosthesis to the abdominal wall, whereas the barrier layer prevents adhesion of the abdominal viscera to the underlying mesh.12 Composite meshes can be divided into two basic categories: meshes with permanent barrier layers, and meshes with absorbable (i.e., temporary) barrier layers or coatings.
The anti-adhesion effect of composite mesh is a critical parameter reflecting product safety, while animal studies are usually performed to evaluate the anti-adhesion effects of the barrier layers applied to meshes before potential translation to clinics.13 Adhesion formation to the mesh can only be researched using experimental models, since patients cannot be reoperated for evaluation of this key aspect.14 Animal models for incisional hernia mesh repair can be broadly categorized into three types: the conventional model, the high-adhesion model, and the abdominal cavity pollution model. The conventional model typically involves creating an abdominal wall defect and repairing it with mesh under sterile conditions with intact peritoneum, leading to baseline levels of mesh-viscera adhesions. The high-adhesion model deliberately promotes extensive adhesions—for example, by abrading the peritoneal surface of the bowel to mimic severe injury (the Harris adhesion model)—resulting in markedly higher adhesion formation.15 The abdominal cavity pollution model simulates infected or contaminated surgical fields by introducing bacterial inoculum or other contaminants into the abdominal cavity at the time of mesh implantation; for instance, inoculating methicillin-resistant Staphylococcus aureus (MRSA) after mesh placement produces an effective infection model.16 The purpose of the adhesion testing is to provide evidence of a product's safety, and to demonstrate that a novel composite mesh is at least substantially equivalent, if not superior, to an established product.17 Preclinical in vivo testing data has been commonly used in regulatory submission of mesh products.18,19 However, there has been no recognized gold standard for the methods used to evaluate meshes in vivo for several pivotal characteristics, such as inflammation, shrinkage, ingrowth, remodeling, and adhesion formation to the mesh.
The Food and Drug Administration (FDA) has developed a guidance document outlining suitable testing methods for new surgical mesh products;18 however, the guidance does not provide details on the technical aspects of in vivo studies that should be conducted and analyzed. In 2021, Whitehead-Clarke et al.17 performed a scoping review of relevant studies to analyze the methodologies used for in vivo hernia mesh testing, and they found that standardization is absent from the current practice of in vivo mesh testing. There has been significant inconsistency in the methodology of every category of testing, encompassing mechanical testing, histology-structural analysis, and histology-inflammatory cellular analysis. Another systematic review conducted in 2020 (ref. 20) examined the experimental methodology behind in vivo testing of hiatus hernia and diaphragmatic hernia mesh and also found significant variation between existing studies.
A variety of composite meshes reducing adhesion have been developed and the number of animal studies exploring the optimal composite mesh for IH repair has been increasing. However, to our knowledge, no studies have investigated their experimental methodologies. Therefore, our scoping review analyzed the variability in this area of in vivo testing of IH composite meshes, contributing to identifying the need for standardization in the field and the areas in which standardization attracts clinical attention.
The eligibility criteria related to the report characteristics were as follows: (1) language of publication: English or Chinese; and (2) status of publication: abstracts of studies were excluded.
The extracted information included the following: (1) general study characteristics (the first author and year of publication); (2) animal species, weight, age, and sex; (3) the number of animals used; (4) whether the sample size was calculated; (5) type of animal model; (6) whether the modeling method was reported; (7) whether published standard animal models were cited; (8) barrier layers and mesh layers of composite meshes; (9) product name of the marked composite meshes; (10) time points of meshes and tissues explanted; (11) methods for assessing adhesions.
Thirty-nine percent (39%) (28/71) reported the quantitative estimation/calculation of adhesion area. The calculation/estimation tools involved in these studies are image analysis software, planimetry by digital caliper equipped with a liquid crystal display screen, and macroscopic evaluation, and 2 cm × 2 cm side length grid estimation. Among the 28 studies, nine did not report which tool was used for quantitative calculation/estimation of adhesion area. In all 71 included studies, four reported quantitative testing of adhesion strength with tensile testing equipment.
Among the studies, a qualitative adhesion assessment by macroscopic observation was reported in 92% (65/71). Departing from the assessment systems used to evaluate the existence and severity of adhesions in 61 studies, no assessment systems were used in four studies. A total of 45 assessment systems were involved, and 40 of them used severity level to grade adhesions.
In 65 studies, 37 used a single indicator system for adhesion assessment, involving 24 systems, as outlined in Appendix V. There were four assessment systems to grade adhesion based on the percentage area of the mesh surface covered, and four different severity levels of adhesions were used in these assessment systems. The frequency of adhesion scores appearing in different evaluation systems is shown in Fig. 10. Four systems were assessed based on the adhesion scope, involving the abdominal organs, and three of them used severity level. Thirteen systems were based on the adhesion strength between meshes and tissues, and 12 of them used severity level. Three systems were based on the adhesion appearance, and all of them used severity level.
In 28/65 studies, a combined indicator system was used for adhesion assessment. Twenty-one systems were involved in these studies and are outlined in Appendix VI. Among these systems, 12 considered two indicators (Scope and Appearance, 3; Strength and Appearance, 7; Area and Appearance, 1; Strength and Area, 1); 9 considered three indicators (Area and Appearance and Strength, 7; Strength and Area and Scope, 1; Strength and Area and Scope, 1), and 18 used severity level to grade adhesions, covering ten different grading levels.
Animal models are central elements in the design, implementation, and evaluation of animal studies. The selection of animal models should be evaluated comprehensively. In terms of animal models of human disease, the higher the degree to which human disease characteristics can be reproduced or simulated, the better the model.14,27 The similarity of disease characteristics is generally the first consideration in the selection and design of a particular animal model for study consideration.27 This relates to the correlation between animal studies and clinical applications, and is important for maximizing the use of the evidence obtained from future animal studies and minimizing the risks of clinical trials.27 Our scoping review found that the animal models of hernia used to study the anti-adhesive effects of composite meshes involved small animals in rats (65%) and rabbits (28%) and large animals in pigs (7%) and dogs (1%). The rat model is the most commonly used animal model for IHs. Over 50% of all experimental hernia research focused on rat models.28,29 Rats are economical, simple to manipulate, and their genetic variation can be easily controlled through inbred lines,27 but they are less anatomically and physiologically compatible with humans.30 On the one hand, they differ significantly from humans in terms of abdominal wall defect size and fascial thickness. On the other hand, rats have superior tissue healing ability to humans. Experts in the field have pointed out that the porcine model is physiologically the most appropriate choice to better mimic the structure of the human abdominal wall, particularly in terms of abdominal wall strength.31 Recently, van den Hil et al.32 found similar histological findings in rats and humans in terms of adhesion formation and foreign body response to meshes. This means that rats may not be the best animal model for evaluating the effect of hernia mesh repair, but may be one of the more desirable options for evaluating the anti-adhesive effects of composite meshes.
During one study, the growth of the animal had a significant impact on the evaluation of the efficacy of mesh repair for IHs. In 1975, Cerise et al.,33 in an animal study of meshes, found that the rats doubled in size during the study. On the one hand, the increase in volume reflects the increase in the rupture strength of the abdominal wall, and this change will directly affect the evaluation of the mechanical strength of meshes. On the other hand, changes in animal volume have an impact on the interaction between the meshes and tissues, which in turn may affect the formation of adhesions. Of the included animal studies, the majority of studies (87%) reported only the animal weight at the start of the study and only 27% reported the animal age at the start of the study. The animals used for the study were reported to be quite young. To exclude the influence of animal growth and development on the experimental results, adult animals with stable body weight (size) should be used for future studies, and changes in animal weight before and after the experiment should be reported. Our scoping review found that animal sex was not reported in 25% of studies. Of the studies that reported the animal sex used, the majority (66%) used only male animals. Although there is no relevant evidence that the efficacy of mesh repair of IHs differs between male and female patients, to avoid potential physiological differences between the sexes from influencing experimental results,34 we recommend that future studies balance and report animal sex in experimental studies of mesh repair of IHs in accordance with the Animal Research Reporting of In Vivo Experiments (ARRIVE) guidelines35 and the recently published National Institutes of Health (NIH) policy.36
The majority of studies (92%) used a conventional IH model, and only a few studies simulated postoperative pathological abdominal adhesions (4%) and infections (4%) based on the IH model. The preservation of the animal's intact peritoneum in the former IH model, without any intestinal abrasion, is closer to the clinical reality of IH. A more complex clinical application scenario is created in the latter IH model, where the friction of the intestinal canal and peritoneum, and surgical infection in the modeling modality are all associated with higher probability of adhesions in the animal model, and the results obtained are more favorable for the evaluation of complex clinical application scenarios. Most of the studies (93%) reported the modeling method of the IH model, but we found that only 10 of these studies cited an IH model from previous studies. The current method of IH modeling varies significantly between studies.14 Differences in animal modeling make it difficult to replicate study results and to directly compare results across the literature,12,37 preventing direct access to information about the advantages and disadvantages of multiple commercially available products or developed biomaterials, and resulting in difficulties translating study results into clinical practice .28,29,38 Therefore, we believe that it is appropriate to clarify the primary study objectives and the main evaluation measures when designing animal studies, limit the available modeling animals to a small scope focused on the primary study objectives and the main evaluation measures, and establish standardized models and cite them clearly, which may increase the impact of future publications and thus facilitate the clinical translation of the study results.39–42
For the regulatory clearance of new meshes, the FDA suggests the use of marketed products as standard controls for animal studies.18 The use of standard controls helps to ensure the reproducibility of animal studies, and is beneficial for correctly determining the equivalence of the new composite meshes in the experimental group to their commercially available controls in terms of safety and efficacy. A total of 44 composite meshes or materials were covered in our scoping review, 15 of which were marketed products. 39% of animal studies used marketed products as a control. These products were used to validate the anti-adhesive effect of new composite meshes. In contrast, 16% of animal studies did not use any marketed products as controls. The rationale for choosing controls in these studies was relevant to the study's purpose. In the early stage of material design, it is necessary to select material combinations with different ratios as controls to determine the best material formulation for reducing adhesion. We suggest that animal studies aiming at obtaining evidence of safety and efficacy should use marketed meshes whose clinical benefits have been validated as controls, allowing researchers to make reliable judgments on whether the new composite meshes have the value of clinical translation, improve the value of animal studies, and reduce the waste of resources. Meanwhile, accumulating evidence suggests that chemical modifications can markedly influence the adhesion behavior of surgical meshes. One review highlighted that chemical alterations to the surface or internal structure of biomaterials, particularly through the incorporation of functionalized nanofillers such as poly(methyl methacrylate) grafted carbon nano-onions (PMPMA CNOs), can significantly enhance both the biological performance and mechanical integrity of composite materials.43 Such strategies provide a strong foundation for the design and development of advanced implantable biomaterials, including anti-adhesion surgical meshes.
For adhesion evaluation measures, our scoping review found that a range of reported adhesion assessment methods were used in the pertinent literature, including quantitative assessment methods (adhesion area calculation and adhesion strength testing) and qualitative assessment methods (45 qualitative assessment systems). Qualitative assessment of adhesions (92%) was used in most studies. In the current systems for qualitative adhesion assessment, the severity of adhesions is classified according to one or more of four classification indicators: adhesion area, adhesion strength, adhesion extent, and observation of adhesion appearance. Some of the qualitative assessment systems examined the same adhesion classification indicators, but there were multiple ways of grading the severity of adhesions between studies.
No standard adhesion scoring system is widely accepted and used internationally.24,44,45 In 2019, the National Medical Products Administration (NMPA) issued a guidance document titled Technical Review of Animal Study of Intraperitoneal Internal Hernia Mesh,46 which suggested relevant requirements for the animal studies of composite hernia meshes under application for medical device registration in China. In this document, the relevance of animal studies to clinical practice was considered, incorporating the clinical applicability when establishing criteria for adhesion evaluation. Considering that mild adhesions without serious adverse events are clinically acceptable, in this guidance document, the rate of excellent adhesions was considered as a criterion to determine the acceptability of adhesions between meshes and tissues in animal studies. Moreover, since adhesion intensity and adhesion area show different effects on adhesion-related complications, such as intestinal fistula and intestinal obstruction, the “excellence” rate of adhesion is set as a composite indicator and there is an accompanying comprehensive evaluation table. In the evaluation table biased toward adhesion strength, adhesion area level 2 and adhesion strength level 3 are considered unacceptable, while adhesion area level 3 and adhesion strength level 2 are acceptable. Accordingly, investigators need to reach a consensus on a standard evaluation system for the anti-adhesion effects of meshes in animal studies. To better achieve clinical translation of animal study results, future animal studies on adhesion evaluation criteria need to be linked to those of clinical adhesions.
In the present study, the follow-up duration for adhesion evaluation varied widely among the included literature, ranging from 1 day to 1 year, most commonly at 1 month (41%) and 1 week (30%) time points. In animal studies, whether or not the barrier/coating of the composite mesh is degraded and the variability in degradation period may be one of the main reasons for the difference in follow-up time.47 After implantation of the composite mesh in animals, an extensive inflammatory response may be provoked by contact between materials and abdominal tissues, which is one of the factors contributing to the development of adhesions.48 Barrier materials such as polycaprolactone (PCL) and collagen exhibit markedly distinct degradation behaviors. PCL, a slow-degrading synthetic polymer, remains in situ for several months to years,49 offering prolonged separation between viscera and mesh throughout various stages of tissue healing. In contrast, collagen-based barriers are natural biomaterials that are enzymatically broken down within weeks after implantation.50 One study demonstrated that collagen-coated mesh significantly reduced adhesions at 7 days, but by 30 days post-op the collagen layer had been phagocytosed and adhesions had markedly increased.48 Conversely, PCL barriers maintain physical isolation over a longer period, thereby minimizing adhesion formation until peritoneal regeneration is complete. Accordingly, the degradation timeline of the barrier layer should be carefully aligned with the biological window of peritoneal healing to maximize anti-adhesion efficacy. Reaction associated with surgical trauma is another factor contributing to adhesions. In accordance with ISO 10993-6:2007,51 it may be difficult to distinguish between implant-induced or surgically-induced local tissue reactions in the first two weeks post-implantation procedure. According to the consensus of experts in this field,24 a follow-up of 4 weeks or less is appropriate to assess short-term inflammatory responses in animal studies. In addition, reparation and peritonealization of the peritoneal mesothelial cell layer are critical factors in preventing adhesions in the abdominal cavity. Previous studies52,53 noted that it typically takes 8 days for complete healing of the mural peritoneum. In our work, it is concluded that for the follow-up time of animal studies in this field, the observation time point and follow-up time should be determined by combining the barrier/coating degradation cycle of composite meshes and the duration of chronic inflammatory regression to ensure the scientific validity of outcome observations.
There were large differences in the animal sample size between studies, and none of the studies reported the calculation of sample size. In animal studies, it is crucial to determine the sample size, i.e., the number of animals per group. The sample size should meet the requirements of scientific research validity, but also comply with the ethical guidelines, national laws and regulations, and the 3rs principles of limits on the number of animals used.27 There are two erroneous tendencies in determining the number of experimental animals in animal studies, too small or too large, and too small occupies a large proportion of these two erroneous tendencies.27 Too small a sample size of animals leads to too little test efficacy in detecting meaningful or biologically significant results. Regarding how to scientifically determine the number of experimental animals when designing animal studies, the website of the Center for Biomathematics, Department of Pediatrics at Columbia University Medical Center has provided relevant calculation methods and formulas for different effect sizes,54 future studies should incorporate power analysis during the design phase to calculate the minimum number of animals required. We recommend utilizing online sample size calculators, such as those provided by the Columbia University Biostatistics Center, which allow for calculation based on parameters including anticipated effect size, significance level (α), and statistical power (1 − β).55 Researchers should also clearly report the methods and parameters used for sample size estimation in their publications to ensure transparency and reproducibility.
The studies included in this scoping review have major limitations in terms of experimental design and implementation, especially in terms of randomization, blinding, and allocation concealment, resulting in a high risk of various biases that affect the internal veracity of animal studies.56–58 In the 71 studies, only one reported a randomized approach, where no concealed grouping was implemented, and 53 studies had uneven baseline characteristics and might have been subject to selective bias. Additionally, as an interventional animal study in surgery, it is crucial that the study executor and outcome assessors are blinded during the surgical performance and observation of subjective measures of adhesions, which could reduce implementation bias and measurement bias and improve the reliability and authenticity of the experimental results. Unfortunately, the executers and outcome assessors were unblinded in 52 studies. We recommend that future animal studies in this field be designed and conducted according to the risk of bias tool SYRCLE,26 with an attempt to improve the internal veracity of the studies and ensure that the results are based on high-quality and unbiased data.
We further recommend standardizing several key parameters of animal experiments, in particular the type of animal model, follow-up time points, and adhesion assessment scoring system. First, researchers should select and clearly define the type of animal model (conventional, high-adhesion, or abdominal cavity pollution) appropriate for their study objectives, and explicitly state this in publications, to ensure results are contextually relevant and comparable.15,16 Second, follow-up observation time points should be standardized, encompassing at a minimum an early phase (e.g., 7–14 days post-surgery) and a longer-term phase (e.g., 4–12 weeks post-surgery), to evaluate initial adhesion formation as well as its progression over time. Consistent time-point scheduling will facilitate direct comparisons of outcomes across different studies. Finally, drawing on the multiple adhesion scoring criteria compiled in this review, we advocate a unified multi-dimensional scoring framework for evaluating mesh adhesions. This scoring system includes metrics such as adhesion area (the percentage of mesh surface involved), adhesion extent (the number of adhesion bands and organs attached), adhesion tenacity (the force required to separate adhesions, reflecting their firmness), and adhesion appearance (e.g., fibrous tissue thickness and degree of vascularization). Each metric can be graded on a scale (e.g., 0 = no adhesion up to 3 or 4 = most severe adhesion) to quantitatively capture the overall severity of mesh adhesions. Employing such a standardized multi-parameter scoring system will reduce biases arising from disparate assessment methods and improve the objectivity and comparability of adhesion outcomes between studies.
To improve the comparability and reliability of animal study results, it is recommended that future researchers adhere to the ARRIVE guidelines, the core principles of ISO 10993-6, and the key elements outlined by Cheng et al.59–61 These include a clear definition of study objectives, appropriate selection of test devices and controls, careful choice of animal models, adequate sample size, rational follow-up periods, robust outcome measures, comprehensive reporting, and implementation of an effective quality management system. By embracing the “3R + DQ” principle—replacement, reduction, refinement, combined with design and quality—researchers can enhance study quality, support inter-study standardization and reproducibility, and ultimately accelerate the clinical translation of anti-adhesion meshes for incisional hernia repair.
Supplementary information: Appendix I (Study protocol), Appendix II (Search strategy), Appendix III (List of included studies), Appendix IV (Description of the components of composite meshes), Appendix V (Rating system with the single indicator for adhesion assessment), and Appendix VI (Rating system with combination indicators for adhesion assessment). See DOI: https://doi.org/10.1039/d5ra02062j.
Footnotes |
| † These authors contributed equally. |
| ‡ Present address: Department of Hernia and Abdominal Wall Surgery, Peking University People's Hospital, Beijing 100044, P. R. China. |
| This journal is © The Royal Society of Chemistry 2025 |