Effects of injury size on local and systemic immune cell dynamics in volumetric muscle loss

Introduction
Volumetric Muscle Loss (VML) is a debilitating condition resulting from the loss of a large portion of muscle due to injury or disease. The lifetime projected disability-related expenses is greater than $400,000 USD per patient1. The current standard of care for VML patients is physical therapy, which has limited benefits2,3. While xenogeneic decellularized extracellular matrix (ECM) scaffolds have shown promise in clinical trials, the results still fall far short of restoring pre-VML functionality4,5. There is a major need for more effective interventions that can promote regenerative tissue healing.
Recent studies have sought to identify a cellular basis for dysfunctional wound healing following VML. Several studies have pointed to macrophages, which are major regulators of wound healing and primary targets to improve tissue regeneration across a wide variety of tissues (reviewed in refs. 6,7). A series of landmark studies showed that implantation of ECM-derived scaffolds promoted regenerative outcomes of VML in mice, associated with skewing of macrophages towards a Th2-associated profile8,9,10. On the other hand, macrophages in fibrotic outcomes generated by poly(caprolactone) (PCL) implants were skewed towards a Th17 profile. Other important studies used critical sized defects in the absence of biomaterial implantation to generate fibrotic outcomes in comparison to smaller injuries that almost perfectly regenerate11,12,13,14,15,16. In general, these studies showed that macrophages in fibrotic injuries simultaneously expressed high levels of pro-inflammatory factors like tumor necrosis factor-alpha (TNFα) and pro-fibrotic factors like transforming growth factor-beta (TGFβ)11,15. At the same time, macrophages in fibrotic injuries expressed lower levels of markers previously associated with regenerative macrophage behavior in other tissues. These markers included CD206, CD301b, CD163, and PDGFBB10,15,16, although at least one study noted elevated levels of CD206+ macrophages in fibrotic VML11. However, it is not known for how long these phenotype changes persist, and the mechanisms behind them are not known, making it difficult to design targeted interventions that correct macrophage behavior for improved tissue regeneration.
The behavior of other infiltrating immune cells to injury sites can influence macrophage phenotype. For example, Sadtler et al. showed that the phenotype of infiltrating T cells are critical for promotion of a regenerative macrophage phenotype in ECM scaffold-promoted regeneration of VML9. However, it is not clear to what extent this effect was specific to the ECM scaffold microenvironment or whether it is true for regenerative outcomes in the absence of ECM scaffolds. Neutrophils have been shown in several disease and injury models to directly influence macrophage phenotype through efferocytosis, the phagocytosis of apoptotic neutrophils that promotes an anti-inflammatory and regenerative macrophage phenotype17,18,19,20,21. If efferocytosis is impaired, neutrophils can progress to secondary necrosis and secrete soluble factors that promote a pro-inflammatory macrophage phenotype22,23,24. Although the effects of neutrophils on macrophage phenotype in VML have not been thoroughly investigated, one study did show that the prolonged presence of neutrophils was detrimental for muscle repair via effects on satellite cells14. To our knowledge, no study has employed a holistic approach to analyze macrophage phenotype in the context of the many immune cell interactions within this system. It is critical to understand how macrophage phenotype dynamics change with other immune cells to design targeted interventions to enhance tissue repair.
Finally, it is not clear to what extent VML influences the entire immune system outside of the local muscle tissue. In major traumatic injury in humans, systemic levels of pro-inflammatory cytokines are known to be elevated in the bloodstream25,26,27,28. In mice, systemic effects of injuries including myocardial infarction, stroke, and sepsis have been shown to influence the behavior of macrophages in other organs like the heart and brain29. In that study, the behavior of immune cells in the reservoir organs (i.e. bone marrow and spleen) was not evaluated. Spleen- and bone marrow-derived immune cells are the principal sources of monocyte-derived macrophages and other leukocytes that populate sites of injury30,31. These studies suggest that critical size muscle defects might have systemic influence on immune cells at the reservoir organs, with possible repercussions on leukocyte recruitment to the site of injury. Therefore, the purpose of this study was to evaluate the local and systemic changes in immune cell behavior in fibrotic vs. regenerative muscle repair, with the goal of identifying possible mechanisms of macrophage dysfunction in fibrotic outcomes. We hypothesized that the magnitude of the injury in fibrotic muscle repair has systemic repercussions that result in dysfunctional behavior in monocyte-derived macrophages, ultimately leading to worse healing outcomes.
To investigate how the size of the injury influences local and systemic changes in immune cell behavior, we utilized a murine model of VML in which small (2 mm) defects in the quadriceps completely regenerate while critically sized (4 mm) defects result in fibrotic repair with both reduced quantity and quality of repair tissue32. We characterized local macrophage phenotype and other immune cell profiles in the muscle via multi-dimensional flow cytometry and multiplex gene expression analysis (nanoString). Then, we investigated systemic effects of the two injury types on immune cell behavior and macrophage phenotype at the major immune cell reservoirs, the bone marrow and the spleen, using flow cytometry, and we measured systemic levels of immunomodulatory factors in blood serum through multiplex cytokine analysis (Luminex). Finally, we investigated the effects of reducing the levels of neutrophils in the larger injury size to those observed in the smaller injury size. These findings provide critical insight into the mechanisms behind dysfunctional immune cell dynamics in fibrotic muscle injuries.
Results
Altered macrophage behavior at early stages of fibrotic muscle repair
To evaluate the mechanism driving pathological macrophage phenotype following VML, we utilized a murine model in which a critically sized defect (4 mm) in the quadriceps leads to fibrotic outcomes, while smaller defects (2 mm) lead to regeneration14,32 (Fig. 1A, Supplementary Fig. 1). By 28 days, the muscle tissue in the fibrotic group was characterized by higher levels of collagen staining, lower overall muscle mass, and higher sulfated glycosaminoglycans (sGAGs) content, compared to the regenerative group and to naïve (uninjured) tissue, while no differences were observed between the regenerative and the naïve groups (Fig. 1B). We next characterized the local and systemic immune response in these groups via multidimensional flow cytometry, multiplex gene expression analysis (nanoString) (Supplementary Table 1), multiplex cytokine analysis (Luminex), and histology and immunohistochemistry over 28 days post injury (Fig. 1A).

A Overview of the model and representative images of Masson’s trichrome staining. Image created in BioRender.com. B Area of collagen per section (Masson’s trichrome), overall muscle mass, and sulfated glycosaminoglycans (sGAGs) content. One way ANOVA, Tukey’s post hoc. N = 6, mean ± SD. **p < 0.01; ***p < 0.001; ****p < 0.0001.
The numbers of total myeloid cells (CD45+) and macrophages (CD45+B220-CD3-NK1.1-Ly6G-F4/80+) in the muscle increased dramatically after injury in both the regenerative and fibrotic outcome groups, with higher numbers of macrophages in the fibrotic group at days 3 and 28 (Fig. 2A–C). Interestingly, as a percentage of CD45+ cells, there were slightly fewer macrophages in the fibrotic group at day 1 compared to the regenerative group (Fig. 2C). The phenotypes of macrophages in the muscle were characterized via flow cytometry using a panel of 9 common macrophage phenotype markers, which we found to be differentially regulated by macrophages treated in vitro with pro-inflammatory (Th1), Th2, or Th17 stimuli (Supplementary Figs. 2–4). While overall temporal trends in marker expression were similar between groups, the magnitude of marker expression levels differed in macrophages from the regenerative or the fibrotic outcome groups as early as day 1 and persisted until day 28 (Fig. 2D, Supplementary Fig. 3). Compared to macrophages in the regenerative group, macrophages in the fibrotic outcome group generally expressed higher levels of CD9 (previously associated with fibrotic macrophages), CD38 and PDL1 (pro-inflammatory markers that have anti-inflammatory activity), Arg1 (reparative marker involved in collagen production), and CXCR4 (involved in cell recruitment), depending on the time point (Fig. 2D, Supplementary Fig. 3). Macrophages in the fibrotic group also expressed lower levels of CD86 (pro-inflammatory marker involved in antigen presentation), CD163, CD301b, and CD206 (regenerative markers involved in phagocytosis). Macrophages in the fibrotic group exhibited a phenotype that did not resemble any of the reference phenotypes characterized in vitro (Supplementary Fig. 4).

A CD45+ cells presence in muscle. B Macrophage presence in muscle as a function of all cells. C Macrophage presence in muscle as a function of CD45+ cells. D Muscle macrophage phenotype markers. E Multidimensional analysis of macrophages at day 7. Two-way ANOVA, Sidak’s post hoc test. N = 6, bars show mean ± SD. *p < 0.0.5; **p < 0.01; ***p < 0.001; ****p < 0.0001.
To uncover macrophage subpopulations, we employed multidimensional analysis via XShift and FlowSOM clustering algorithms at days 1, 3 and 7 post injury (Fig. 2E, Supplementary Fig. 5). While few differences between groups were observed at days 1 and 3 (Supplementary Fig. 5), several differences were apparent at day 7 (Fig. 2E). Macrophages from the fibrotic group were present in higher numbers in clusters 1 and 4, which were characterized by low levels of all markers (cluster 1) or intermediate expression of CD206, CXCR4, CD86, and CD38 (cluster 4) (Fig. 2E). In contrast, macrophages from the regenerative group were present in higher numbers in clusters 2, 5 and 8. While clusters 5 and 8 were characterized by low expression of all the markers, cluster 2 was characterized by very high expression of Arg1 and intermediate expression of CXCR4 and CD86. These results show that macrophages in each group are heterogeneous, with multiple subpopulations existing simultaneously.
To characterize macrophage phenotype more thoroughly at the early time points, we FACS-sorted macrophages (CD45+B220-CD3-NK1.1-Ly6G-F4/80+) from these two injuries at days 1 and 3 post injury (Fig. 3A). We employed multiplex gene expression analysis (nanoString) using a custom-curated panel containing 259 genes related to macrophage phenotype, immune cell recruitment and trafficking, phagocytosis, wound healing, angiogenesis, and fibrosis (Supplementary Table 1). Hierarchical clustering of the top 10% most variable genes ranked by fold change between groups (Fig. 3B) or by Benjamini–Yekutieli adjusted p-value (q-value) (Supplementary Fig. 6) at day 1 showed clear separation between the injury outcomes at both time points. At day 1, there was broad downregulation of genes in the macrophages from the fibrotic outcome group, with 20 out of the top 26 genes expressed at lower levels compared to macrophages from the regenerative group, although these differences were not statistically significant (Fig. 3B, Supplementary Data 1). The downregulated genes were mostly related to tissue repair, ECM remodeling and a reparative macrophage phenotype (Supplementary Table 2). Only two of these genes reached the level of statistical significance based on q-value cut-off of 0.05. Itgb1 and Ly96 were expressed at significantly lower levels in fibrotic group macrophages compared to regenerative group macrophages, while Cxcl2 was expressed at higher levels but not significantly so (q = 0.08). Regenerative and fibrotic group macrophages also clustered separately at Day 3 (Fig. 3D, Supplementary Fig. 6), although no genes reached the level of statistical significance based on q < 0.05 (Supplementary Table 2). Nonetheless, genes that were expressed at higher levels by macrophages in the fibrotic group compared to the regenerative group were related to immune cell trafficking, while genes expressed at lower levels were related to the reparative macrophage phenotype. These results indicate that, at the gene expression level, macrophages in the fibrotic outcome group undergo phenotype changes soon after injury compared to the regenerative outcome group, although with relatively small effect sizes.

A Gating strategy for macrophage sorting (day 1 time point). B Hierarchical clustering of the 10% most variable genes based on fold change (FC) between fibrotic and regenerative outcomes from nanoString multiplex gene expression analysis of FACS-sorted macrophages at B day 1 and C day 3.
We next characterized changes in the whole muscle tissue using the same nanoString gene expression panel. As early as 6 h after injury, the fibrotic outcome group was characterized by higher expression of genes encoding several chemokines involved in immune cell recruitment, such as Cxcl2, Cxcl3, and Ccl4 (q < 0.05, Fig. 4A, Supplementary Table 3). By days 1 and 3 the fibrotic group was also characterized by increased levels of Il36g and Tgfb1, respectively, which have been previously linked to fibrotic VML10,13,15,16. Finally, expression of numerous genes involved in ECM deposition and assembly were decreased in the fibrotic group at days 1 and 3, including Col1a1, Col3a1, Col5a2, Col6a1, and Dcn, while the degradative enzyme Mmp8 and the pro-fibrotic marker Trem2 were upregulated.

A Volcano plots of nanoString from whole muscle tissue at 6 h, 1 day, and 3 days post injury. Multiple t-tests (Benjamini–Yekutieli false discovery rate correction). The genes highlighted on the plot represent the top 5 genes based on fold change. B Immune cell populations in muscle over 7 days. Two-way ANOVA, Sidak’s post hoc. N = 6, mean ± SD. *p < 0.0.5; **p < 0.01; ***p < 0.001; ****p < 0.0001.
Given the upregulation of several chemokines associated with the trafficking of immune cells such as neutrophils and monocytes, we next evaluated immune cell populations in the muscle over 28 days (Fig. 4B, Supplementary Fig. 7). On day 1 after injury, there were nearly twice as many neutrophils in the fibrotic outcome group compared to the regenerative group, both as a percentage of leukocytes (CD45+) (Fig. 4B) and as a percentage of all live cells (Supplementary Fig. 8). There were also more dendritic cells in the fibrotic group at days 3, 7, and 28 when expressed as a percentage of all cells but only at day 7 when expressed as percentage of CD45+ cells. B cells, natural killer (NK) cells, monocytes and macrophages were present in lower numbers in the fibrotic group, depending on the time point (Fig. 4B, Supplementary Fig. 8).
Systemic changes in immune response in fibrotic vs. regenerative outcome groups
Having identified differences in macrophage phenotype in the muscle, we next characterized the behavior of macrophages and monocytes in the spleen and bone marrow, the primary sources of monocyte-derived macrophages that infiltrate sites of injury (Fig. 5). While there were fewer total myeloid cells (CD45+) overall in the bone marrow of mice in the fibrotic outcome group, there were more natural killer cells at day 7, and slightly more macrophages at day 3 (Fig. 5A, Supplementary Fig. 9, Supplementary Fig. 10). While there were no differences in immune cell numbers in the spleen between fibrotic and regenerative outcomes (Supplementary Fig. 10), there were several differences in macrophage phenotype in the spleen as early as day 1, persisting for up to 7 days post injury (Fig. 5B). Spleen macrophages from the fibrotic muscle group expressed higher levels of CD38, CD86 and PDL1 and the reparative marker CD206, although these differences were relatively small. Macrophage phenotype in the bone marrow did not differ between injury types (Supplementary Fig. 9).

A Immune cell populations in bone marrow over 7 days. B Macrophage phenotype in spleen over 7 days. Two-way ANOVA, Sidak’s post hoc. C Significantly different analytes in blood serum at day 1. D Significantly different analytes in blood serum at day 3. T-tests. N = 6, bars show mean ± SD. *p < 0.0.5; **p < 0.01; ***p < 0.001; ****p < 0.0001.
To evaluate possible signaling factors that might modulate immune cell behavior at the systemic level, we analyzed the levels of 32 immune response-related cytokines in blood serum over 7 days post injury (Fig. 5C, D, Supplementary Fig. 9E, Supplementary Table 4). At day 1, the fibrotic group was characterized by higher levels of granulocyte colony stimulating factor (G-CSF), which is crucial for neutrophil development and maintenance. There were also lower levels of pro-inflammatory cytokines interleukin-1-β (IL-1β), IL-12 (p70), and IL-15 in the fibrotic group compared to the regenerative group and to the naïve (uninjured) group. IL1β is a potently pro-inflammatory cytokine with broad effects on multiple cell types, while IL12 is involved in Th1 differentiation of T cells and IL15 is involved in natural killer (NK) cell differentiation. On day 3, the only difference was in IL-13, a fibrotic Th2 cytokine that was elevated in the fibrotic outcome group (Fig. 5D).
Apoptotic neutrophils are elevated in fibrotic outcomes
Because the increased number of neutrophils in the fibrotic group was the greatest difference observed in our characterization thus far, we decided to further investigate their phenotype. Neutrophils have been shown to modulate macrophage phenotype via several different mechanisms, including release of soluble factors22 and their phagocytosis by macrophages (efferocytosis)33,34. To evaluate potential differences in neutrophil phenotype, we FACS-sorted neutrophils (CD45+B220-CD3-NK1.1-CD11b+Ly6G+) from the muscle at day 3 (Fig. 6A). Gene expression analysis of FACS-sorted neutrophils showed only 2 differentially expressed genes out of a panel of 259 between the groups, based on a q-value of 0.05 (Fig. 6B, Supplementary Table 5). Upregulated genes in the fibrotic group included Pcdc1lg2, which encodes an anti-inflammatory protein, and Lpl, which is involved in metabolism and collagen synthesis (Fig. 6C). There was a population of Ly6G+F4/80+ cells in the muscle, so we cannot exclude possible contamination from macrophages (Supplementary Fig. 11).

A Gating strategy for neutrophil sorting (day 1 time point). B Hierarchical clustering of top 10% most variable genes based on fold change between fibrotic and regenerative groups from nanoString gene expression data of FACS-sorted neutrophils at day 3. C Two significantly differentially expressed genes (multiple t-test with Benjamini–Yekutieli false discovery rate correction). D Neutrophil numbers in muscle at 6 h and 1 day post injury. T-test. E, F Neutrophil apoptosis in muscle at 6 h and 1 day post injury as measured using ApoTracker staining in the neutrophil population. T-test. *p < 0.0.5; **p < 0.01; ***p < 0.001; ****p < 0.0001. N = 6, mean ± SD.
Next, we evaluated the levels of neutrophil apoptosis and cell death within the muscle at 6 h and at day 1. By 6 h post injury, there were already increased numbers of neutrophils (Fig. 6D) and more of them were apoptotic (Fig. 6E, Supplementary Fig. 12) in the fibrotic group compared to the regenerative group. The increased numbers of live neutrophils remained at day 1 (Fig. 6D), although the percentage of neutrophils that were apoptotic did not differ between groups at this time point (Fig. 6F).
Partial neutrophil depletion does not affect macrophage phenotype or muscle repair outcome
Excessive levels of apoptotic neutrophils, and their impaired clearance by macrophages, have been linked to fibrosis and impaired wound healing across a range of tissues outside of VML (reviewed in ref. 35). Within VML, the prolonged presence of neutrophils has been linked to persistent inflammation in fibrotic outcomes14. Therefore, we hypothesized that an excess of neutrophils might be a possible mediator of macrophage dysfunction and impaired tissue repair in the fibrotic outcomes. The gene expression analysis of FACS-sorted macrophages at day 3 revealed generally lower expression of phagocytosis- and efferocytosis-related genes in macrophages from the fibrotic group compared to the regenerative group (Supplementary Fig. 13), although none of these genes reached the level of statistical significance based on q < 0.05, suggesting possible defects in efferocytosis of neutrophils. To directly investigate the influence of excessive neutrophils on macrophage phenotype and muscle repair, we developed a partial neutrophil depletion protocol using systemic administration of anti-Ly6G antibody that depleted neutrophils in the muscle to the levels observed in the regenerative outcome group (Fig. 7A, B, Supplementary Fig. 14).

A Neutrophils were partially depleted via systemic injection of anti-Ly6G antibody in the critical size (4 mm) muscle defect and compared to the fibrotic control (4 mm defect) and the regenerative group (2 mm defect). Image created with BioRender.com. B Neutrophil presence in muscle at days 1 and 3. Kruskal–Wallis Test, Dunn’s post hoc. C Macrophage phenotype in muscle at day 3 in the critical size (4 mm) muscle defect group when mice were treated with anti-Ly6G antibody (Depleted group) vs. an isotype control (Isotype) or a PBS control (PBS). One-way ANOVA, Tukey’s post hoc. D Hierarchical clustering of the 10% genes with largest fold change (FC) from nanoString of FACS-sorted macrophages at day 3. E Representative Masson’s trichrome images of all injuries (scale bar 2 mm). F Quantification of collagen staining per section. G Quantification of muscle fiber diameter from laminin staining. H Quantification of whole muscle cross-sectional area from Masson’s trichrome stains. One-Way ANOVA, Tukey’s post hoc. N = 8, bars show mean ± SD. *p < 0.0.5; **p < 0.01; ***p < 0.001; ****p < 0.0001.
Flow cytometry analysis of macrophage phenotype at the site of injury demonstrated that neutrophil depletion increased PD-L1 expression on day 3 compared to both PBS and isotype controls (Fig. 7C, Supplementary Fig. 15). However, there were no other differences in macrophage phenotype markers between the neutrophil-depleted group and the isotype control. To further characterize macrophage phenotype following neutrophil depletion, we FACS-sorted macrophages from these injuries at day 3 and conducted gene expression analysis using nanoString (Fig. 7D, Supplementary Fig. 16, Supplementary Table 6). Hierarchical clustering and analysis of individual genes showed minimal effects from the neutrophil depletion on macrophage phenotype, upregulating only Il36g relative to the fibrotic control (PBS) group (Supplementary Fig. 16). Here it is important to note that results using the PBS control may differ from an isotype control.
Finally, we evaluated the effects of partial neutrophil depletion on muscle repair outcomes at day 28 using histology for total collagen staining (Masson’s trichrome) and immunohistochemistry of laminin expression as a marker of muscle fibers (Fig. 7E–H, Supplementary Fig. 16). No significant differences were observed in collagen staining, muscle fiber diameter, or muscle fiber area compared to the fibrotic control (PBS) group. Altogether, these results demonstrate that, surprisingly, partial neutrophil depletion had little to no effect on macrophage phenotype or on muscle repair outcome.
Discussion
This study demonstrated numerous changes in immune cell behavior locally within the muscle and systemically between small (regenerative) and critical size (fibrotic) VML injuries. Within 1 day after injury, macrophages exhibited altered phenotypes, with changes persisting for up to 4 weeks, even though overall temporal trends in typical phenotype markers were largely similar. In addition, we uncovered several differences in leukocyte numbers in the bone marrow, changes in macrophage phenotype in the spleen, and differences in inflammatory cytokine levels systemically. Lastly, we observed a large increase in neutrophil accumulation locally in fibrotic injuries compared to regenerative injuries. Despite this finding, and a vast body of research implicating neutrophils in macrophage dysfunction, partial neutrophil depletion had little to no effect on macrophage phenotype or on muscle repair outcome. These results are critical for understanding how the dynamics of immune cell interactions differ between two injuries of different sizes in the same tissue, which lead to very different outcomes. Furthermore, they demonstrate that the entire system must be considered when designing new interventions.
The reasons for impaired tissue repair in critical sized defects are often attributed to reductions in the number of regenerative stem cells or to physical limitations of cell migration. However, our results show that dysfunctional immune cell dynamics likely also play an important role. For example, macrophage gene expression was broadly downregulated in the fibrotic injuries as early as day 1, with decreased expression of critical genes involved in regulation of repair and clearance of tissue debris. These effects would be expected to inhibit muscle regeneration even if there were sufficient levels of stem cells available for differentiation. The increased numbers of macrophages and altered phenotypes persisted to at least day 28, indicating long term consequences of the initial injury. Therefore, any strategy intended to promote muscle regeneration must take these immune system changes into account.
It is not known why the size of the injury has such drastic local and systemic effects on immune cell dynamics. It may simply be that a larger injury causes more cell injury and death, and therefore more release of danger associated molecular patterns (DAMPs) into the circulation. Previous reports have highlighted the role of High Mobility Group Box 1 (Hmgb1), a DAMP and agonist for macrophage toll like receptor 4 (TLR4), in skewing macrophages towards an inflammatory phenotype36,37. Interestingly, Hmgb1 has also been shown to decrease macrophage phagocytotic/efferocytotic capacity38,39, perhaps leading to the high levels of neutrophils observed in this study. Additionally, we also observed increased expression of genes related to leukocyte-recruiting factors in the living cells remaining in the injured muscle of the fibrotic outcome groups. Together these factors could account for the increased leukocyte infiltration in the fibrotic outcome group and the altered behavior of macrophages, which are critical for regulating satellite cell proliferation and myofibroblast differentiation and myotube formation40. By day 3, muscle cells had downregulated expression of key genes involved in muscle repair, including genes encoding various types of collagen and decorin. It is possible that the altered macrophages and other leukocytes drove these changes, which should be evaluated in future studies.
In many tissues, including muscle, it is believed that the transition of macrophages from pro-inflammatory (or “M1-like”) to reparative (or “M2-like”) phenotypes is critical for successful tissue regeneration, which has been reviewed by our own group41 and others42. However, in the present study, we found that the macrophages in both the regenerative and fibrotic outcome groups exhibited the same temporal trends of typical phenotype markers, with the only differences being in their expression levels. The fibrotic group’s macrophages expressed higher levels of CD9, CD38, and CXCR4, but these markers are not specific markers of pro-inflammatory macrophages. On the gene expression level, the effect sizes were relatively small, and macrophages in the fibrotic outcome group also did not appear more pro-inflammatory than macrophages in the regenerative outcome group in that they did not upregulate any of the pro-inflammatory genes in the panel. On the protein level, fibrotic group macrophages did express lower levels of important reparative phenotype markers, including CD163, CD301b, and CD206, which are also involved in clearance of tissue debris, but again the effect sizes were smaller than we expected that they would be. Together, these results suggest that the macrophages in the fibrotic outcome group were not overly pro-inflammatory but rather defective in their reparative/regenerative behavior. One other study that characterized macrophage phenotype in critical vs. subcritical VML also reported that macrophages from critical sized defects were no more pro-inflammatory or M1-like, based on Ly6C expression, but were actually more M2-like, based on CD206 expression, although the macrophage gating strategy differed quite a bit from the one used in the current study in that it also included the efferocytosis receptor MerTK and the Fc receptor CD64 to identify macrophages11. Another major difference between that study and the present one is that that study used as 3 mm defect in male mice whereas the present study used a 4 mm defect in female mice, which is comparatively much more severe. Another study that included single cell RNA sequencing data from 3 mm vs. 2 mm defects in a combined cohort of male and female mice together with regenerative vs. fibrotic bone defects reported no obvious differences in M1-like or M2-like macrophage profiles between the two different outcomes16. Together, these results show that macrophages from fibrotic VML injuries are quite different from macrophages observed in more typical models of fibrosis, such as liver and lung, in which macrophages express high levels of both M1-like and M2-like markers (reviewed in refs. 43,44). This difference is not entirely surprising considering fibrotic VML is characterized by too little ECM deposition, whereas liver and lung fibrosis are characterized by too much ECM deposition.
Another important finding from the present study was that there were systemic consequences of the larger, fibrotic injury that were not observed in the smaller, regenerative injury. For example, several blood cytokines were altered in the fibrotic group compared to naïve mice but not in the regenerative outcome group. The higher levels of systemic G-CSF in the fibrotic outcome group are consistent with the higher numbers of neutrophils at the site of injury, as G-CSF is crucial for neutrophil maturation. It is also noteworthy that the levels of three pro-inflammatory cytokines (IL1-beta, IL-12, and IL-15) were lower in the fibrotic outcome group at day 1. This effect may be attributed to a compensatory anti-inflammatory feedback mechanism in response to higher levels of inflammation at earlier time points, although the very early kinetics of systemic cytokines were not evaluated in this study. The fact that splenic macrophages were more inflammatory in the fibrotic outcome group compared to the regenerative group supports the hypothesis that early increases in inflammation have systemic repercussions. In theory, these effects could in turn have local influence at the site of injury. While the contribution of splenic macrophages to VML repair has not been evaluated, they have been shown to be major contributors to repair of other tissues like heart following myocardial infarction30 and lung injury45. Future studies should be directed towards increasing understanding of how these systemic repercussions of injury size may influence tissue repair outcome.
Defective efferocytosis (clearance of apoptotic neutrophils by macrophages) is well known to be a major source of chronic inflammation in fibrosis of multiple tissues following injury ranging from the lungs to the spinal cord46. We and others have shown that efferocytosis is a potent stimulus that causes macrophages to switch from pro-inflammatory to anti-inflammatory/pro-regenerative17, so impaired efferocytosis stalls this critical process of macrophage phenotype regulation. The fact that partial neutrophil depletion did not influence macrophage phenotype suggests that the elevated levels of neutrophils may result from defective macrophage behavior, not necessarily defective neutrophil behavior. In fact, impaired clearance of apoptotic neutrophils in other tissues like diabetic wounds has also been linked to defective macrophage activity47. In this study, we did find that macrophages from the fibrotic group expressed higher levels of some pro-inflammatory phenotype markers on the protein level, which would be expected to reduce their efferocytotic behavior20,48, as well as slightly decreased expression of phagocytic and efferocytotic genes at the early stages of the response to injury in this study. Neutrophils release pro-inflammatory factors, so their persistence in the injury site can impair satellite cell differentiation and fusion into myofibers14. Additionally, apoptotic neutrophils that are not adequately cleared may further progress to secondary necrosis/NETosis49. These modes of cell death are more inflammatory compared to apoptosis and have been shown to further impede phagocytosis by macrophages50. The present study also showed that clearance of neutrophils alone was insufficient to improve muscle repair outcomes. In fact, there have been three other published studies in which neutrophils were depleted in the context of muscle injury, and they yielded conflicting results. Sadtler et al. reported no effect of neutrophil depletion on biomaterial-mediated fibrosis51. Two other studies depleted neutrophils in muscle repair using models in which injury was induced by injection of snake venom52 or by exhaustive exercise53, and these studies reported improved muscle repair outcomes. Therefore, more studies are required to address the lack of consensus of the effects of neutrophil depletion on muscle repair outcomes.
Although this study uncovered several previously unknown phenomena in VML repair and pathology, there were several limitations. The animal model utilized a relatively sterile injury, which is not normally the case for most VML patients. It is likely that during a non-sterile injury such as a vehicle accident or combat wounds, the elevated presence of neutrophils is crucial to clear the tissue from bacteria and other microbes. Additionally, VML in humans is often accompanied by substantial injury to the nearby bone, which can influence VML healing54. While we reported some differences in neutrophil phenotype between regenerative and fibrotic outcomes, these results need to be thoroughly validated because the tissue digestion process can influence neutrophil phenotype. Furthermore, neutrophil depletion was accomplished via systemic injection of anti-Ly6G antibodies, which may depend on clearance by splenic and liver macrophages55,56. Therefore, in principle this method could indirectly modulate muscle macrophage phenotype. Finally, the study was performed in female mice. Further studies must be conducted to evaluate the influence of sex on the findings described here.
This study increases understanding of macrophage dysfunction following VML by establishing a complete timeline in the context of the complex ecosystem in which they operate. Coinciding with alterations in macrophage phenotype were changes in numerous other leukocytes in the muscle and in the bone marrow, changes in splenic macrophage phenotype, and changes in systemic levels of pro-inflammatory cytokines. Despite excessive levels of apoptotic neutrophils in the muscle, their depletion to levels observed in the regenerative outcome group did not influence macrophage phenotype or muscle repair outcome. Together, these results suggest that the entire system of immune cell interactions will need to be considered to improve muscle repair outcomes in VML.
Methods
Animal surgeries
All animal experiments were conducted with approval by the Drexel University Institutional Animal Care and Use Committee. For each experiment, six to eight female C57BL/6 mice per group per time point were anesthetized using isoflurane (Vedco) and weighed. After buprenorphine sustained-release (Wedgewood) pain relief administration at 0.05 mg/kg, fur was removed using depilatory cream (Nair) and the skin was washed. Next, a triple alternating iodine and ethanol treatment was used to sterilize the skin and a 1–1.5 cm incision was made in the left hindlimb. Using a 2 or 4 mm in diameter biopsy punches full-thickness injuries were created in the mid portion of the quadriceps, with the leg comfortably extended. Hemostasis was accomplished using sterile cotton tipped swabs until bleeding ceased. Animals were sutured using a number 6 suture and observed until fully recovered. Animals were euthanized following IACUC approved methods. Mice who underwent cardiac puncture as primary method of euthanasia for blood collection were placed under 5% isoflurane displacement for 5 min prior to blood collection. Alternatively, mice were euthanized following standard CO2 induced asphyxiation. Both methods were followed by cervical dislocation as a secondary euthanasia method.
Bone marrow derived macrophage cell culture
Femur and tibia of female C57BL/6 mice, 8–12 weeks of age, were harvested for bone marrow extraction. The ends of the bones were removed using a scalpel, and marrow was flushed using a 27 gauge containing sterile RPMI 1640 (GIBCO CAT:11875-093). Cells were then cultured in non-tissue culture treated flasks (Thermo Scientific, CAT: 12-566-85) with media containing 89% RPMI 1640, 10% Fetal Bovine Serum (FBS) (VWR, CAT: 45000-734) and 1% Penicillin-Streptomycin (Themo Fisher, CAT: 15070063) and 25 ng/ml of M-CSF (Peprotech, CAT: 315-02). Media was changed after 3 days, and 2 days later cells were detached using 3 ml of TrypLE (Thermo Fisher, CAT: 12604021). Cells were moved to a non-tissue culture treated 6-well plate (VWR, CAT: 351146) and allowed to rest overnight. Next, cells were polarized using either 40 ng/ml of IL-4 (Peprotech, CAT: 214-14), 50 ng/ml of LPS (Fisher Scientific, CAT: NC9039780) or 60 ng/ml of IL-17A (Peprotech, CAT: 210-17). Cells were collected and analyzed via flow cytometry 3 days later.
Neutrophil depletion
Partial neutrophil depletion was accomplished via tail vein injection of anti-Ly6G antibodies (BioXCell CAT: BE0075-1) compared to a corresponding isotype control or PBS control. The antibody was diluted to 0.5 mg/ml in 100 uL of sterile PBS, and administered immediately prior to surgery, as well as 1 and 2 days after surgery. Partial neutrophil depletion was confirmed in muscle via flow cytometry, as described below.
Tissue preparation
For flow cytometry analysis, tissues were excised and digested into homogenous single cell suspensions. Whole muscle was digested using 3 mg/ml of Collagenase 4 (Worthington CAT:LS004188) diluted in RPMI (GIBCO CAT:11875-093) to a total volume of 3 ml for 30 min at 37 °C. Spleen was gently dissociated by pressing it against a 70-micron filter using a syringe plunger, followed by the addition of 7 ml of red blood cell (RBC) lysis (eBioscience CAT: 00-4300-54)) for 5 min at room temperature. Bone marrow was collected by removing the ends of both tibia and femur using a scalpel, then flushed through with ice cold RPMI using a 21 G syringe. The marrow was then lysed using the same protocol as the spleen. Blood was collected via cardiac puncture using a 25 G needle and placed in an EDTA coated tube (BD CAT: 367856). Blood was then centrifuged at 2000 G for 5 min to separate the serum from the cells. The serum was frozen at −80 °C for Luminex, while the cell pellet underwent two rounds of RBC lysis as previously described.
Flow cytometry
All samples were plated at approximately 1 million cells per well, where appropriate Fluorescence Minus One (FMOs) and Unstained controls were created by mixing cells from all samples, where each organ had its own FMO. Staining dilutions, catalog numbers and vendors can be found in Table 1 and the gating strategy can be found in Table 2. Cells were first stained with 200 uL Live/Dead for 30 min at 4 °C. Next, cells were stained with 50 uL FcR block for 10 min at 4 C. Cells then received 50 uL of extracellular marker staining for 15 min at 4 °C. Next, samples were washed and fixed using BD Fixation and Permeation buffer for 20 min at 4 °C. Cells were washed twice with the appropriate washing buffer and stained for intracellular markers for 30 min at room temperature. Finally, cells were washed and resuspended in FACS buffer at 4 °C until analysis.
Samples destined for Fluorescence Activated Cell Sorting (FACS) were prepared similarly to samples prepared for flow cytometry analysis, except samples were split in half. One half was used for sorting and follow up RNA extraction, while the other half was used to create the appropriate FMO and unstained control. The volume of blocking and staining cocktail was adjusted accordingly, maintaining the same dilution. Note that F4/80 was not used in the gating strategy as F4/80 expression has been previously described in immature neutrophils57. Moreover, we confirmed the presence of a small subpopulation of F4/80+ neutrophils that did not differ between both groups (Supplementary Fig. 10). All samples were analyzed using a BD Fortessa. Sample analysis was conducted using FlowJo. Clustering analysis was performed by identifying the number of clusters present using XShift, followed by clustering using FlowSOM using the number of clusters from the previous step.
nanoString multiplex gene expression analysis
RNA was extracted using RNAqueous microkit (Invitrogen CAT: AM1982) for FACS-sorted macrophages and neutrophils, while whole tissue RNA was extracted using RNAqueous isolation kit (Invitrogen CAT: AM1912), according to manufacturer’s recommendation. RNA was evaluated for quantity and purity using Biotek Take3TM plate and Biotek synergy H1 microplate reader. 5 uL of 20 ng/ul per sample was added to the reaction tube. Next 8 uL of master mix was added to the sides of each tube, followed by 2 uL of Capture Probeset. Samples were quickly centrifuged to allow reagents to mix and placed on thermocycler at 65 °C for 20 h. Gene counts were measured using nCounter MAX/FLEX system, and samples were scanned using the high field of view (FOV, 280) settings.
Histology and immunohistochemistry (IHC)
Histopathological and IHC assays were performed by Translational Pathology Shared Resources at Sidney Kimmel Cancer center, Thomas Jefferson University. Mouse quadriceps muscle tissue samples were harvested and fixed in 10% Neutral Formalin Buffer (NFB) fixative at room temperature for 48 h and then embedded in paraffin. These formalin fixed paraffin embedded (FFPE) blocks were sectioned at 4 µm. Four levels with 1500 µm apart were examined beginning from distal part of the muscle. H&E and Masson’s Trichrome staining were performed by using standard methods. IHC staining was performed using an intelliPATH FLX® Automated Slide Stainer (Biocare medical, LLC). Antibodies and epitope retrieval methods are: PECAM-1/CD31 (Santa Cruz, Cat#: sc-1506-R, 1/200 dilution), Heat Induced Epitope Retrieval (HIER) using Uni-TRIEVE (INNOVEX, Cat#: NB325-500) at 70 °C for 1 h; Laminin (Abcam, Cat#: ab11575, 1/400 dilution), epitope retrieval was performed using proteinase K(Invitrogen, P/N: 100005393, 20 µg/ml) at room temp for 5 min. Briefly, the slides were deparaffinized, rehydrated and then exposed to epitope retrieval. After endogenous peroxidase, normal serum and endogenous biotin blocking, primary antibodies were applied and incubated at room temperature. Biotinylated anti-rabbit (Vector Laboratories, cat#: BA-1000) secondary antibody and ABC-HRP complexes (Vector Laboratories, Cat#: PK-6100) were applied following the primary antibodies with 30 min incubation at room temperature. Three washes with 1x tris-buffered saline with Tween (TBTS) were performed between each step above. The signals were visualized using ImmPACT™ VECTOR Red (Vector Laboratories, Cat#: SK5105). The slides were counter stained with Hematoxylin, dehydrated, cleared and coverslipped with Permount mounting medium. All the stained slides were scanned using Aperio ScanScope CS2 at 20X magnification.
Luminex
Samples were analyzed using manufacturer’s recommendation. Blood was collected via cardiac puncture using a 25 G needle and placed in an EDTA coated tube (BD CAT: 367856). Blood was then centrifuged at 2000 G for 5 min to separate the serum from the cells. The serum was then frozen at −80 °C for Luminex.
Histological analysis and sGAG quantification
Collagen accumulation was measured as a function of blue pixels (collagen) in Masson’s Trichrome staining over all pixels using a custom MATLAB code. Muscle fiber diameter was measured manually using Laminin stained sections. Using ImageScope software, 4 different regions of interest surrounding the center of the injury (below, right and left), including the center, were selected and 50 measurements were taken at random per region, totaling 200 measurements per sample. For elongated fibers the shortest axis was used as the diameter. Only one section per sample was used in this quantification. These values were averaged resulting in a single average muscle fiber diameter per sample. Similarly, muscle cross sectional area was measured using ImageScope software by contouring the outer edge of the muscle section.
sGAG content was measured by first digesting tissue with 2% papain (Millipore Sigma, CAT:76216) (1 mL/construct, 0.56 U/mL in a buffer containing 0.1 M sodium acetate (Millipore Sigma, CAT: 71188), 10 M cysteine hydrochloric acid (Millipore Sigma, CAT: 30120), 0.05 M EDTA (MP Biomedicals, CAT: 6381-926), pH 6.0) at 60 °C for 16 h. Sulfated glycosaminoglycans (sGAG) content was quantified with DMMB assay following established procedures58. These data were normalized by the wet weight of the samples for native tissues (N ≥ 4 animals).
Statistical analysis
Statistical analysis of flow cytometry data and quantification of histological images was performed using GraphPad Prism v10. All data were reported as mean values ± standard deviation. Outliers were detected using ROUT method, and removed if deemed an outlier. The normality of data distributions was verified via Kolmogorov-Smirnov test. Depending on normality, data were analyzed with one-way or two-way ANOVA or respective non-parametric test or multiple t-tests, as specified in each figure legend. All measurements reported were taken from distinct samples, and not repeated measurement. Where applicable, two-sided tests were applied. The z-score for each marker was calculated based on the expression of that marker across all clusters using weighted means and standard deviation, to account for the discrepancy of cluster sizes.
NanoString data were normalized according to manufacturer recommendations. Briefly, negative control was first subtracted from all gene counts, followed by a positive control normalization. Then, gene counts were normalized by total counts per sample. Negative values were removed, and all data was transformed on log base 10. For statistical analysis, data were log-transformed and multiple t-tests were conducted followed by the Benjamini–Yekutieli method to control for false discovery less than 5% (q < 0.05). Clustering analysis of nanoString data was performed using RStudio (R Version 4.3.1). The normalized and log transformed gene counts were converted into Z Scores, and clustering was done by minimizing cluster variance.
Responses