The evolution of preferred male traits, female preference and the G matrix: “Toto, I’ve a feeling we’re not in Kansas anymore”

Introduction

Throughout the animal kingdom, females frequently exhibit preference for particular characteristics of males (Wiens and Tuschhoff 2020). Research has typically focused upon a single male trait and examined how characteristics of the female or the environment affect female preference (Roff 2015; Kelly 2018; Bentsen et al. 2006). However, female preferences may incorporate several male characters or individual females may prefer different male traits (Hedrick and Weber 1998; Bentsen et al. 2006; Roff and Fairbairn 2015; Burley et al. 2018). Thus, evolution of female preference and the preferred male traits potentially depends upon complex genetic interactions between traits both within and between the sexes. For example, males of many orthopteran species attract females acoustically with a call that can be decomposed into components such as time spent calling, volume, frequency, pulse rate etc. If one component of the call is under stabilizing preference while another is under directional preference then the resulting response of both traits to female selection of mates will depend both upon the strength of the preference for each trait, the heritabilities of the two traits and the correlation between them (Roff et al. 2017). Song components that are correlated to the preferred traits may themselves change as correlated responses to changes in the preferred traits, even if they are not directly preferred by a female.

Females of the sand cricket, Gryllus firmus are attracted to several components of the song of the males: the amount of time a male spends calling (Crnokrak and Roff 1995, 1998a) and at least some components of the song (Roff et al. 2017). Heritability estimates and common garden experiments have demonstrated genetic variation in time spent calling and some of the other song components (Crnokrak and Roff 1998b; Webb and Roff 1992; Roff et al. 2003). Common garden experiments have also shown geographic variation in female preference, indicating genetic variation in female preference.

Consider the following experiment in which females are presented with a choice of two males and are mated to the one they prefer. Based on the previous experiments we would predict that the time spent calling would increase. Given that song components are inherited we would also predict that these would change in accordance with female preference. In any selection experiment we need to distinguish effects due to the selection process and extraneous factors, such as genetic drift. In part this can be resolved by multiple selection lines but typically control lines in which no selection is applied are run in parallel with selection. In the present case this would consist of mating females with males at random. If female preference is genetically correlated with preferred male traits then a true control line may not be strictly possible since female preference and the male traits will co-evolve.

It is generally posited that the genetic correlation between preference and the preferred trait is generated by linkage disequilibrium rather than pleiotropy (Kirkpatrick 1987; Pomiankowski and Iwasa 1993; Mead and Arnold 2004; Radwan 2008; Kuijper et al. 2012; Hosken and Wilson 2019). The reasoning behind this is that there is no obvious functional link between the trait and the underlying neurological process that generates the preference (Bakker and Pomiankowski 1995). However, any mutation that modified genes to affect both preference and preferred trait would be favored and increase in frequency, thereby creating a pleiotropic component to the genetic correlation. In either case preference will generate selection on the trait and thus result in linkage disequilibrium. Consequently, the genetic correlation between preference and preferred trait can be a result of linkage disequilibrium and pleiotropy. Experimental detection of a covariance between preference and preferred trait appears to be quite common (63% in 43 studies, Fowler-Finn et al. 2016) but the range, 0.049–0.207, is low (Greenfield et al. 2014) as predicted if such correlations are primarily due to linkage disequilibrium (Roff and Fairbairn 2014).

A genetic correlation resulting from linkage disequilibrium will be quickly eroded by random mating. In the absence of physical linkage, the correlation will be reduced by one half at each generation (Bakker and Pominakowski 1995; Roff 1997, p73-74). If the genetic correlation is due in part to pleiotropy, then the genetic correlation will remain constant unless population size is small enough for genetic drift to be significant (Roff 1997, p73-79). These observations give us a method by which to distinguish between genetic covariance by linkage disequilibrium alone from linkage disequilibrium plus pleiotropy.

Here I report on a selection experiment on G. firmus consisting of two lines: in one line females are presented with a choice of two males and are mated to the one they prefer (S lines), while in another line the female is mated to a male at random (the R lines). Each line was replicated. Based on the observed female preference, time spent singing is predicted to increase in the S lines. Other song components should change towards those most preferred by the female. Theory predicts that linkage disequilibrium will be lost over generations in the R lines and at least lost at a lower rate in the S lines. Provided female preference is inherited, we would expect that over time her preference would be degraded in the R lines relative to the S lines.

Methods

Experimental protocols

To initiate the starting population, 73 gravid females were collected from the environs of Jacksonville, Florida. These females had already expressed their realized preference before capture and, therefore, the genetic correlation(s) between realized preference and preferred trait(s) was expressed. Females were returned to the laboratory and placed in individual cages with dishes of earth for oviposition. Sixty-one females produced offspring: these were raised in family groups of approximately 40 per group in rearing cages following the protocol described in Roff et al. (2003). Once emergence started, cages were checked at least three times a week (Monday, Wednesday, Friday). A total of 2459 adults were collected comprising 93.1% macropterous (long-winged, LW) females and 87.0% macropterous (LW) males. From these, 155 triplets, described below, were run, with each individual in a triplet drawn from a different family collected on the same day.

The initial experimental setup followed that of Crnokrak and Roff (1995). Males of a triplet were housed in individual white plastic jars (9 cm diameter) with abundant food and water. Each jar was situated in a bucket that was connected to two other buckets by plastic tubing, 2.5 cm in diameter and located just below the top of the bucket, thereby forming a T-maze. For each T-maze, two males were selected haphazardly from separate families and placed individually in the two buckets forming the top of the T. A female, also selected haphazardly from a different family, was placed in the remaining bucket that formed the bottom of the T (Fig. 1). The tubing that inter-connected all three buckets allowed the females free access to a male (a ramp in her home bucket allowed females to access the tunnels). Cones on the end of each tube leading to a male’s quarters, plus the elevated height of the tubes from the bottom of the bucket, prevented the female from exiting the male’s bucket once entered. A female could interact acoustically with a male but was visually separated by the solid wall of the male’s cage. The lid of each male cage was fitted with a wired, omnidirectional microphone (microphone name: AOM-6545P-R © PUI Audio, Inc.) by which time spent calling (CALL) was recorded. However, the microphones also recorded volume as voltage changes with an upper maximum and thus represented a truncated distribution of call volume. With the caveat that this volume measure is a truncated index I shall refer to it as call volume (VOL).

Fig. 1: Diagram of experimental setup.
figure 1

In generations 1 and 2 no motion detectors were used and the T tube was set at the top of the containers, thereby preventing the female from leaving a male’s cage after she entered it. Starting at generation 3 the T tube was moved to the bottom of the cage and the cones removed, thereby permitting a female to move between cages. Motion detectors were used to monitor the female’s movement. Output from the sound detectors and motion detectors were continually sent to a recording device and stored on an SD card. Song components were recorded at generation 3 and 11 using USB audio recorders.

Full size image

Triplets were set up in one of three growth chambers, which had shelving sufficient for 25 triplets, with each triplet being set approximately 30 cm apart and separated by thick sound reducing foam. A few preliminary tests suggested that females responded primarily to the two males of her triplet. However, there was clearly a chorus of songs in the chamber and females undoubtedly heard these and this could have affected their behavior in some way, but this would not be unrealistic given natural conditions.

With temperature regulation set in the growth chambers the fan noise was unacceptably loud. Continuous active temperature control was therefore eliminated and the daily temperature change monitored. Photoperiod was set at 15 h:9 h light:dark with lights off at 3 pm. During the dark cycle the temperature slowly fell from 30 oC to 25 oC, a temperature cycling that resembled that found in the natural habitat. Virtually all calling occurred during the dark cycle when the temperature was between 25–28 °C. Differences in measured components of cricket song of Gryllus species over this temperature range are small (G. fultoni, Doherty and Callos 1991, G. firmus, Pires and Hoy 1992, G. integer, Martin et al. 2011). Both males were obviously subject to the same temperature regime and hence effects of temperature differences are controlled within triplets. Male calls were recorded during an 18-h window between 3:00 p.m. (lights off) and 9:00 a.m. on the next day. This 18-h window included 9 h of dark (3:00 p.m. to midnight) and 9 h of light (midnight to 9:00 a.m.).

After testing, triplets were divided into male/female pairs based on one of two criteria. In one line, females were mated with the male in whose container she ended up, which will be referred to as the S(elected) line: two replicate lines, S1 (42 families), S2 (37 families), were so established. For the other line, named the R(andom) line, females were mated to a randomly selected male: two such replicates, R1 (47 families), R2 (39 families) were created. For subsequent generations 50 triplets per replicate were used.

Using pairs from generation 1, up to 3 cages per pair were set up with up to 40 nymphs per cage. The protocol for generation 2 followed that for generation 1. A potential problem with this setup was that females could fall into a male’s cage as a result of movements not initiated by an attraction to a song. Thus the S lines would comprise both favored males and randomly encountered males. For generation 3 and all subsequent generations the set-up was changed to monitor the movement of the females. The tubes connecting the three buckets were moved to the bottom of the containers, permitting the free movement of females to and from each male’s bucket. At the entrance to a male’s bucket was a motion detector which recorded each time the female passed into the bucket (Fig. 1). As in previous generations females still could not interact physically with the male but her attraction to a male could be evaluated by the relative number of visits to each male. The time trace was divided into 50 second bins and a male recorded as singing if detected during this period. Trips to the male were recorded as movements by the female across the motion detector towards the male. S females were mated to the males they visited most frequently, whereas R females were mated at random with a male from a different cage. Because of logistical constraints, movement of R females and measurement of song components (see below) in both S and R lines was measured only at generation 3 and 11.

From generation 3 onwards, females were still mated with the male they preferred (S lines) or at random (R lines) but their offspring (40 per pair) were produced by mass rearing in two colony cages per line (60.7 cmW × 42.6 cmD × 33 cmH). Husbandry was the same as for generations 1 and 2, except scaled up to account for the larger cage and higher number per cage. Colony cages were checked as previously: adult age was assigned as zero on the day of collecting and age of testing as the number of days since collection (individuals collected on Monday and tested the following Monday were recorded as age 7 d). In generation 3 age ranged from 3 d (1.7%) to 15 d (0.2%) with the majority (88.9%) between 7 d and 10 d. Time spent calling (CALL) is known to be related to age (Crnokrak and Roff 1998a) and so this was added to the analysis of generation 3 but in generation 11 this potentially confounding factor was avoided by all testing being done within a narrow age distribution (94.0% at 7 d or 8 d). In the closely related species, G. pennsylvanicus, no effect of age was detected on the song components, pulse rate, pulses per chirp, or chirp width (Ciceran et al., (1994)).

As the intent of the selection experiment was to test the effect of song selection and not wing morph (micropterous=short-winged[SW] females are significantly more fecund than macropterous[LW], Roff 1984) each female contributed 40 offspring to the next generation, giving an effective population size of 2,000. The proportion of LW individuals fluctuated from 60% to 90% but the proportions in generation 3 and 11, the generations examined in detail, were very similar. In generation 3 the combined S lines consisted of 78.4% LW females and 64.1% LW males whereas in generation 11 the combined S line consisted of 77.8% LW females and 65.0% LW males. In generation 3 the combined R lines consisted of 68.4% LW females and 59.5% LW males and in generation 11 the combined R lines comprised 68.3% LW females and 55.1% LW males. The phenotypic and genetic basis of male song characteristics and female preference was estimated using a half-sib experiment in generation 11. While this approach is able to exclude non-additive effects such as dominance and maternal effects it halves the genetic correlation between males and females established by linkage disequilibrium (Roff 1997, p73-74). Where possible, three dams per sire, were used, the dams coming from a different cage to the male: in total, CALL and VOL were measured in 2,740 male offspring from 455 families (181 sires giving an average of 2.51 dams per sire and 6 males per family). Female preference was measured in 2091 females for CALL and VOL and 1588 females for the other song components, described below.

Song characteristics of the males were obtained using usb recorders, bought from a variety of sellers through Amazon© (about $10 each from China but $60 from USA outlets). In generation 3 recorders were started at the time the triplets were set up (9:30am) but most had run out of battery before recording any song, thereby considerably reducing sample size. As noted above, CALL and VOL were recorded separately from the other song components and 630 records were obtained for these components but only 223 recordings for the other components.

As calling is primarily restricted to dusk to dawn, in generation 11 the problem of run time of the recorders was alleviated by starting the recordings just before lights out (3 pm). The number successfully recorded was slightly less than for CALL and VOL: 2490 recordings spread over 444 families (179 sires, 5.6 offspring per family). The recordings were transferred to computer and scanned using the software Audacity© to locate a call segment of about 2 min. This segment was then analysed using the software Raven©. Gryllus firmus is a “chirper”, meaning that its song consists of discrete chirps of generally 3–5 pulses, which preliminary analysis showed could be divided into bouts, defined as groups of chirps separated by less than 0.5 s. Songs were divided into 12 components (Fig. 1S, 14 when CALL and VOL are included): chirp rate (=number of chirps per bout=CHIRP), pulse rate (=mean pulses per chirp/mean chirp Width=PULSE), mean pulses per chirp (PULSES/CHIRP), variance in pulses per chirp (V.PULSES/CHIRP), mean chirp width (CHIRP.W), variance in chirp width (V.CHIRP.W), mean frequency (FREQ), variance in frequency (V.FREQ), mean pulse width (PULSE.W), variance in pulse width (V.PULSE.W), mean interpulse width (IPULSE.W), variance in interpulse width (V.IPULSE.W). Potentially some of these components are mathematically related. However, because within an individual male components vary, relationships can be obscured. Principal component analysis was used to determine if fewer compound components could be constructed. Overall, there was no obvious grouping that would lead to fewer variables: only 26.0% of the variation was accounted for by the first PC and nine were required to account for 93.6% of the total variation (Fig. S2). Given the lack of a clear groupings, I conducted the analyses on the original components. A similar finding was obtained for four other cricket species (Bertram et al. 2011) suggesting that the information contained in the calling song is more extensive than might be expected from such a relatively simple structure.

Statistical analyses

Phenotypic analyses

Song components are measured on very different scales and were not normally distributed. To provide appropriate comparison and to better conform with the assumption of normality, each trait was transformed using the Box-Cox transformation (boxcox in R using lm(x ~ y), where x is the data and y the replicate) and then standardized by subtracting the mean and dividing by the standard deviation. Thus trait values are in standard deviation units and differences among traits are comparable.

I tested for overall differences in song traits between lines using a linear mixed model with Line (S, R) and wing morph (LW, SW) as fixed effects (and where appropriate, male or female age), and replicate as a random effect. I began with a full model and then used the stepwise function step of R to determine the final model. For some I also used stepAIC which uses AIC as the metric of comparison: as both approaches gave the same qualitative result I present those from step. For simplicity, I present only the final model produced by this approach. Replicate was never significant and therefore the replicate lines do not appear in statistics presented: I refer to the combined lines as the S and R lines, respectively.

Analysis of preference

Directional preference for a particular male was measured from generation 3 onwards as the number of approaches to a focal male (arbitrarily the left male), divided by the total number of approaches to both males (Roff et al. 2017). This preference measure is estimated as D.PREF =a+bR.TRAIT, where D.PREF = (Approaches by female to focal male)/(sum of approaches to each male) and R.TRAIT =(Trait value of focal male)/(sum of Trait values of each male). In the absence of selection for preference there will likely be a decrease in the overall frequency of preference alleles and hence female realized preference is predicted to be reduced in the R lines relative to the S lines. This hypothesis leads to the prediction that the slope of the regression (b) in the R lines will be lower than those in the S lines.

A similar approach can be used for the case of stabilizing preference in which female preference declines as the difference between her most preferred trait value and the one encountered increases. Details of this approach are described in Roff et al. (2017) and in Supplemental materials. Theory is not a good guide to whether differences in stabilizing preference should be observed between the R and S lines. Therefore, when stabilizing preference was indicated I tested for differences between lines using a randomization test (Roff 2006), without an a priori prediction on the direction of change that might occur (see Supplemental materials).

Songs were measured at generations 3 and 11. However, as noted above, the sample size in generation 3 was small and hence the analyses presented here are from generation 11, unless otherwise specified. Female preference was first tested for each song component separately. For directional preference I included R.TRAIT (e.g. R.CALL, R.CHIRP), LINE and their interaction as fixed effects and replicate as a random effect. To construct a multivariate preference function, I used stepwise multiple regression starting with the equation

$$Y=const+LINE+mathop{sum }limits_{i=1}^{9}{X}_{i}+mathop{sum }limits_{i=1}^{9}LINEast {X}_{i}+mathop{sum }limits_{j=1}^{5}{X}_{S,j}+mathop{sum }limits_{j=1}^{5}LINEast {X}_{S,j},$$

where, for display simplicity, coefficients are omitted: Xi is the relative trait value for those nine components showing directional selection and XS,j is the relative trait value for the five traits subject to stabilizing preference (see Results).

Prokuda (2016) showed that female preference for time spent calling is not a function of a female assessing the time a male spends calling but simply that she does not approach non-calling males. Therefore, at least some of the preference displayed by females in this experiment could be due to time when only one male was calling. Thus we would like to know female preference when both males were calling. Given that time is required to move between males such an analysis would require binning into larger bins than used to detect time spent calling (for example, say 5 minute segments). However, there is no way to assess the appropriate bin size and, therefore, I adopted an alternate approach. At R.CALL equal to 0.5 females are equally likely to select either male and thus at this value female preference will presumably be dependent on other song characteristics. To ensure an adequate sample size I chose values of R.CALL ranging from 0.47 and 0.53 and applied the above multivariate approach.

Genetic analysis

Heritabilities and genetic correlations were estimated in generation 11. Genetic analyses were done using the animal model as implemented by the software Asreml©. Line (LINE), age (AGE) and wing morph (WING) were entered as fixed effects, while sire, dam and replicate were entered as random effects. Neither age, nor replicate were statistically significant and were dropped from the final analysis. There was also no significant dam effect and thus the genetic estimates were based on sire/dam effects. For the genetic analysis it is necessary to ascribe a trait value preferred by the female: for this purpose, female preference was set as the trait value of the male she visited most. Note that is a relatively crude estimate as it is based on only two males. The estimation of the genetic correlation between male and female traits was done using the suggestion of Falconer (1952) that the two sexes be considered as different environments (Roff 1997, p89-91).

Simulation analysis

I used the genetic parameters and female preference function estimated in generation 11 and the population data in generation 3 to predict the response to selection over the 8 generations. For a trait that occurs in both male and female the response to selection is given by Δz = ½, where, in the present case, Δz is the vector of responses, G is the genetic covariance matrix in standardized units and β is the vector of selection differentials in standardized units (Lande and Arnold 1985). An alternative writing of this equation is (varDelta {z}_{i}=frac{1}{2}({beta }_{i}{h}_{i}^{2}+mathop{sum }nolimits_{j=1,jne i}^{n-1}{beta }_{j}{h}_{i}{h}_{j}{r}_{ij})), where Δzi is the response in the ith trait, ({h}_{i}^{2}) is the heritability of the ith trait, βi is the standardized selection differential of the ith trait and rij is the genetic correlation between traits i and j. This version of the equation more clearly demonstrates that the response to selection is composed of a direct response, ({beta }_{i}{h}_{i}^{2}), and an indirect response,(mathop{sum }nolimits_{j=1,jne i}^{n-1}{beta }_{j}{h}_{i}{h}_{j}{r}_{ij}), that is due to the genetic correlation between traits.

The above equations can be used to determine the response to a single round of female selection on the 14 male song traits. To most accurately mimic the experimental population, males are drawn at random from the population data in generation 3. One hundred males are drawn and paired at random, after which we use the multivariate preference function derived from the analysis of females in generation 11 to select the preferred males. Response is then calculated using the G matrix calculated from males in generation 11 and the β values generated by female preference. The mean response over n generations can be approximated as nΔz. I generated 800 responses and divided them into groups of 8 and summed the responses to give 100 simulated data sets.

To examine the effect of more than two males per female I adjusted the simulation model to allow for n males per female preference. The procedure was modeled as a sequence of encounters as follows: two males are first drawn at random from the population and presented to the female for her selection. The more preferred male is retained and another male drawn from the population at random and the female is given the choice between these two males. As before, the more preferred male is retained and another male drawn from the population. This process is continued until the female has had a choice of n males. The number of males per female was varied from 2 to 12.

Results

Genetic analysis of male song

LINE was a significant fixed effect in 13 of the 14 song components, whereas WING was retained in eight components, with an interaction between LINE and WING in three components (Table 1). The inclusion of WING as a fixed effect assumes a constant difference between morphs and hence a genetic correlation between wing morphs of 1: to check this I estimated the genetic correlation between wing morphs using the same approach as for the genetic correlation between the sexes. In no case is there any indication that the genetic correlation deviates significantly from 1 (Table 1).

Table 1 Heritability of song components, significant fixed effect differences between lines and genetic correlation between wing morphs (last three columns).
Full size table

Overall, heritability was modest ((bar{x}) = 0.15, SE = 0.01) but significantly different from zero in 12 of the 14 song components (Table 1). No formal test of differences between the R and S lines in heritabilities was made but it is evident from the estimates that the confidence regions considerably overlap and that there are no meaningful differences (Table 1).

Genetic correlations were calculated with LINE and WING as fixed effects. Forty-nine of the 91 genetic correlations were significantly different from zero (Table 2). Genetic and phenotypic correlations were highly correlated (rg = −0.082 + 0.853rp, P < 0.0001) but the variation is too large to reliably predict the genetic correlation from the phenotypic (Fig. S4). The distribution of phenotypic correlations does provide an indication of the distribution of the genetic correlations even if individual correlations cannot be predicted (histograms in Fig. S4).

Table 2 Genetic correlations between song components. Correlations above diagonal, SEs below the diagonal.
Full size table

Overall, there was no strong structure in the correlation matrix, making it difficult to predict by simple examination of the matrix of correlations how the suite of traits would evolve under multivariate female preference. However, 27 correlations exceed 0.5, indicating that responses to female preference on one or more traits will have ripple effects on the other traits.

Effects of female preference on CALL and VOL

I first focus on the principal prediction that CALL will increase in the S line and stay more or less constant in the R line. Because CALL and VOL were measured at each generation of the S lines it is possible to calculate the selection differential on these components produced by female preference (Fig. 2). Generations 1 and 2 differed from generation 3 onwards in that female choice was assessed by the container the female visited first and in which they were thereby trapped. In neither S line nor trait type (CALL, VOL) did the male selected differ significantly from the non-selected male (Selected males actually called less in generation 1 but not significantly so, P = 0.6388, two-tailed t test and called with higher volume in both lines and generations, but also not significantly so, P > 0.5). From generation 3 males in the S lines were chosen based on the number of times a female visited their container and in this case in both S lines β values were positive at generations 3 onwards for both CALL and VOL (Fig. 2). CALL of the selected males was significantly larger than the rejected males in all generations based on one-tailed t tests, whereas VOL was statistically significant in 13 of the 18 comparisons. The β values of CALL were larger, overall (mean=0.23, sd=0.08, n = 18), than VOL (mean = 0.15, sd = 0.09, n = 18).

Fig. 2: Per generation changes in CALL and VOL.
figure 2

Changes in CALL and VOL (top row) and selection (BETA) exerted by female preference on the two replicate S lines (bottom row).

Full size image

Thus no significant selection on CALL or VOL was generated in generations 1 or 2 but significant female preference was asserted in the following generations with selection on CALL being larger than that on VOL. Despite this selection there was no clear cut change in either CALL or VOL from generation 3 to 11 (Fig. 2). Stepwise covariance analysis of CALL indicated no significant effect of generation but a significant difference between lines (F3,14 = 3.521, P = 0.0434). Similar analysis on VOL showed no significant effects (F3,14 = 0.165, P = 0.9182).

Analysis of changes in male song components by comparing generations 3 and 11

Because of the large variation in age in generation 3, AGE was included as a fixed factor in the analysis. As we are interested in population trends WING was excluded. There were significant changes across generations in 12 of the 14 song components (Table 3). A significant response to female preference is indicated by a significant Generation by LINE interaction (G*L): such an interaction was significant in six of the song components (Table 3, Fig. S5). VOL and FREQ increased relatively more in the S lines than the R lines. In the three song components, V.PULSE/CHIRP, V.CHIRP.W and V.FREQ, S and R lines decreased over the generations but the S lines decreased less than the R lines. V.PULSE.W increased in the S lines but the R lines did not change significantly (Fig. S5).

Table 3 Results of stepwise regression of trait on Generation (G), Line (L), and Age (A) across generations 3 and 11.
Full size table

Time spent calling (CALL) did not change significantly from generation 3 to generation 11 (Table 3). The lack of change in CALL is consistent with the test across all generations but is surprising given that β was positive at each generation. In contrast, song volume (VOL) increased substantially more in the S lines (Table 6, Fig. S5), suggesting that female preference either directly or indirectly favored louder calling males, although this was not evident in the test across generations (Fig. 2).

The above results suggest that changes in song structure are a consequence of female preference. In the next section I examine this question and calculate the genetic basis of preference and the genetic correlation between female preference and male trait.

Analysis of female preference on all song components

Phenotypic components of female preference as measured in generation 11

Univariate analysis of female preference showed statistical significance for all song components (12 if, instead of using P = 0.05 we correct for the number of tests and use a P of 0.05/14 = 0.0036): in five cases preference was stabilizing (Table 4, Fig. S3 shows plots for four examples of stabilizing preference), while in five cases it was directional and positive and in the remaining four cases it was directional and negative. In three cases there was a significant interaction between LINE and R.TRAIT (R.CALL, R.VOL, R.CHIRP). There was no significant difference between lines when preference was stabilizing (P > 0.20 in all cases). R.CALL by itself accounted for the greatest amount of variance (8.60%), followed by R.VOL (3.95%) and then R.CHIRP (3.07%), with all other song components accounting for less than 3% each. In general, the change in song traits from generation 3 to generation 11 was in the direction predicted by female preference (Table 3), though the lack of generation by line interaction makes attribution to female preference uncertain.

Theory predicted that the slope of the S lines should be steeper than that of the R lines when preference is directional: to examine this I computed the slopes of the preference regressions separately for each combined line (replicate being not significant). As predicted, the slope of the S lines was steeper in the three cases where LINE was significant (R.CALL, R.VOL, R.CHIRP) and was also true in four of the five cases where R.TRAIT was significant but LINE was not (Table 3). No prediction was made for stabilizing preference: in two cases the slope of the S lines was steeper than the R lines (R.PULSE, R.V.PULSE.W) and in three it was less steep (R.CHIRP.W, R.PULSE/CHIRP, R.V.FREQ).

Table 4 Results of the analysis of female preference.
Full size table

Stepwise analysis of the multivariate model resulted in a highly significant (F11,1571 = 16.86, P < 10−15) equation with eight additive directional terms (LINE, R.CALL, R.VOL, R.CHIRP, R.V.PULSE/CHIRP, R.V.CHIRP.W, R.IPULSE.W), one stabilizing term (PULSE, labeled as S.PULSE in the equation) and three interactions (LINExR.VOL, LINExR.CHIRP, LINExR.IPULSE.W). As the three interaction terms all involve LINE we can form two separate equations for the two lines (Table 5). In terms of magnitude, the effects of R.CALL and R.CHIRP are much greater than any other song component.

Table 5 Coefficients in the multivariate female preference model.
Full size table

The average time spent calling by a male over the 18-h period from 3 pm to 9am was 170.8 min (se=2.9, n = 2490, min=0, max=774.2). Males did not tend to call at the same time, one male calling by itself on average 89.0% of the time. Thus the large effect of CALL may be a consequence of solo calling. As discussed above, to test for this the data set was reduced to one in which R.CALL lay between 0.47 and 0.53. The resulting multivariate equation was highly significant (F11,90 = 3.014, P = 0.00184, n = 104) and excluded R.CALL (Table 2). R.VOL and R.V.PULSES/CHIRP, which were included in the model using the full data set were also excluded but with the reduced data set several other song components were now retained (Table 5). Particularly noteworthy is that the new equation contains three components under stabilizing preference: S.PULSE, which was previously included but now has a much larger effect (−0.047 vs −0.145), V.FREQ and V.PULSE.W.

Genetic components of female preference

The S and R lines differed significantly in 12 of the 14 song components (Table 6). Heritabilities for female preference were low but significant for four components (Table 6). Genetic correlations between female preference and male trait, using the combined data set were modest to high but SEs were also high and only two traits (V.CHIRP.W and IPULSE.W) were significant. The correlations estimated for the S and R lines independently are very similar (Paired t = 1.32, P = 0.121), though the SEs are relatively large (Table 6). If the genetic correlation is caused solely by linkage disequilibrium then, in the absence of close physical linkage, the genetic correlation in the R lines is predicted to be close to zero. The R lines do not show clearly diminished values (on average they are 0.30 greater than the S lines), suggesting that the genes for female preference and male trait are either very tightly linked or the correlations result from both linkage disequilibrium and pleiotropy.

Table 6 Heritability of female preference (SE) between two males and the genetic correlation (SE) between female preference and male trait.
Full size table

Simulation model

Relating changes in the S lines to female preferences is made uncertain by the number of significant genetic correlations between song traits. To explore how the genetic structure can explain the observed results I used simulation. Given the uncertainty in the components of G (Table 2) I focus primarily on the qualitative results of the simulations. Overall, there is a significant positive correlation between the total response and β values (r = 0.594, P = 0.0126, one-tailed test, Fig. 3). There is no obvious distinction in response between those traits included in the female multivariate preference function and those that are not (Fig. 3). The analysis comparing generations 3 and 11 revealed significant effects of female preference for six traits: in all six cases the observed direction of the response corresponds to that predicted by the simulation (Table 3). The results for the other eight song traits are difficult to compare because the observed response is confounded by the response in the R lines; in five cases the observed change over generations is the same as that predicted by the simulation.

Fig. 3
figure 3

Top plot shows the mean total response (±1SE) over 8 generations for the 100 simulated experiments. Solid line shows fitted regression. Lower plot shows the division of the total response into direct and indirect responses (mean with bidirectional ±1SE). Plot divided by dotted lines into four quadrats as indicated by + and – signs (direct, indirect). Circle symbols indicate those traits comprising the multivariate female preference function. Triangle symbols show those traits not included.

Full size image

Indirect effects dominate all but CALL and, except for CALL, VOL and CHIRP, β are small, which accounts for the lack of direct effects (Table 7, Fig. 3). As expected from the multivariate female preference function and previous experiments (Crnokrak and Roff 1995, 1998) females showed a strong preference for the longer calling male, thereby generating the largest β value for CALL of the 14 song traits (Table 7). However, in spite of this strong direct selection the mean predicted response over 8 generations was only 0.07 sd units (sd=0.19, 31% of simulations produced negative responses, Table 7), a result that is matched by the observed insignificant response in the S lines (Table 7, Fig. 2). The apparent paradoxical result of strong selection on CALL but no response is a consequence of the large direct response (0.18) being countered by an indirect response that is almost as large in magnitude but differs in sign (−0.11, Table 7, Fig. 3). The mean β in the simulation is smaller than that observed in the experiment and thus might have led to the low simulated response in CALL: to test this I eliminated all β values less than 0.15, which then gave a mean β of 0.22, which was sufficiently close to the observed value of 0.23. This resulted in 52 estimates of the response over 8 generations. As expected, CALL did increase, but only to 0.08 sd units, demonstrating that it wasn’t the lowered mean β that resulted in the lack of response. Overall, the restriction of β values increased the magnitude of the trait responses (y = −0.001 + 1.402x, r = 0.995, P < 0.001, where x is the total response of the entire data set and y is the data set restricted to β > 0.15) but in no case the direction of change.

Table 7 Results of simulated response to 8 generations of female preference.
Full size table

Male song volume (VOL) has a negative coefficient in the multivariate preference function (Table 5) but has the second largest β value in the simulation (β = 0.10, Table 7). The large change in VOL is almost entirely due to indirect effects (direct = 0.06, indirect = 0.43, Table 7). This can be attributed in part to its positive phenotypic correlation with CALL (0.65) and CHIRP (0.25), which are the largest contributors to the preference function (Table 5). The relatively large simulated response matches the large, statistically significant observed response (Fig. S5).

To examine the effect of more than 2 males per female I adjusted the simulation to vary the number from 2 to 12 (see Methods for procedure). For this analysis I examined the total response per generation with 100 replicates per male/female combination. As would be expected, β values increased with the number of males per female (top plot, Fig. 4) but for the three song traits subject to the highest β values in the 2 males/female scenario (CALL, VOL and CHIRP) the β values quickly levelled out at 0.32 for CALL and 0.19 for VOL and CHIRP (Fig. 4). Despite a doubling of β on CALL, there was no significant change in total response (Fig. 4). This lack of response is, as in the previous simulation, a consequence of direct responses being negated by the indirect responses. In contrast, in the other traits, VOL and CHIRP, the magnitude of the response increased with β but tended to level out at about 5–6 males/female (for VOL, a = 0.26 and for CHIRP, a = 0.12, Fig. 4).

Fig. 4: Effects on β and three traits (CALL, VOL, CHIRP) of increasing the number of males each female can evaluate.
figure 4

Plots show means±1SE. Curves fitted using y = a(1 − e−bx). For the other song traits the β function was an increasing asymptotic (FREQ, PULSE.W), a decreasing asymptotic β (V.CHIRP.W, V.FREQ, V.IPULSE.W, V.PULSE.W, V.PULSE/CHIRP), a simple decreasing function (IPULSE.W, PULSE) or showed no significant relationship (CHIRP.W, PULSE/CHIRP). This pattern was somewhat mirrored in the response of the song traits: the same general pattern was observed in VOL, CHIRP, FREQ (increasing asymptotic), V.CHIRP.W, PULSE (decreasing asymptotic), V.FREQ, V.IPULSE.W, V.PULSE/CHIRP, IPULSE.W (decreasing but not evidence of an asymptote), and CHIRP.W, PULSE/CHIRP (no significant relationship), whereas a different pattern was observed in PULSE.W (no significant relationship) and PULSE.W (decreasing asymptotic).

Full size image

Discussion

Previous experiments showed that G. firmus females preferred males that called the most. Based on these observations I predicted that a selection experiment in which females were mated to the male that she visited the most would be the male that called more and would result in selection for longer calling males. The first prediction was supported but the second was not because the length of time spent calling (CALL) in the selected, S, lines did not increase over 11 generations of selection. Comparison with the random, R, lines also showed no change in CALL.

In addition to time spent calling, an index of call volume (VOL) was measured over all generations of selection. Females showed a positive preference for louder males but there was no change in the S lines measured over generations but there was a positive change when measured with respect to the R lines. Twelve song components were measured in generations 3 and 11 and five of these changed in the S lines relative to the R lines. Directional changes in these song components were generally consistent with directional female preference but predictions when females showed stabilizing preference were not possible by considering only the univariate situation. To address this and determine why time spent calling failed to respond as predicted I constructed a simulation model using the estimated G matrix, an estimated multivariate female preference function and initializing data from generation 3. The simulation produced results consistent, overall, with the observed changes in song components. An important finding was that while the direct response of CALL was positive, as observed, the indirect response countered this, resulting in no overall response. Further, the overall responses of other components were heavily dependent upon the indirect responses, generated by the genetic and phenotypic correlation among traits.

The simulation was modified to allow for multiple males per female still using the same preference function. CALL showed no significant response to an increase in the number of males available. Other responses were variable, though both VOL(ume) and CHIRP(rate) which previously showed the largest positive response increased asymptotically, with males per female and more or less plateaued at 5–6 males per female. The results of the simulation suggest that access to more males may not result in further selection through female preference. More males may also produce a noisy environment making discrimination difficult, thereby reducing the ability of a female to exhibit preference (Tanner and Simmons 2022).

In G. firmus the amount of time spent calling is a passive choice by the female in that the female does not assess the amount of time spent calling but simply does not respond to non-calling males, thereby making the male calling the most also the most conspicuous (Prokuda 2016). In the present experiment the strong response to CALL resulted from only one male calling most of the time. Eliminating the effect of CALL by restricting the female preference data to a narrow range of relative time spent calling, the resulting multivariate female preference function excluded CALL but retained chirp rate, its relative importance increasing, and six other song components, three of which were under stabilizing preference. Therefore, when two males are calling simultaneously females pay attention to the structure of their songs, particularly to chirp rate, a song characteristic also preferred in the sibling species Gryllus pennsylvanicus (Harrison et al. 2013). However, preference for males with a faster chirp rate will be tempered by stabilizing preference for pulse rate and to a lesser extent by stabilizing preference for variance in frequency and pulse width (Table 5). In the former case the genetic correlation is positive (0.24), permitting a reduced but positive response, but in the latter case the genetic correlations are negative (−0.01 and −0.21, respectively), acting against faster chirp rates. Such stabilizing effects of call structure have been observed in the cricket Teleogryllus commodus (Bentsen et al. 2006) and are probably the norm in songs of organisms that attract females by song.

Harrison et al. (2013) found that female preference increased with song volume in G. pennsylvanicus and that song volume correlated with body size. The univariate analysis in the present study found that female preference also increased with VOL(ume) but this was likely a result of its correlation with CALL: when CALL was controlled for, VOL(ume) was not retained in the multivariate preference function.

The importance of CALL in determining female preference will diminish as the number of males within the hearing range of the female increases. The probability of two males calling together at some given moment is p2, where p is the probability of a single male calling, which in the present experiment can be estimated as p = √0.05 = 0.224. Given n males, the probability of at least two males calling at any moment, provided calling does not initiate calling by other males, is determined using binomial probability, where p = 0.224. Niemelä et al. (2021) found that male Gryllus campestris were typically associated with 5 males (range 0–10). For the sequence n = 2, 4, 6, 8, 10 we have that the probability of at least two males calling is 0.05, 0.22, 0.40, 0.56, 0.69, or only one male calling as 0.95, 0.78, 0.60, 0.44, 0.31. Thus even with 10 males there is a significant probability that a female will hear only a single male calling and make her approach to that male, even if her preference may be greater for another of the 10 males.

Because random mating disrupts female preference, the R lines were predicted to show a reduced preference, which they did. The two song components most preferred by females, CALL and CHIRP(rate), had very low heritabilities of female preference (0.03 and <0.005, respectively, Table 6), while PULSE(rate), the third most preferred trait had the largest heritability (0.12). Using QTL analysis of Gryllus texensis and G. rubens, Blankers et al. (2019) obtained heritabilities for female preference for pulse rate of 0.61. Estimates from other organisms based on the male trait the female preferred in a choice trial are somewhat larger than obtained here for G. firmus (mean= 0.25, see Table 1 in Roff and Fairbairn 2015) but vary widely, ranging from 0 to 1.02 (sd=0.27).

If a female has access to all the males in the population, female preference is set by the male for whom she shows the greatest preference, which I shall call population preference. This situation will rarely be realized and I therefore define another parameter, realized preference, as her preference in a subset of males in the population. Field estimates of the number of males surveyed by females averaged only 4.5, with a median of 2.9 (Roff and Fairbairn 2014). Thus it is heritability of realized preference that is the most pertinent parameter for the study of the evolution of preference and preferred traits. As the number of males a female has access to increases, the heritability of realized preference approaches the heritability of the population preference (Roff and Fairbairn 2015) and in the present case could be as high as 0.25 (see Fig. 1 in Roff and Fairbairn 2015).

If the genetic correlation between preference and preferred trait was due entirely to linkage disequilibrium then, unless there was very tight physical linkage, in the R lines it should have diminished to zero by generation 11. Genetic coupling has been observed between male song pulse rate and female preference in Laupala crickets (Xu and Shaw 2019, 2021). The large number of chromosomes in Gryllus species, (28, Lim et al. 1973; Zefa 1999; Yoshimura 2005; Yoshimura et al. 2006) relative to Laupala, (7 autosomal linkage groups, Blankers et al. 2018), make such tight linkage less likely. Some physical linkage was observed between pulse rate and preference in QTL analysis of G. texensis and G. rubens (Blankers et al. 2019).

The genetic correlation would tend to be maintained if it were generated in part by pleiotropy. Genetic correlation estimates were typically high but, despite the large sample size, the SEs were also high, a problem that plagues such estimates (Sharma et al. 2016). Never-the-less the lack of difference between the R and S lines suggests that pleiotropy in addition to linkage disequilibrium underlies the genetic correlation in this population.

The overall finding of the experiment reported here is that a focus on a univariate female preference function or even a phenotypic examination of multivariate female preference may be quite inadequate to predict how a suite of traits preferred by a female or genetically correlated with such traits will evolve. Given even a reduced choice of males, female preference may still exert strong selection on traits but how they respond to such selection will depend greatly upon the details of the G matrix. In the movie “The Wizard of Oz”, Dorothy lives in Kansas, which is portrayed as a plain, gray country. She is transported to Oz, which, by comparison, is far more complex, colorful and interesting. The results presented herein mimic this in that the simple world of univariate analysis may not correctly predict the results of selection: it is only in the more complex world measured by genetic and phenotypic interactions among components, both within males and within female preferences, that multivariate evolution can be understood and predicted.

Related Articles

Type 2 immunity in allergic diseases

Significant advancements have been made in understanding the cellular and molecular mechanisms of type 2 immunity in allergic diseases such as asthma, allergic rhinitis, chronic rhinosinusitis, eosinophilic esophagitis (EoE), food and drug allergies, and atopic dermatitis (AD). Type 2 immunity has evolved to protect against parasitic diseases and toxins, plays a role in the expulsion of parasites and larvae from inner tissues to the lumen and outside the body, maintains microbe-rich skin and mucosal epithelial barriers and counterbalances the type 1 immune response and its destructive effects. During the development of a type 2 immune response, an innate immune response initiates starting from epithelial cells and innate lymphoid cells (ILCs), including dendritic cells and macrophages, and translates to adaptive T and B-cell immunity, particularly IgE antibody production. Eosinophils, mast cells and basophils have effects on effector functions. Cytokines from ILC2s and CD4+ helper type 2 (Th2) cells, CD8 + T cells, and NK-T cells, along with myeloid cells, including IL-4, IL-5, IL-9, and IL-13, initiate and sustain allergic inflammation via T cell cells, eosinophils, and ILC2s; promote IgE class switching; and open the epithelial barrier. Epithelial cell activation, alarmin release and barrier dysfunction are key in the development of not only allergic diseases but also many other systemic diseases. Recent biologics targeting the pathways and effector functions of IL4/IL13, IL-5, and IgE have shown promising results for almost all ages, although some patients with severe allergic diseases do not respond to these therapies, highlighting the unmet need for a more detailed and personalized approach.

Iron homeostasis and ferroptosis in muscle diseases and disorders: mechanisms and therapeutic prospects

The muscular system plays a critical role in the human body by governing skeletal movement, cardiovascular function, and the activities of digestive organs. Additionally, muscle tissues serve an endocrine function by secreting myogenic cytokines, thereby regulating metabolism throughout the entire body. Maintaining muscle function requires iron homeostasis. Recent studies suggest that disruptions in iron metabolism and ferroptosis, a form of iron-dependent cell death, are essential contributors to the progression of a wide range of muscle diseases and disorders, including sarcopenia, cardiomyopathy, and amyotrophic lateral sclerosis. Thus, a comprehensive overview of the mechanisms regulating iron metabolism and ferroptosis in these conditions is crucial for identifying potential therapeutic targets and developing new strategies for disease treatment and/or prevention. This review aims to summarize recent advances in understanding the molecular mechanisms underlying ferroptosis in the context of muscle injury, as well as associated muscle diseases and disorders. Moreover, we discuss potential targets within the ferroptosis pathway and possible strategies for managing muscle disorders. Finally, we shed new light on current limitations and future prospects for therapeutic interventions targeting ferroptosis.

Targeting of TAMs: can we be more clever than cancer cells?

With increasing incidence and geography, cancer is one of the leading causes of death, reduced quality of life and disability worldwide. Principal progress in the development of new anticancer therapies, in improving the efficiency of immunotherapeutic tools, and in the personification of conventional therapies needs to consider cancer-specific and patient-specific programming of innate immunity. Intratumoral TAMs and their precursors, resident macrophages and monocytes, are principal regulators of tumor progression and therapy resistance. Our review summarizes the accumulated evidence for the subpopulations of TAMs and their increasing number of biomarkers, indicating their predictive value for the clinical parameters of carcinogenesis and therapy resistance, with a focus on solid cancers of non-infectious etiology. We present the state-of-the-art knowledge about the tumor-supporting functions of TAMs at all stages of tumor progression and highlight biomarkers, recently identified by single-cell and spatial analytical methods, that discriminate between tumor-promoting and tumor-inhibiting TAMs, where both subtypes express a combination of prototype M1 and M2 genes. Our review focuses on novel mechanisms involved in the crosstalk among epigenetic, signaling, transcriptional and metabolic pathways in TAMs. Particular attention has been given to the recently identified link between cancer cell metabolism and the epigenetic programming of TAMs by histone lactylation, which can be responsible for the unlimited protumoral programming of TAMs. Finally, we explain how TAMs interfere with currently used anticancer therapeutics and summarize the most advanced data from clinical trials, which we divide into four categories: inhibition of TAM survival and differentiation, inhibition of monocyte/TAM recruitment into tumors, functional reprogramming of TAMs, and genetic enhancement of macrophages.

Integrated proteogenomic characterization of ampullary adenocarcinoma

Ampullary adenocarcinoma (AMPAC) is a rare and heterogeneous malignancy. Here we performed a comprehensive proteogenomic analysis of 198 samples from Chinese AMPAC patients and duodenum patients. Genomic data illustrate that 4q loss causes fatty acid accumulation and cell proliferation. Proteomic analysis has revealed three distinct clusters (C-FAM, C-AD, C-CC), among which the most aggressive cluster, C-AD, is associated with the poorest prognosis and is characterized by focal adhesion. Immune clustering identifies three immune clusters and reveals that immune cluster M1 (macrophage infiltration cluster) and M3 (DC cell infiltration cluster), which exhibit a higher immune score compared to cluster M2 (CD4+ T-cell infiltration cluster), are associated with a poor prognosis due to the potential secretion of IL-6 by tumor cells and its consequential influence. This study provides a comprehensive proteogenomic analysis for seeking for better understanding and potential treatment of AMPAC.

Engineering bone/cartilage organoids: strategy, progress, and application

The concept and development of bone/cartilage organoids are rapidly gaining momentum, providing opportunities for both fundamental and translational research in bone biology. Bone/cartilage organoids, essentially miniature bone/cartilage tissues grown in vitro, enable the study of complex cellular interactions, biological processes, and disease pathology in a representative and controlled environment. This review provides a comprehensive and up-to-date overview of the field, focusing on the strategies for bone/cartilage organoid construction strategies, progresses in the research, and potential applications. We delve into the significance of selecting appropriate cells, matrix gels, cytokines/inducers, and construction techniques. Moreover, we explore the role of bone/cartilage organoids in advancing our understanding of bone/cartilage reconstruction, disease modeling, drug screening, disease prevention, and treatment strategies. While acknowledging the potential of these organoids, we discuss the inherent challenges and limitations in the field and propose potential solutions, including the use of bioprinting for organoid induction, AI for improved screening processes, and the exploration of assembloids for more complex, multicellular bone/cartilage organoids models. We believe that with continuous refinement and standardization, bone/cartilage organoids can profoundly impact patient-specific therapeutic interventions and lead the way in regenerative medicine.

Responses

Your email address will not be published. Required fields are marked *