Reagent or resource
Source
Identifier
Antibodies
Rabbit polyclonal anti-neuron specific beta III Tubulin
Abcam
Cat# ab18207, RRID: AB_444319
Rabbit polyclonal anti-alpha smooth muscle Actin
Abcam
Cat# ab5694, RRID: AB_2223021
Rabbit monoclonal anti-SOX17 (D1T8M)
CST
Cat#81,778, RRID: AB_2650582
Rabbit monoclonal anti-GATA-6 (D61E4)
CST
Cat# 5851, RRID: AB_10705521
Rabbit polyclonal anti-human Nanog
PeproTech
Cat# 500-P236, RRID: AB_1268274
Mouse monoclonal anti-SSEA1 (MC480)
Abcam
Cat# ab16285, RRID: AB_870663
Mouse monoclonal anti-SSEA4 (MC813)
Abcam
Cat# ab16287, RRID: AB_778073
Mouse monoclonal anti-E Cadherin
Abcam
Cat# ab76055, RRID: AB_1310159
Mouse monoclonal anti-Oct-3/4 (C-10)
Santa Cruz
Cat# sc-5279, RRID: AB_628051
Mouse monoclonal anti-Sox-2 (E-4)
Santa Cruz
Cat# sc-365823, RRID: AB_10842165
Monoclonal Anti-Skeletal Myosin (FAST)
Sigma-Aldrich
Cat# M4276, RRID: AB_477190
Goat anti-Mouse IgG (H + L) Cross-Adsorbed Secondary Antibody, Alexa Fluor 488
Thermo Fisher
Cat# A-11001, RRID: AB_2534069
Goat anti-Mouse IgG (H + L) Cross-Adsorbed Secondary Antibody, Alexa Fluor 594
Thermo Fisher
Cat# A-11005, RRID: AB_141372
Donkey anti-Rabbit IgG (H + L) Highly Cross-Adsorbed Secondary Antibody, Alexa Fluor 594
Thermo Fisher
Cat# A-21207, RRID: AB_141637
Donkey anti-Rabbit IgG (H + L) Highly Cross-Adsorbed Secondary Antibody, Alexa Fluor 488
Thermo Fisher
Cat# A-21206, RRID: AB_2535792
Chemicals, peptides, and recombinant proteins
CHIR99021
Selleckchem
Cat# S1263
IWR-1-endo
Selleckchem
Cat# S7086
Y-27632
Selleckchem
Cat# S1049
WH-4-023
Selleckchem
Cat# S7565
SB431542
Selleckchem
Cat# S1067
PD0325901
Selleckchem
Cat# S1036
XAV939
Selleckchem
Cat# S1180
LDN193189
Stemgent
Cat# 04-0074
Recombinant Human HGF
Peprotech
Cat# 100-39H
Recombinant Human IGF-I
Peprotech
Cat# 100-11
Recombinant Murine BMP-4
PeproTech
Cat# 315-27
Recombinant human EGF
PeproTech
Cat# AF-100-15
Recombinant human SGF
PeproTech
Cat# 300-07
Human/Murine/Rat Activin A (E.Coli)
PeproTech
Cat# 120-14E
Recombinant Human LIF
PeproTech
Cat# 300-05
Recombinant Human FGF-basic (154 a.a.)
PeproTech
Cat# 100-18B
Bovine Serum Albumin
Sigma-Aldrich
Cat# A1470
Ascorbic Acid
Sigma-Aldrich
Cat# A4544
KnockOut Serum Replacement
Thermo Fisher
Cat# A3181502
Neurobasal™ Medium
Thermo Fisher
Cat# 21,103-049
DMEM/F-12, GlutaMAX™ supplement
Thermo Fisher
Cat# 10,565-018
N-2 Supplement (100 ×)
Thermo Fisher
Cat# 17,502-048
B-27™ Supplement (50 ×), minus vitamin A
Thermo Fisher
Cat# 12,587-010
GlutaMAX™ Supplement
Thermo Fisher
Cat# 35,050-061
MEM Non-Essential Amino Acids Solution (100 ×)
Thermo Fisher
Cat# 11,140-050
2-Mercaptoethanol
Thermo Fisher
Cat# 21,985-023
Penicillin–Streptomycin (10,000 U/mL)
Thermo Fisher
Cat# 15,140–122
Gelatin (0.1% in water)
Stem Cell Technologies
Cat# 07903
Trypsin–EDTA (0.05%), phenol red
Gibco
Cat# 25,300,120
DMEM, high glucose, no glutamine
Gibco
Cat# 11,960-044
Fetal bovine serum (FBS)
Gibco
Cat# 16,000-044
Accutase cell dissociation reagent
Gibco
Cat# A11105-01
TrypLE ™ Express
Gibco
Cat# 12,605,010
Dulbecco's phosphate-buffered saline (DPBS)
Gibco
Cat# C14190500CP
Critical commercial assays
KAPA Hyper Prep Kits
KAPA Biosystems
Cat# KK8054
RNeasy Mini Kit
QIAGEN
Cat# 74,106
Globin-Zero Gold rRNA Removal Kit
Illumina
Cat# GZG1224
NEBNext® Ultra™ Directional RNA Library Prep Kit for Illumina®
New England Biolabs
Cat# E7760
RNAprep pure Cell/Bacteria Kit
TIANGEN
Cat# DP430
5 × All-In-One RT Master Mix
ABM
Cat# G490
Rapid Giemsa staining kit
BBI Life Sciences
Cat# E202FA0001
Gel & PCR Clean Up Kit
OMEGA
Cat# D2000-02
Lipofectamine ™ 3000 Transfection Kit
Invitrogen
Cat# 2,173,193
Endo-free Plasmid Mini Kit
OMEGA
Cat# D6950-02B
2 × RealStar Green Power Mixture
GenStar
Cat# A311-05
Cells-to-cDNA™ II Kit
Invitrogen
Cat# AM8723
Deposited data
scRNA-seq original data
This paper
GEO: GSE256201
RNA-seq original data
This paper
GEO: GSE256201
scRNA-seq public data
Zhi et al. [20]
GSA: CRA003960
bEPSCs and biPSCs RNA-seq public data
Zhao et al. [18]
GEO: GSE129760
Primed bPSCs RNA-seq public data
Bogliotti et al. [49]
GEO: GSE110036
bEDSCs RNA-seq public data
Kinoshita et al. [16]
GEO: GSE172420
Experimental models: Cell lines
E7-bEpiSCs
This paper
N/A
E10-bEpiSCs
This paper
N/A
E12-bEpiSCs
This paper
N/A
E14-bEpiSCs
This paper
N/A
Experimental models: Organisms/strains
CD-1® (ICR) IGS Mice
Vital River
201
BALB/c Nude Mice
Vital River
401
Holstein cow
Shijiazhuang Tianquan Elite Dairy Ltd and Beijing Dairy Cattle Center
N/A
Oligonucleotides
See Table S8 for oligonucleotide sequences used in this paper
N/A
N/A
Recombinant DNA
PB-CMV-EF1A-GFP-NLS
Our Laboratory
N/A
Software and algorithms
kallisto v-0.46.0
Bray et al. [50]
https://pachterlab.github.io/kallisto/
tximport v-1.18.0
Soneson et al. [51]
https://bioconductor.org/packages/release/bioc/html/tximport.html
Genome and Genome annotation
Ensembl 105
https://dec2021.archive.ensembl.org/index.html
Seurat v-4.1.0
Hao et al. [52]
Monocle3 v-1.3.1
Cao et al. [53]
https://cole-trapnell-lab.github.io/monocle3
destiny v-2.14.0
Angerer et al. [54]
https://bioconductor.org/packages/release/bioc/html/destiny.html
KEGGREST v-1.30.1
Tenenbaum [55]
https://bioconductor.org/packages/release/bioc/html/KEGGREST.html
gghalves v-0.1.1
N/A
https://github.com/erocoar/gghalves
DESeq2 v-1.30.1
Love et al. [56]
https://bioconductor.org/packages/release/bioc/html/DESeq2.html
ComplexHeatmap v-2.6.2
Gu et al. [57]
https://github.com/jokergoo/ComplexHeatmap
FactoMineR v-2.4
Lê et al. [58]
R v-4.0.5
N/A
Experimental model and study participant detailsMiceCD-1® (ICR) IGS and BALB/c nude mice were purchased from Beijing Vital River Laboratory Animal Technology Co., Ltd. (Beijing, China) and were used for mouse embryonic fibroblasts (MEFs) isolation, and the bEpiSCs teratoma formation test. The MEFs were treated with mitomycin C (Selleckchem, S8146) to prepare feeder cells for bEpiSCs.
BovineHolstein cow embryos at E5, E6, E7, E10, E12, and E14 were used for bovine embryonic single-cell collection and single-cell transcriptome analyses. Embryonic day n (E(n)) embryos were obtained n days after mating. For bEpiSCs derivation, E7, E10, E12, and E14 embryos were used.
Collection of cow embryos and isolation of embryonic single cellsThe embryonic cells utilized in this study were exclusively derived from Holstein cow embryos. E5-E7 embryos were obtained by thawing and culturing frozen cow embryos until the desired developmental stage was reached. The zona pellucida was subjected to a 15–30 s Pronase (Sigma, 10,165,921,001) treatment, followed by cleaning and removal using a solution composed of DPBS + 0.1% BSA. Subsequently, the embryonic cells were mechanically separated and transferred into the lysate. E10, E12, and E14 embryos were obtained through in vivo transplantation and subsequent flushing, with single embryonic cells being isolated and collected via a combination of enzymatic treatment and mechanical manipulation.
Bovine epiblast stem cells culture mediumThe components of the bovine epiblast stem cells culture system were the same as the 3i/LAF culture system previously reported [20], and the contents of different components were optimized accordingly. Specifically, the basal medium (BM) was a 1:1 mixture of DMEM/F12 medium (Thermo Fisher Scientific, 10,565–018) and Neurobasal medium (Thermo Fisher Scientific, 21,103–049), and added 1 × N2 supplement (Thermo Fisher Scientific, 17,502–048), 1 × B27 supplement (Thermo Fisher Scientific, 12,587–010), 0.5% GlutaMAX (Thermo Fisher Scientific, 35,050–061), 1% nonessential amino acids (Thermo Fisher Scientific, 11,140–050), 0.1 mM β-mercaptoethanol (Thermo Fisher Scientific, 21,985–023), 1% penicillin–streptomycin (Thermo Fisher Scientific, 15,140–122), 5% knockout serum replacement (KOSR, Thermo Fisher Scientific, A3181502, optional), and 50 μg/mL ascorbic acid (Sigma–Aldrich, A4544). To prepare the 3i/LAF culture system, it is also necessary to add the following small molecule inhibitors or cytokines to BM: CHIR99021 (1 μM, Selleckchem, S1263), IWR-1-endo (0.5 μM, Selleckchem, S7086), WH-4–023 (1 μM, Selleckchem, S7565), recombinant human LIF (10 ng/mL, PeproTech, 300–05), recombinant human Activin A (25 ng/mL, PeproTech, 120-14E), and recombinant human FGF-basic (154 aa) (10 ng/mL, PeproTech, 100-18B). Optimized, the ROCK inhibitor Y-27632 (5 μM, Selleckchem, S1049) was added. In addition, XAV939 (1 μM, Selleckchem, S1180) can be used instead of IWR-1-endo. Unless otherwise noted, the base medium and supplement factor concentrations of the condition medium (e.g. I/F medium) described herein are the same as those of the 3i/LAF medium. The bEpiSCs were cultured with mitomycin C-treated mouse fibroblasts as feeder cells.
Derivation of bEpiSCs from bovine ICMs, epiblasts, and ectodermsBovine ICMs, epiblasts, and ectoderms were isolated by mechanical isolation and treated with TrypLE™ Express (Gibco, 12,605,010) for 3 min, then seeded onto feeder cells supplemented with 3i/LAF medium. Cultures were incubated at 5% O2 and 5% CO2 at 37 °C. The domed outgrowths were collected and digested using Accutase cell dissociation reagent (Gibco, A11105-01), and then passaged every 3 days at a ratio of 1:4.
Method detailsSingle-cell RNA library preparation and sequencingSingle-cell RNA-seq library was prepared using a modified Smart-seq2 protocol, as previously described in studies [59, 60]. Briefly, individual embryonic cells were transferred into a lysis buffer containing an 8 bp barcode. Subsequently, first-strand cDNA was reverse-synthesized and amplified in a reverse transcription (RT) mixture consisting of 4 U RNase inhibitor, 100 U SuperScript II reverse transcriptase (Invitrogen, 18,064,071), 1 mM dNTPs (TAKARA, 4019), 60 mM MgCl2, and 3 µM RT primer with 10 µM TSO primer. Following PCR amplification, the product was purified using 0.8 × Beckman’s AMPure XP beads (A63882). Biotin PCR enrichment was then performed to further enhance the library quality. Finally, construction of the single-cell RNA-seq library followed the instructions provided by KAPA Hyper Prep Kits with PCR Library Amplification/Illumina series (KAPA KK8054). High-quality libraries were sequenced using Illumina HiSeq Xten platform (Novogene) with paired-end reads of length 150 bp. The primers utilized in these experiments are listed in key resources table.
Cell population doubling timeThe growth curve of bEpiSCs was plotted by seeding them at a density of 3 × 105 in 12-well plates. Cells were subsequently digested and counted at intervals of 12 h, 24 h, 36 h, 48 h, and 60 h using the Luna™ Automated Cell Counter. Three replicates were performed for each time point. The Doubling time was calculated using the following formula: Doubling time (DT) = 12 × [lg2/(lgNt-lgN0)], where 12 is the cell culture time (hours); Nt is the number of cells cultured at 48 h; and N0 is the number of cells recorded at 36 h.
Analysis of single-cell cloning efficiencyBovine EpiSCs were digested with TrypLE™ Express (Gibco, 12,605,010) and filtered through a 40 μm cell filter. Cells were seeded in 6-well plates with 100, 500, and 1000 cells, respectively. After 3 days of culture, the number of colonies formed was counted, and the single-cell colony formation rate was calculated and averaged.
Karyotype analysesBefore performing karyotype analyses, 1% KaryoMAX Colcemid Solution (Gibco, 15,212,012) was added to the bEpiSCs culture medium and incubated for 1 h. The bEpiSCs were then dissociated into single cells using TrypLE™ Express (Gibco, 12,605,010) and collected through centrifugation. Subsequently, the bEpiSCs were suspended in a hypotonic solution of 0.075 M KCl (Sigma, P5405) and incubated at 37 °C for 15 min. Following this step, the bEpiSCs were fixed with methanol and acetic acid in a ratio of 3:1; this process was repeated three times. The resulting suspension of bEpiSCs was dropped onto a pre-cooled slide, thoroughly dried at room temperature, and stained with a 10% Giemsa Stain Solution (Sangon, E6073140001) for a duration of 30 min. More than 45 metaphase cells were examined for each cell line.
Alkaline phosphatase (AP) stainingFor details about AP staining procedures and precautions, refer to Alkaline Phosphatase Detection Kit (Millipore, SCR004).
Immunofluorescence analysesThe cells were washed with DPBS for immunofluorescence (IF) analyses, followed by fixation in 4% paraformaldehyde (PFA) at room temperature for 30 min. Subsequently, the cells were rinsed with DPBS and permeabilized with 0.1% Triton X-100 for 20 min. After another round of DPBS washing, the cells were blocked with 3% BSA at room temperature for 1 h. The primary antibodies were incubated overnight at 4 °C and subsequently washed three times using a wash buffer (DPBS containing 0.1% Triton X-100 and 0.1% Tween 20). Secondary antibodies were incubated at room temperature for 1 h and then washed three times using the same wash buffer. Finally, DAPI staining was performed to visualize the nucleus, enabling direct observation and photography.
Embryoid body differentiationFollowing digestion, the bEpiSCs were seeded at a density of 1 × 106 cells per well and cultured for 5–7 days on 35 mm low-attachment plates in MEF culture medium: DMEM (Gibco, 11,960–044) supplemented with 10% FBS (Gibco, 16,000–044), 1% penicillin–streptomycin (Thermo Fisher Scientific, 15,140–122), and 1% GlutaMAX (Thermo Fisher Scientific, 35,050–061), on a horizontal shaker at a speed of 70 rpm. Subsequently, Embryoid Bodies (EB) were transferred to a 12-well plate and incubated in the same medium for one week with bi-daily medium changes. Adherent cells were utilized for subsequent immunofluorescence staining.
Teratoma formationThe prepared 1 × 107 bEpiSCs were collected by centrifugation after digestion, resuspended in 50 μL BM, and subsequently injected subcutaneously into the neck of BALB/c nude mice. After a period of 4–5 weeks, the teratoma was collected and subjected to subsequent H&E analyses.
H&E analysesThe teratomas were subjected to two washes in DPBS and fixed with 4% PFA for a duration of 2 days at a temperature of 4 °C. Subsequently, the teratoma tissues underwent dehydration using an alcohol gradient (70%, 80%, 90%, 95%, and finally, 100% for each hour) before being transferred into xylene and embedded in paraffin. The samples were sliced to a thickness of 5 μm, followed by deparaffinization in xylene and rehydration using decreasing concentrations of ethanol. Finally, the samples were stained with haematoxylin (Sigma–Aldrich, MHS16) and eosin (Sigma–Aldrich, HT110116), after which they were observed under a microscope (Leica, DM5500B).
RT–qPCRThe total RNA was extracted from bEpiSCs using the RNA prep Pure Cell/Bacteria Kit (TIANGEN, DP430), followed by reverse transcription to cDNA using the 5 × All-In-One RT Master Mix (Abm, G490). Subsequently, PCR amplification was performed with the 2 × RealStar Green Power Mixture (GenStar, A311-05) on an Archimed Real Time System (ROCGENE). The data were analyzed utilizing the comparative CT (2−ΔΔCT) method. ΔCT values were calculated employing GAPDH as an internal control. Three independent biological replicates were conducted for all experiments. The primer sequences used in quantitative real-time PCR can be found in key resources table.
Myogenic differentiation of bEpiSCsThe protocol was performed as described for pig pgEpiSCs [19]. In brief, myogenic differentiation basic medium (MDBM) consisted of DMEM/F12 supplemented with 1% nonessential amino acids, 0.1 mM β-mercaptoethanol, 1% penicillin–streptomycin, 15% KOSR, and 200 μM ascorbic acid. For stage 1, the bEpiSCs were dissociated into small pieces, and cultured in MDBM supplemented with 1% B27 supplement, 3 μM CHIR99021, and 2 μM SB431542 (Selleckchem, S1067) maintained for 3 days. In stage 2, the culture medium was replaced with a combination of 3 µM CHIR99021, 2 μM SB431542, 0.5 μM LDN193189 (Stemgent, 04–0074), and 20 ng/mL recombinant human FGF-basic (154 a.a.) from day 4 to day 6. For stage 3, the culture was replaced with 10 ng/mL HGF (Peprotech, 100-39H), 10 ng/mL IGF-1 (Peprotech, 100–11), 20 ng/mL recombinant human FGF-basic (154 a.a.), and 0.5 μM LDN193189 for 2 days. During stage 4, the bEpiSCs undergo differentiate into myoprogenitors. The bEpiSCs-MPCs were treated with a concentration of 10 ng/mL IGF-1 for 4 days, and for stage 5, a combination of 10 ng/mL HGF and 10 ng/mL IGF-1 for 20–25 days. For skeletal muscle maturation, the cells were treated with N2 medium consisted by DMEM/F12 supplemented with 15% KOSR, 1% N2 supplement, 1% penicillin–streptomycin, and 1% nonessential amino acids.
Primordial germ cell like cells (PGCLCs) inductionThe induction of PGCLCs was conducted according to a modified protocol [8, 45]. Briefly, 3000 bEpiSC cells were seeded into each well of a low-cell binding U-bottom 96-well plate (Beyotime, FULA962) and incubated for 24 h in 3i/LAF medium. Subsequently, the cells were switched to PGCLC induction medium consisting of BM medium supplemented with sodium pyruvate (1 mM, Thermo Fisher Scientific), BMP4 (200 ng/mL, PeproTech), recombinant human LIF (10 ng/mL, PeproTech), recombinant human SCF (100 ng/mL; PeproTech), recombinant human EGF (50 ng/mL; Peprotech), and Y-27632 (10 μM, Selleckchem) for a duration of three days.
rRNA-depleted RNA-seqTotal RNA was extracted separately from four bEpiSCs samples using the RNeasy Mini Kit (Qiagen, 74,106). For constructing strand-specific RNA-seq libraries, we employed a rRNA depletion protocol (Globin-Zero Gold rRNA Removal Kit, Illumina, GZG1224) in conjunction with the NEBNext® Ultra™ Directional RNA Library Prep Kit for Illumina® (NEB, E7420S). All libraries were quantified using the Qubit dsDNA High Sensitivity Assay Kit (Invitrogen, Life Technologies, Q32851) and sequenced on an Illumina HiSeq 4000 platform.
Vector constructionThe GFP plasmid was stored in our laboratory. In brief, we generated a PB-CMV-EF1A-GFP-NLS plasmid by modifying the PB-CAG-MCS vector (provided by Prof. Sen Wu). Specifically, we substituted the chicken β-actin promoter with the Homo sapiens elongation factor 1 alpha (EF1A) promoter and integrated a GFP-NLS cassette downstream of the EF1A promoter.
bEpiSCs transfectionBovine EpiSCs were transfected using Lipofectamine™ 3000 Reagent (Invitrogen, L3000008). Specifically, transfection was performed 16 h after normal cell passage in a 24-well plate. First, 25 μL Opti-MEM™ Medium was added to a centrifuge tube followed by the addition of 0.75 μL Lipofectamine™ 3000 Reagent and mixing well. Then, another centrifuge tube was taken and filled with 25 μL Opti-MEM™ Medium to which 0.5 μg DNA and 1μL P3000™ Reagent were added and mixed well. The DNA mixture was then combined with the Lipofectamine™ reagent at a ratio of 1:1 and allowed to incubate for10-15 min before being added to the cells for culture.
Generation of bEpiSCs cloned embryosOvaries were obtained from cattle farms around Beijing, and oocytes in 3–8 mm follicles were extracted. The oocytes with three layers of cumulus cells were transferred to Maturation medium and matured at 38.5 °C and 5% CO2 for 18 h. Maturation medium was based on TCM199 (Gibco, 12,340–030), supplemented with 10% FBS (Gibco, 16,000–044), and 0.01 IU/mL follicle stimulating hormone (FSH, Sigma, F4021), 0.01 IU/mL luteinizing hormone (LH, Sigma, L6420), 1 μg/mL estradiol (Sigma, E2257). After 18 h of oocytes maturation, 0.1% hyaluronidase (Sigma, H4272) was used to remove the excess cumulus, the oocytes with polar bodies were placed in HM medium containing 7.5 μg/mL of cytorelaxin (Sigma, C6762) for 10 min, and then transferred to HM medium containing 10% FBS for enucleation. The enucleated oocytes were transferred into the Maturation medium until nucleus injection. bEpiSCs were differentiated in basal medium containing 10 ng/mL BMP4 (PeproTech, 315–27), 5 μM SB431542, and 10 ng/mL FGF2 for more than 1 week and then used as donor cells for nuclear transfer. Clear round donor cells were selected and injected into the periovoid space, and the cells were as close to the cytoplasm as possible to improve the fusion efficiency. The reconstructed embryos were placed between two electrodes of the fusion cell and aligned with a microneedle so that the somatic cells were oriented toward one of the two electrodes. The fusion conditions were as following: double DC pulse 2.5 kV/cm, 10 μs, 1 s interval; The fusion solution was 0.3 mM mannitol (Sigma, 1,375,105), 0.15 mM CaCl2 (Sigma, C7902), 0.15 mM MgCl2 (Sigma, M2393), and 0.05% BSA. The fusion rate was detected under stereomicroscope. The reconstructed embryos were then transferred to IVC medium contained 5 μM ionomycin (Sigma, 407,950) for 4 min and then transferred to IVC medium contained 2 mM 6-DMAP (Sigma, D2629) for 4 h of culture. The activated embryos were washed three times in IVC medium and transferred into IVC medium for culture.
Quantification and statistical analysisSingle-cell RNA-seq low level processing and filteringFor the STRT-seq dataset, Raw reads were split by 8 bp cell barcodes located on Read 2 allowing 2 mismatches. Additionally, the 8 bp unique molecular identifiers (UMIs) located on Read 2 were switched to the identifier line of paired Read 1. Then Read 1 was processed to remove the template switch oligo (TSO) primer, low quality bases and polyA sequence [59]. The trimmed reads were aligned to the respective reference genomes (bovine: Bos_taurus.ARS_UCD1.2; pig: Sscrofa11.1), respectively. The unique molecular identifier (UMI) counts using kallisto (v-0.46.0) [61].
scRNA-seq data analyses and cell-type identificationTo pre-process our bovine data, we loaded the entire raw count matrix dataset with metadata using Seurat (v-4.1.0.9) [52]. The expression matrix was filtered based on the following criteria to get high quality cells and detectable genes: 1) the number of genes should be more than 2 000; 2) the proportions of UMIs for external RNA controls (ERCC) and mitochondrial genes should be below 40% and 50%, re respectively; 3) genes should be detected in at least 3 out of 728 sequenced cells. After filtering, 594 (486 embryonic cells, 108 pgEpiSCs cells) out of 728 cells were retained in the high-quality expression matrix for down-stream clustering and cell type identification. For the public datasets, the original information in the references were used.
Sequencing reads for each gene were normalized and scaled in each cell after regression on the mitochondrial gene ratio and cell cycle scores by “SCTransform” function. Recalibrated data was then used for principal components analysis (PCA) using “RunPCA” with default parameters. Dimensionality reduction was applied to the first thirty principal components, and the clusters were identified by construction of a shared nearest neighbor (SNN) graph. Finally, we visualized cell clusters after projecting them into two-dimensional latent space via t-distributed Stochastic Neighbor Embedding (t-SNE) by “RunTSNE” and Uniform Manifold Approximation and Projection (UMAP) by “RunUMAP” function. For bovine each embryonic day’s cell types, we identified the cell types with known marker genes. We filtered out some cells for which cell type could not be defined and retained 456 cells for further analyses (Table S1). The UMAP embeddings and cluster assignments computed in this section were later used in bovine embryonic dataset pseudotime analysis.
Identification of differentially expressed genes between embryonic stages in respective lineagesBased on the differentiation process during embryonic development, we grouped the bovine embryonic cells into three main lineages: embryonic lineages, covering pre-ICM at E5, ICM at E6 and E7, epiblasts at E10 and E12, and the ectoderms at E14; TE lineages, including pre-TE at E5, TE at E6, E7, E10, E12, and E14; and hypoblast lineages, including hypoblasts at E10, E12, and definitive endoderm at E14 (Fig. 1C).
Construction of expression tendenciesTo trace the dynamic changes of DEGs during embryonic development, we constructed the expression tendencies of DEGs in epiblast lineages. We first calculated the average expression levels for each gene at specific embryonic development time points in each respective lineage separately. The average expression levels were rescaled for embryonic lineage and analyzed by the k-means clustering method with parameters k = 10 and iter.max = 100, grouping the DEGs with similar tendencies during embryonic development into separate clusters.
Cross-species comparative analyses of scRNAseq datasetsWe downloaded orthologous genes list for three species (human, cattle and pig) via BioMart tool in the ensembl genome browser 105 (http://dec2021.archive.ensembl.org/index.html) and retained 16,841 genes, all of which were 1:1 orthologs. We then retained 1:1 orthologs in the bovine and porcine embryonic lineages datasets. For integrative analyses between porcine and bovine embryonic dataset, we used Seurat CCA approach for anchors determination and datasets alignment. Top 2000 genes that were repeatedly variable across datasets were selected and anchors were identified using “FindIntegrationAnchors” function with following parameters: “reduction = ‘cca’, k.anchor = 5, normalization.method = ‘SCT’”. The datasets were then integrated based on the identified anchors with “IntegrateData” function with following parameters: “dims = 1:30, k.weight = 50, normalization.method = ‘SCT’”. The genes average expression values in the integrated assay were obtained using the “AverageExpression” function, and the Spearman’s correlation coefficient between different cell types of two species was calculated using “cor” function. The fold change of genes expression level between naïve and formative states, formative and primed states of the two species were determined with the Wilcoxon rank-sum test by ‘‘FindMarkers’’ function, respectively. Only those with | ‘avg_logFC’|> 0.25 and ‘p_val’ < 0.05 were considered as DEGs.
Pseudotime analysisR package Monocle3 (v-1.3.1) [53] was used to reconstruct the embryonic cells developmental trajectory. The UMI matrix was used as input and variable genes obtained by Seurat to sort cells in pseudotime. Individual cells of two species were further ordered with the destiny (v-2.14.0) R package [54] using the DEGs between porcine all embryonic day's cells that were calculated by “FindAllMarkers” function to determine the developmental order between different species.
RNA-seq data processing and analysesExpression levels of protein coding genes in all samples (our original bulk RNAseq dataset, and public datasets of biPSCs, bEPSCs, primed bESCs and pgEpiSCs) were quantified as transcripts per million (TPM) using kallisto (v-0.46.0). DEGs between different cell types were identified using the DEseq2 tool (v-1.30.1) [56]. We used Benjamini–Hochberg adjusted false discovery rate (FDR) < 0.05 and absolute log2 (fold change) > 2 as cut-offs for statistical significance.
Transcriptome relatedness analyses of bEpiSCs to embryonic cellsFor integrative analyses between bEpiSCs and bovine embryonic dataset, we used Seurat RPCA approach for anchors determination and datasets alignment. Top 2000 genes that were repeatedly variable across datasets were selected and anchors were identified using “FindIntegrationAnchors” function with following parameters: “reduction = ‘rpca’, k.anchor = 15”. The datasets were then integrated based on the identified anchors with “IntegrateData” function with following parameters: “dims = 1:30, k.weight = 50”. Scaling and PCA were applied on the combined datasets and the integrated PCA coordinates were used as the input of the clustering and t-SNE visualization workflow with the following parameter: ‘dims = 1:5’.
Functional enrichment analysesFunctional enrichment analyses of selected genes were performed using Metascape (http://metascape.org) The bovine genes were mapped to their human orthologs and human (Homo sapiens) was the target species for analyses. Enrichment analyses were performed using all genes in the genome as the background set with Gene Ontology (GO)-biological processes (GO-BP) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway as ontology sources. Terms with a minimum count ≥ 3, adjusted P < 0.01, and enrichment factor ≥ 1.5 were considered to be significant and similar terms were grouped into clusters. Key pathways were depicted using -log10 (P-value) bar plots.
Statistical analysisTwo-way ANOVA multiple comparisons test was used to analyses data of RT–qPCR in Figs. 5C–H, 7C, F and Figure S3B, S3D, and used to analyses data presented in Figs. 4D, E, 5B. Dunnett’s multiple comparisons test were utilized in Fig. 5B, D, E, and Figure S3B, S3D. Bonferroni’s multiple comparisons test were utilized in Fig. 5C, F–H, and Fig. 7C, F. Tukey’s multiple comparisons test were utilized to compare the Cell population doubling time, and single-cell cloning efficiency in Fig. 4D, E.
Comments (0)