1. Define context(s)
reveal new biological insights
Primary goal of the model/tool/database
Rapid growth of single-cell transcriptomic data provides unprecedented opportunities for close scrutinizing of dynamical cellular processes. Through investigating epithelial-to-mesenchymal transition (EMT), we develop an integrative tool that combines unsupervised learning of single-cell transcriptomic data and multiscale mathematical modeling to analyze transitions during cell fate decision. Our approach allows identification of individual cells making transition between all cell states, and inference of genes that drive transitions. Multiscale extractions of single-cell scale outputs naturally reveal intermediate cell states (ICS) and ICS-regulated transition trajectories, producing emergent population-scale models to be explored for design principles. Testing on the newly designed single-cell gene regulatory network model and applying to twelve published single-cell EMT datasets in cancer and embryogenesis, we uncover the roles of ICS on adaptation, noise attenuation, and transition efficiency in EMT, and reveal their trade-off relations. Overall, our unsupervised learning method is applicable to general single-cell transcriptomic datasets, and our integrative approach at single-cell resolution may be adopted for other cell fate transition systems beyond EMT.
Biological domain of the model
scRNA-seq data of various tissues
Structure(s) of interest in the model
transition states, EMT
Spatial scales included in the model
N/A
Time scales included in the model
seconds to days
2. Data for building and validating the model
Data for building the model |
Published? |
Private? |
How is credibility checked? |
Current Conformance Level / Target Conformance Level |
in vitro (primary cells cell, lines, etc.) |
|
|
|
|
ex vivo (excised tissues) |
|
|
|
|
in vivo pre-clinical (lower-level organism or small animal) |
Yes |
No |
The model was built in an unsupervised way on unbiased single-cell RNA sequencing data and spatial data. |
Extensive |
in vivo pre-clinical (large animal) |
|
|
|
|
Human subjects/clinical |
Yes |
No |
The model was built in an unsupervised way on unbiased single-cell RNA sequencing data and spatial data. |
Extensive |
Other: ________________________ |
|
|
|
|
Data for validating the model |
Published? |
Private? |
How is credibility checked? |
Current Conformance Level / Target Conformance Level |
in vitro (primary cells cell, lines, etc.) |
|
|
|
|
ex vivo (excised tissues) |
|
|
|
|
in vivo pre-clinical (lower-level organism or small animal) |
Yes |
No |
By comparing to existing knowledge. |
Adequate |
in vivo pre-clinical (large animal) |
|
|
|
|
Human subjects/clinical |
Yes |
No |
By comparing to existing knowledge. |
Adequate |
Other: ________________________ |
|
|
|
|
3. Validate within context(s)
|
Who does it? |
When does it happen? |
How is it done? |
Current Conformance Level / Target Conformance Level |
Verification |
Students/postdocs/investigators |
Throughout the project |
1) The convergence of algorithm is verified. 2) The computational results of identified transition states are consistent with expectations. |
Extensive |
Validation |
Students/postdocs/investigators |
As the unsupervised model was established |
1) The method is validated by simulated data describing cell fate transitions. 2) The identified transition states agree with prior knowledge. 3) The inferred gene regulatory network agrees with prior knowledge. |
Extensive |
Uncertainty quantification |
|
|
|
|
Sensitivity analysis |
|
|
|
|
Other:__________ |
|
|
|
|
Additional Comments |
|
|
|
|
4. Limitations
Disclaimer statement (explain key limitations) |
Who needs to know about this disclaimer? |
How is this disclaimer shared with that audience? |
Current Conformance Level / Target Conformance Level |
The algorithm efficiency on large datasets. The multiscale model does not consider cell-cell communications. |
Scientific community who intends to apply this method to excessively large scRNA-seq data or to study the interaction between cell-cell communications and cell fate transition.. |
In discussion of the paper. |
Adequate |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
5. Version control
Current Conformance Level / Target Conformance Level |
Extensive |
|
Naming Conventions? |
Repository? |
Code Review? |
individual modeler |
Yes |
Yes |
Peer |
within the lab |
Yes |
Yes |
Peer |
collaborators |
|
|
|
6. Documentation
|
Current Conformance Level / Target Conformance Level |
Code commented? |
Adequate |
Scope and intended use described? |
Extensive |
User’s guide? |
Extensive |
Developer’s guide? |
Adequate |
7. Dissemination
Current Conformance Level / Target Conformance Level |
Extensive |
Target Audience(s): |
“Inner circle” |
Scientific community |
Public |
Simulations |
|
|
|
Models |
|
|
|
Software |
Matlab package: https://github.com/yutongo/QuanTC |
Matlab package: https://github.com/yutongo/QuanTC |
Matlab package: https://github.com/yutongo/QuanTC |
Results |
Paper and Github repo: https://github.com/yutongo/QuanTC |
Paper and Github repo: https://github.com/yutongo/QuanTC |
Paper and Github repo: https://github.com/yutongo/QuanTC |
Implications of results |
|
|
|
8. Independent reviews
Current Conformance Level / Target Conformance Level |
Insufficient |
Reviewer(s) name & affiliation: |
|
When was review performed? |
|
How was review performed and outcomes of the review? |
|
9. Test competing implementations
Current Conformance Level / Target Conformance Level |
Adequate |
|
Yes or No (briefly summarize) |
Were competing implementations tested? |
Yes. The method has been compared to another commonly used methods on benchmark datasets. |
Did this lead to model refinement or improvement? |
No |