Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DIA-NN Data Vignette #19

Open
abadgerw opened this issue Oct 3, 2024 · 4 comments
Open

DIA-NN Data Vignette #19

abadgerw opened this issue Oct 3, 2024 · 4 comments

Comments

@abadgerw
Copy link

abadgerw commented Oct 3, 2024

@yafeng Thank you for a great tool. I just wanted to check in on the status of the vignette you mentioned in a previous thread using your differential expression method on an output from DIANN? I am just a bit unclear as to what information I should be using as input from the reports.

@abadgerw
Copy link
Author

abadgerw commented Oct 10, 2024

@yafeng I just wanted to circle back and see if you had some insight into this query as I'd love to be able to use your fantastic tool?

@yafeng
Copy link
Owner

yafeng commented Oct 11, 2024

@abadgerw Hi, I missed your last comments. Sorry for that.

To use the DIA-NN output for DEqMS analysis, I have some tips for you.

Use the main report (report.tsv), the relevant columns are Precursor.Id, Genes (or Protein.Ids), Proteotypic, Precursor.Normalised.

If the data is not filtered based on the Q-value, use the Q.Value or Lib.Q.Value column (for library-free and spectra-library modes, respectively) for filtering.
The output is in the long format and should be transformed into a wide format.

Next, use MedianSweeping or MedianSummary function in DEqMS to group precursors into protein level quantity.
After you get a protein table, and a precursor count table for all proteins , you should be able to proceed to differential analysis.

if you get trouble, send me a short version of your DIA-NN output, I will write the R code for you.

Yafeng

@abadgerw
Copy link
Author

Thanks, @yafeng! Is there a reason to roll up peptides to proteins using median sweeping rather than using the default method in DIA-NN that generates protein-level data and then extracting the precursor counts from the report.tsv file? I believe the new version of DIA-NN has a newly optimized method for quantification which was used for my dataset.

@yafeng
Copy link
Owner

yafeng commented Oct 16, 2024

@abadgerw,no, it is not necessary to use median sweep, the default DIA-NN protein level quant can be used directly . you can use the report.gg_matrix.tsv (gene-level quant) or report.pg_matrix.tsv(protein level quant), the relevant columns are Genes (or Protein.Group and Protein.Ids), MaxLFQ, precursor count.
The analysis workflow is similar to use MaxLFQ protein level quant as input.

Yafeng

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants