Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

function predict() doesn't finish #35

Open
SilSanGon opened this issue May 28, 2024 · 5 comments
Open

function predict() doesn't finish #35

SilSanGon opened this issue May 28, 2024 · 5 comments

Comments

@SilSanGon
Copy link

Hello!

I want to analyse my samples against 16S NCBI database, which I downloaded previously, but when I tried the last step, function predict(), its takes 5 days without results. Do you know what is the cause? or if I did something wrong?

I copy my code in:

#start blast steps
seq <- readDNAStringSet("All_combined_seqs.fasta")

#Make BLAST db and perform BLAST search
makeblastdb(path_to_seqs4_BLAST_db, dbtype = "nucl")
dbb <- blast(db=path_to_seqs4_BLAST_db)

tgz_file <- blast_db_get("16S_ribosomal_RNA.tar.gz")
untar(tgz_file, exdir = "16S_rRNA_DB")

#Load the downloaded BLAST database.
bl <- blast(db = "./16S_rRNA_DB/16S_ribosomal_RNA")
bl

#change parameters here as required
results = predict(bl, seq, BLAST_args= c("-perc_identity 99"))
write.csv(results, "blast_results.csv", row.names = FALSE)

My computer is Ubuntu 22.04.2, AMD Ryzen threadripper 1920x 12-core processor x24, 500GB of memory.
Thank you very much in advance!

@mhahsler
Copy link
Owner

Hi,

  1. Please post the output of running
    sessionInfo() and system2("blastn", "-version") after all used packages are loaded.

  2. Can you run the example in the man page successfully?

seq <- readRNAStringSet(system.file("examples/RNA_example.fasta",
       package = "rBLAST"))[1]
seq

cl <- predict(bl, seq)
cl[1:5, ]
  1. If the example works, then I will need your sequences and the code that reprodices the issue.

Regards,
Michael

@SilSanGon
Copy link
Author

SilSanGon commented May 29, 2024

Hi,

  1. Okey, I post the output
    session_info
    system2

  2. Yes, I can run the example in the man page successfully, I post too
    image

  3. I could do it, but it is heavy. How can I send it?

Maybe, I think, is it possible that the 16S_rRNA_DB database is not compatible with the readDNAStringSet() function? I'm testing with this function and not with readRNAStringSet()

Thank you very much!
Silvia

@mhahsler
Copy link
Owner

OK, let me know.

@SilSanGon
Copy link
Author

Hi Michael,

I was looking for a solution and I found that it wouldn't be a problem use 16S_rRNA database with readDNAStringSet() function in this post (https://bioinformatics.stackexchange.com/questions/4015/is-it-ok-to-use-blast-to-query-ncbis-16s-rrna-database-with-16s-dna-sequences) so what do you think it could be the problem?

I have tried to attach the script with a sample and it is impossible because it weighs more than 25Mb. Do you know other way?

@mhahsler
Copy link
Owner

Can you put the data and the script on google drive and share it with me? [email protected]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants