Skip to content

Random introduction of "-" #21

@Viloleal

Description

@Viloleal

Dear everyone,

I have been using Rhierbaps for some months now and have recently realised a weird behaviour that I would like to point out.

I have loaded a multialignment fasta file using the load.fasta function and when I checked the dataframe I realised that there were "-" along the different samples. This at first wasn't strange, since there are some samples that have one or two ambiguous calls like K or R. However, what called my attention was the fact that the "-" were appearing in more samples, including those that did not have any ambiguous calls. In the example below, you can see that sample 10 has a "-" at position n3, whereas in the msa this should be a "T".

image

I asked a colleague of mine to repeat this using her computer and she got the same results. She then checked the dataframes from her data and realised that this happens as well on her data:

image

Again, there is a "T" at position n1 that is being called as a "-". We have also seen that this happens as well with "Gs".

This appears to happen randomly throughout the data frame, but the errors do appear in the same places each time we run the load.fasta function. We would like to know if you have seen this behaviour before and know what could be the cause?

Our Rhierbaps version is 1.1.4 and R version are 4.2.3 and 4.1.3, respectively.

Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions