How Do i remove a specific character from a specific read in a fasta file?Increasing performance of clear...

The Knight's estate

Compelling story with the world as a villain

Is there any practical application for performing a double Fourier transform? ...or an inverse Fourier transform on a time-domain input?

Why did Khan ask Admiral James T. Kirk about Project Genesis?

How would you identify when an object in a Lissajous orbit needs station keeping?

Would it be possible to have a GMO that produces chocolate?

Was it ever possible to target a zone?

Tensorflow - logistic regrssion -oneHot Encoder - Transformed array of differt size for both train and test

Is gzip atomic?

Is it possible to perform a regression where you have an unknown / unknowable feature variable?

How to investigate a problem with blown 50 amp HVAC fuse without power?

What is a CirKle Word™?

How is the idea of "two people having a heated argument" idiomatically expressed in German?

Does travel insurance for short flight delays exist?

I can see my two means are different. What information can a t test add?

How do I get a decreased-by-one x in a foreach loop?

Are modern clipless shoes and pedals that much better than toe clips and straps?

Is there any music source code for sound chips?

Justifying the use of directed energy weapons

How to prevent clipped screen edges on my TV, HDMI-connected?

What is this symbol: semicircles facing each other?

Do they have Supervillain(s)?

How to respectfully refuse to assist co-workers with IT issues?

Was there ever a treaty between 2 entities with significantly different translations to the detriment of one party?



How Do i remove a specific character from a specific read in a fasta file?


Increasing performance of clear text file readsHow to prevent grep from excessive unnecessary memory usageRemove the Nth specified character from a lineRemoving a line from a file which has only specific characterhow to remove newline characters in fasta sequencesed + remove word from specific lineHow to search for a match, then a non-match on that line, but include context lines for matched line?how to fatch fasta sequences if header line match from another fileRemove n character after specific word






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty{ margin-bottom:0;
}







0















An error occurred while processing read where a "+" symbol seems to hinder the sequence. This is the specif error that I got after i converted a multi line fasta to a single line fasta and used sed command to remove unwanted characters.



An error occurred processing read TTGTATGCAGCCCAAAAATACCTAAAATTTGGCTCGCGCGGAATGCTAATTCCACCGTTTACAGCCTCCCTTGGGTTGACGAGCCAAAAAAGTGGATCCTTCATGCATGTGTGCCCATGCTATCCTTTAATAGTAACCTTGTCGTTTATCTTAATACTCCTCCGACAACATAATAGTGATTTTACATGAAAAAGACGTGAGTCAACGTTGCGAATCCATTCCGTGACTGATTCGATCCTAGTTTCGTCGAGTTCTTCAATATCTATTTCGTGGCCACATGTTGAGCTATTTGTGACAAATTTGTCAAATTTGCGTCGCAGTCGGCCAGTCGAAACATATTTCGAGTAACTTTCGTGTTAAATATTTTCTCATTTTGATATTCATGCAGGTAGATCGAATCCCCAATATACCGTATAAAGTTATAGATCTACTCGGCTAACTCGGCTACGAATGTCAAAACCAAAGGTTAGAACACGATAGTTGTGCGAAATGGTCTCCTCAAAGAAAATTTCTAGATAATTACGTGCATGTACGTGTGAGCAGAAAACGTCATCAAAAATATCTAAATGTGAACATTGAATAGAGCTTTAAGCACTCGATACGAACAGTCATGGAAATGGAATCAAAACATCGAGCAACACATCTTTTTCATGTAGACCTTGGCTAAATTAAAGCATGATTAGATAGATCAAGAGGATTAAAAAAGATTGAACTACTTTAGTTAAACACAACATACATAATGTACATGTAACCAAACAACGGGACTTATGTTTCTCCCACATATTAGCGAAAGTGGAACTGTTAGCAACCAAACCATATGCAGAATTAGGCCCAAATCAGAGTTTCCACCTATACAACTTTCCAATGGGCCAAGTTGCAAGAGGGAAACACCCATGTGGGCACCAAACCTCCACAAAAGCAGCTCGTATGCGAGCTCTGCATGAACTTGGCCTAGCTCAGTGCAGCTAGCAAGCCAGCAGCGTAGGTCACTTCCATGTGGGTTCATATTCCCTGTACGTAGGGGCGACATATATGGAGCATGCTGCACCATGGACGAGGAGGATGGTGCTATAGCGAATTATGTGGCGGCATAGTGGGGGTGAGGGAGCAAGGCTGCACGAGTCGTTGTGGCTTGGCATGTCAGCCCCGACGTCGTGGGTGAAGGTTGATGCGGTTTCGCGAGGACTATGTGGTATGCATACGAGTGTGCGGTGGGGAAGGCTTTAGGTGGCCACACACATAGTGTTTTTATTGTGGCCACAGGCTTGAGGTAGAGAGTTTGTGATGTGGCAGGATGATAGGGTGGGATGAACGACCACCAACTCCCAAATAATTTTCCAATTAATAGCAAATATACAAAATTTCTTGCATACTCTCATTGTTTCAATGAATAATACTTCTTTTAAAAAAATCAGGCCAAGTAAAGTTTATCCAAGATGTTATATGAATATCTTATAAAAATATATTTCATGATCTTGTATCTAATGATACTACCTCCCGTCCAGATTCATAGGGTTGCGCGTATCCCTAGGTTGTAAATTTGAAAATTGATATAATATCGTTTGTATAATATAAAAAATATCATTAGAAAGAAAGTAGAACATACAAAGCTTTACTACATTGATATATATTTTTGTAATACGTATGTTGTTATCATTCTGGAATTTACGGCTTACAGTTTCTGTGTGAGTCTGTGGCCAGGACGAAAGTATTGATTTTTCATTTGCATGTTGATAACATTGAGATGGATTTTCTCCATCCTTACGTTATCTTCTCAATCTCTGCATCAGAGTTGGCTTTGGGCTCCTATACGGGAGGCGTCTGTTGGCCGTTTTAAGCGGAGCGGACAGGTGGCTCCGCATACCTTTTTTGGTGATCAGATTTGATGGGCTCGGCCCATTTACCGACCTGATGTTCTTATAGGATTTACCCTTACGAACAGCTGAGTCTTATCATCAGAAAATGATCAGCAGTTATCAATATGTTTAACTCCGTTGACCTAACGTATAGAAATATAAAATGGAATATTATATCGCGCTCTTTTGATCCTCAAAGGGCTTTTTTGGAAGGCCTATCCTCCGAGATCTAAGGCCGATCAATCGCGATCCCAGCAATCATTCCAGCCGCTGCGCATCCAGCAGAGACCTTCGAACCCTCCTCCGACGACCTGTCGGCCGCCGCCGGCGACTACCCTCCCTGGGTGATTTTCGACGAGCAGGAACGCCGCTATCGCCACAAGGAGGTCAATGATACCCCAGCCCGCTGACCCCAGGACGGTGGCGCACGCTCGCACCTCCACCGGCCACCCCGTCCAGTTTCCTTCCGCTTCGCCGCTCCCCGGCAGTTTCGTGTCTGGACCTCAGCGTCGACGACGGCTTGCTGGATGGGGTGGATATTATTTCCAGCGTCGTCGTGGCTCACGGCGATCCGTCCTCCTTACATATGATACTGTTACTGTACAAGTACGAGATCAACACGACCGACTATTTCGTCTACAATGCCGGCGACGGCGCCACGGCCGCCGTCGTTGTCGCTTCTCCCCTTGTTACATGGACGAGGAGGAACAGTCCCATGACAGACGGTACCGGTGACTATGAGTAGGAAACGCACTGGCATCCTGCGTCGCGGCGACGAGGAGTTGGTGGTGATGATGGACCTGAGGATGATGAACACTATTGCGTGCGATGATGATGGGTGGGGCAAAGCAGGTGGAAGGTGAGGTCGCGTACTCCGGTCGGGGAGTGGCAGTGGAAGGCCGAGGCGCTCCGGTCCGCCACGATGTGAGCAAAGGCGAGGTGACTTCGTACTGCTGGTGGTGGGTGTGGGAGACCGACCGGGTGATCCCCGTCGGCGACCGGTTCTTATACTGGGTAGACCTACACCGAGGCATCGTCTTCTCCGACGTGTCCGAAAGCCCCGATCTGAGGTTCGTGCCTCTGCCCGTGGAGCCTATCCTCCGTAGACGCGAGGACTGGGACCGGCCCAGTCGCGCCGCAGCCTGTGCGCCACGGCCGGCGGCGCGACTCTGAAGTTCGTCCACGTCTCACCTCGCTGCTGTTGCGAGCAATACGTAA
The error occurred at buffer index 1612084529while processing file block at offset 81604378624 to 83751862272
File offset at error location is 83216463153
File dump around this offset:
1612084479 84 T
1612084480 71 G
1612084481 65 A
1612084482 65 A
1612084483 71 G
1612084484 84 T
1612084485 84 T
1612084486 67 C
1612084487 71 G
1612084488 84 T
1612084489 67 C
1612084490 67 C
1612084491 65 A
1612084492 67 C
1612084493 71 G
1612084494 84 T
1612084495 67 C
1612084496 84 T
1612084497 67 C
1612084498 65 A
1612084499 67 C
1612084500 67 C
1612084501 84 T
1612084502 67 C
1612084503 71 G
1612084504 67 C
1612084505 84 T
1612084506 71 G
1612084507 67 C
1612084508 84 T
1612084509 71 G
1612084510 84 T
1612084511 84 T
1612084512 71 G
1612084513 67 C
1612084514 71 G
1612084515 65 A
1612084516 71 G
1612084517 67 C
1612084518 65 A
1612084519 65 A
1612084520 84 T
1612084521 65 A
1612084522 67 C
1612084523 71 G
1612084524 84 T
1612084525 65 A
1612084526 65 A
1612084527 10
1612084528 43 +
1612084529 10 Error occurred at this position.
1612084530 62 >


So is there a way to remove all of this specific read or just tweek the "+" character out of this read?



Thanks
S










share|improve this question









New contributor



sailajah vishwanathan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






















  • looks like a linefeed (10) followed by a + (43) and then another LF. can you show the sed command (and any other commands) you used to transform the input?

    – cas
    yesterday








  • 1





    also, while i am no expert in fasta files or bioinformatics, I have done some work in the field (as a programmer/sysadmin, not as a bio scientist) and i can't help but think you'd be better off manipulating such files using tools like bioperl or biopython than "rolling your own" with awk or sed. If you can't decide between perl or python, look here For what bioinformatics tasks is Biopython more adapted than Bioperl?

    – cas
    yesterday








  • 1





    BTW, if you haven't run across them before, both bioinformatics.stackexchange.com AND biostars.org are worth checking out.

    – cas
    yesterday













  • The sed command i used was sed -e '/^[^>]/s/[^ATGCatgc]//g' <input.fasta> <output.fasta>

    – sailajah vishwanathan
    yesterday











  • And for converting my fastq files into a single line fasta file i used a Python script

    – sailajah vishwanathan
    yesterday


















0















An error occurred while processing read where a "+" symbol seems to hinder the sequence. This is the specif error that I got after i converted a multi line fasta to a single line fasta and used sed command to remove unwanted characters.



An error occurred processing read TTGTATGCAGCCCAAAAATACCTAAAATTTGGCTCGCGCGGAATGCTAATTCCACCGTTTACAGCCTCCCTTGGGTTGACGAGCCAAAAAAGTGGATCCTTCATGCATGTGTGCCCATGCTATCCTTTAATAGTAACCTTGTCGTTTATCTTAATACTCCTCCGACAACATAATAGTGATTTTACATGAAAAAGACGTGAGTCAACGTTGCGAATCCATTCCGTGACTGATTCGATCCTAGTTTCGTCGAGTTCTTCAATATCTATTTCGTGGCCACATGTTGAGCTATTTGTGACAAATTTGTCAAATTTGCGTCGCAGTCGGCCAGTCGAAACATATTTCGAGTAACTTTCGTGTTAAATATTTTCTCATTTTGATATTCATGCAGGTAGATCGAATCCCCAATATACCGTATAAAGTTATAGATCTACTCGGCTAACTCGGCTACGAATGTCAAAACCAAAGGTTAGAACACGATAGTTGTGCGAAATGGTCTCCTCAAAGAAAATTTCTAGATAATTACGTGCATGTACGTGTGAGCAGAAAACGTCATCAAAAATATCTAAATGTGAACATTGAATAGAGCTTTAAGCACTCGATACGAACAGTCATGGAAATGGAATCAAAACATCGAGCAACACATCTTTTTCATGTAGACCTTGGCTAAATTAAAGCATGATTAGATAGATCAAGAGGATTAAAAAAGATTGAACTACTTTAGTTAAACACAACATACATAATGTACATGTAACCAAACAACGGGACTTATGTTTCTCCCACATATTAGCGAAAGTGGAACTGTTAGCAACCAAACCATATGCAGAATTAGGCCCAAATCAGAGTTTCCACCTATACAACTTTCCAATGGGCCAAGTTGCAAGAGGGAAACACCCATGTGGGCACCAAACCTCCACAAAAGCAGCTCGTATGCGAGCTCTGCATGAACTTGGCCTAGCTCAGTGCAGCTAGCAAGCCAGCAGCGTAGGTCACTTCCATGTGGGTTCATATTCCCTGTACGTAGGGGCGACATATATGGAGCATGCTGCACCATGGACGAGGAGGATGGTGCTATAGCGAATTATGTGGCGGCATAGTGGGGGTGAGGGAGCAAGGCTGCACGAGTCGTTGTGGCTTGGCATGTCAGCCCCGACGTCGTGGGTGAAGGTTGATGCGGTTTCGCGAGGACTATGTGGTATGCATACGAGTGTGCGGTGGGGAAGGCTTTAGGTGGCCACACACATAGTGTTTTTATTGTGGCCACAGGCTTGAGGTAGAGAGTTTGTGATGTGGCAGGATGATAGGGTGGGATGAACGACCACCAACTCCCAAATAATTTTCCAATTAATAGCAAATATACAAAATTTCTTGCATACTCTCATTGTTTCAATGAATAATACTTCTTTTAAAAAAATCAGGCCAAGTAAAGTTTATCCAAGATGTTATATGAATATCTTATAAAAATATATTTCATGATCTTGTATCTAATGATACTACCTCCCGTCCAGATTCATAGGGTTGCGCGTATCCCTAGGTTGTAAATTTGAAAATTGATATAATATCGTTTGTATAATATAAAAAATATCATTAGAAAGAAAGTAGAACATACAAAGCTTTACTACATTGATATATATTTTTGTAATACGTATGTTGTTATCATTCTGGAATTTACGGCTTACAGTTTCTGTGTGAGTCTGTGGCCAGGACGAAAGTATTGATTTTTCATTTGCATGTTGATAACATTGAGATGGATTTTCTCCATCCTTACGTTATCTTCTCAATCTCTGCATCAGAGTTGGCTTTGGGCTCCTATACGGGAGGCGTCTGTTGGCCGTTTTAAGCGGAGCGGACAGGTGGCTCCGCATACCTTTTTTGGTGATCAGATTTGATGGGCTCGGCCCATTTACCGACCTGATGTTCTTATAGGATTTACCCTTACGAACAGCTGAGTCTTATCATCAGAAAATGATCAGCAGTTATCAATATGTTTAACTCCGTTGACCTAACGTATAGAAATATAAAATGGAATATTATATCGCGCTCTTTTGATCCTCAAAGGGCTTTTTTGGAAGGCCTATCCTCCGAGATCTAAGGCCGATCAATCGCGATCCCAGCAATCATTCCAGCCGCTGCGCATCCAGCAGAGACCTTCGAACCCTCCTCCGACGACCTGTCGGCCGCCGCCGGCGACTACCCTCCCTGGGTGATTTTCGACGAGCAGGAACGCCGCTATCGCCACAAGGAGGTCAATGATACCCCAGCCCGCTGACCCCAGGACGGTGGCGCACGCTCGCACCTCCACCGGCCACCCCGTCCAGTTTCCTTCCGCTTCGCCGCTCCCCGGCAGTTTCGTGTCTGGACCTCAGCGTCGACGACGGCTTGCTGGATGGGGTGGATATTATTTCCAGCGTCGTCGTGGCTCACGGCGATCCGTCCTCCTTACATATGATACTGTTACTGTACAAGTACGAGATCAACACGACCGACTATTTCGTCTACAATGCCGGCGACGGCGCCACGGCCGCCGTCGTTGTCGCTTCTCCCCTTGTTACATGGACGAGGAGGAACAGTCCCATGACAGACGGTACCGGTGACTATGAGTAGGAAACGCACTGGCATCCTGCGTCGCGGCGACGAGGAGTTGGTGGTGATGATGGACCTGAGGATGATGAACACTATTGCGTGCGATGATGATGGGTGGGGCAAAGCAGGTGGAAGGTGAGGTCGCGTACTCCGGTCGGGGAGTGGCAGTGGAAGGCCGAGGCGCTCCGGTCCGCCACGATGTGAGCAAAGGCGAGGTGACTTCGTACTGCTGGTGGTGGGTGTGGGAGACCGACCGGGTGATCCCCGTCGGCGACCGGTTCTTATACTGGGTAGACCTACACCGAGGCATCGTCTTCTCCGACGTGTCCGAAAGCCCCGATCTGAGGTTCGTGCCTCTGCCCGTGGAGCCTATCCTCCGTAGACGCGAGGACTGGGACCGGCCCAGTCGCGCCGCAGCCTGTGCGCCACGGCCGGCGGCGCGACTCTGAAGTTCGTCCACGTCTCACCTCGCTGCTGTTGCGAGCAATACGTAA
The error occurred at buffer index 1612084529while processing file block at offset 81604378624 to 83751862272
File offset at error location is 83216463153
File dump around this offset:
1612084479 84 T
1612084480 71 G
1612084481 65 A
1612084482 65 A
1612084483 71 G
1612084484 84 T
1612084485 84 T
1612084486 67 C
1612084487 71 G
1612084488 84 T
1612084489 67 C
1612084490 67 C
1612084491 65 A
1612084492 67 C
1612084493 71 G
1612084494 84 T
1612084495 67 C
1612084496 84 T
1612084497 67 C
1612084498 65 A
1612084499 67 C
1612084500 67 C
1612084501 84 T
1612084502 67 C
1612084503 71 G
1612084504 67 C
1612084505 84 T
1612084506 71 G
1612084507 67 C
1612084508 84 T
1612084509 71 G
1612084510 84 T
1612084511 84 T
1612084512 71 G
1612084513 67 C
1612084514 71 G
1612084515 65 A
1612084516 71 G
1612084517 67 C
1612084518 65 A
1612084519 65 A
1612084520 84 T
1612084521 65 A
1612084522 67 C
1612084523 71 G
1612084524 84 T
1612084525 65 A
1612084526 65 A
1612084527 10
1612084528 43 +
1612084529 10 Error occurred at this position.
1612084530 62 >


So is there a way to remove all of this specific read or just tweek the "+" character out of this read?



Thanks
S










share|improve this question









New contributor



sailajah vishwanathan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






















  • looks like a linefeed (10) followed by a + (43) and then another LF. can you show the sed command (and any other commands) you used to transform the input?

    – cas
    yesterday








  • 1





    also, while i am no expert in fasta files or bioinformatics, I have done some work in the field (as a programmer/sysadmin, not as a bio scientist) and i can't help but think you'd be better off manipulating such files using tools like bioperl or biopython than "rolling your own" with awk or sed. If you can't decide between perl or python, look here For what bioinformatics tasks is Biopython more adapted than Bioperl?

    – cas
    yesterday








  • 1





    BTW, if you haven't run across them before, both bioinformatics.stackexchange.com AND biostars.org are worth checking out.

    – cas
    yesterday













  • The sed command i used was sed -e '/^[^>]/s/[^ATGCatgc]//g' <input.fasta> <output.fasta>

    – sailajah vishwanathan
    yesterday











  • And for converting my fastq files into a single line fasta file i used a Python script

    – sailajah vishwanathan
    yesterday














0












0








0








An error occurred while processing read where a "+" symbol seems to hinder the sequence. This is the specif error that I got after i converted a multi line fasta to a single line fasta and used sed command to remove unwanted characters.



An error occurred processing read TTGTATGCAGCCCAAAAATACCTAAAATTTGGCTCGCGCGGAATGCTAATTCCACCGTTTACAGCCTCCCTTGGGTTGACGAGCCAAAAAAGTGGATCCTTCATGCATGTGTGCCCATGCTATCCTTTAATAGTAACCTTGTCGTTTATCTTAATACTCCTCCGACAACATAATAGTGATTTTACATGAAAAAGACGTGAGTCAACGTTGCGAATCCATTCCGTGACTGATTCGATCCTAGTTTCGTCGAGTTCTTCAATATCTATTTCGTGGCCACATGTTGAGCTATTTGTGACAAATTTGTCAAATTTGCGTCGCAGTCGGCCAGTCGAAACATATTTCGAGTAACTTTCGTGTTAAATATTTTCTCATTTTGATATTCATGCAGGTAGATCGAATCCCCAATATACCGTATAAAGTTATAGATCTACTCGGCTAACTCGGCTACGAATGTCAAAACCAAAGGTTAGAACACGATAGTTGTGCGAAATGGTCTCCTCAAAGAAAATTTCTAGATAATTACGTGCATGTACGTGTGAGCAGAAAACGTCATCAAAAATATCTAAATGTGAACATTGAATAGAGCTTTAAGCACTCGATACGAACAGTCATGGAAATGGAATCAAAACATCGAGCAACACATCTTTTTCATGTAGACCTTGGCTAAATTAAAGCATGATTAGATAGATCAAGAGGATTAAAAAAGATTGAACTACTTTAGTTAAACACAACATACATAATGTACATGTAACCAAACAACGGGACTTATGTTTCTCCCACATATTAGCGAAAGTGGAACTGTTAGCAACCAAACCATATGCAGAATTAGGCCCAAATCAGAGTTTCCACCTATACAACTTTCCAATGGGCCAAGTTGCAAGAGGGAAACACCCATGTGGGCACCAAACCTCCACAAAAGCAGCTCGTATGCGAGCTCTGCATGAACTTGGCCTAGCTCAGTGCAGCTAGCAAGCCAGCAGCGTAGGTCACTTCCATGTGGGTTCATATTCCCTGTACGTAGGGGCGACATATATGGAGCATGCTGCACCATGGACGAGGAGGATGGTGCTATAGCGAATTATGTGGCGGCATAGTGGGGGTGAGGGAGCAAGGCTGCACGAGTCGTTGTGGCTTGGCATGTCAGCCCCGACGTCGTGGGTGAAGGTTGATGCGGTTTCGCGAGGACTATGTGGTATGCATACGAGTGTGCGGTGGGGAAGGCTTTAGGTGGCCACACACATAGTGTTTTTATTGTGGCCACAGGCTTGAGGTAGAGAGTTTGTGATGTGGCAGGATGATAGGGTGGGATGAACGACCACCAACTCCCAAATAATTTTCCAATTAATAGCAAATATACAAAATTTCTTGCATACTCTCATTGTTTCAATGAATAATACTTCTTTTAAAAAAATCAGGCCAAGTAAAGTTTATCCAAGATGTTATATGAATATCTTATAAAAATATATTTCATGATCTTGTATCTAATGATACTACCTCCCGTCCAGATTCATAGGGTTGCGCGTATCCCTAGGTTGTAAATTTGAAAATTGATATAATATCGTTTGTATAATATAAAAAATATCATTAGAAAGAAAGTAGAACATACAAAGCTTTACTACATTGATATATATTTTTGTAATACGTATGTTGTTATCATTCTGGAATTTACGGCTTACAGTTTCTGTGTGAGTCTGTGGCCAGGACGAAAGTATTGATTTTTCATTTGCATGTTGATAACATTGAGATGGATTTTCTCCATCCTTACGTTATCTTCTCAATCTCTGCATCAGAGTTGGCTTTGGGCTCCTATACGGGAGGCGTCTGTTGGCCGTTTTAAGCGGAGCGGACAGGTGGCTCCGCATACCTTTTTTGGTGATCAGATTTGATGGGCTCGGCCCATTTACCGACCTGATGTTCTTATAGGATTTACCCTTACGAACAGCTGAGTCTTATCATCAGAAAATGATCAGCAGTTATCAATATGTTTAACTCCGTTGACCTAACGTATAGAAATATAAAATGGAATATTATATCGCGCTCTTTTGATCCTCAAAGGGCTTTTTTGGAAGGCCTATCCTCCGAGATCTAAGGCCGATCAATCGCGATCCCAGCAATCATTCCAGCCGCTGCGCATCCAGCAGAGACCTTCGAACCCTCCTCCGACGACCTGTCGGCCGCCGCCGGCGACTACCCTCCCTGGGTGATTTTCGACGAGCAGGAACGCCGCTATCGCCACAAGGAGGTCAATGATACCCCAGCCCGCTGACCCCAGGACGGTGGCGCACGCTCGCACCTCCACCGGCCACCCCGTCCAGTTTCCTTCCGCTTCGCCGCTCCCCGGCAGTTTCGTGTCTGGACCTCAGCGTCGACGACGGCTTGCTGGATGGGGTGGATATTATTTCCAGCGTCGTCGTGGCTCACGGCGATCCGTCCTCCTTACATATGATACTGTTACTGTACAAGTACGAGATCAACACGACCGACTATTTCGTCTACAATGCCGGCGACGGCGCCACGGCCGCCGTCGTTGTCGCTTCTCCCCTTGTTACATGGACGAGGAGGAACAGTCCCATGACAGACGGTACCGGTGACTATGAGTAGGAAACGCACTGGCATCCTGCGTCGCGGCGACGAGGAGTTGGTGGTGATGATGGACCTGAGGATGATGAACACTATTGCGTGCGATGATGATGGGTGGGGCAAAGCAGGTGGAAGGTGAGGTCGCGTACTCCGGTCGGGGAGTGGCAGTGGAAGGCCGAGGCGCTCCGGTCCGCCACGATGTGAGCAAAGGCGAGGTGACTTCGTACTGCTGGTGGTGGGTGTGGGAGACCGACCGGGTGATCCCCGTCGGCGACCGGTTCTTATACTGGGTAGACCTACACCGAGGCATCGTCTTCTCCGACGTGTCCGAAAGCCCCGATCTGAGGTTCGTGCCTCTGCCCGTGGAGCCTATCCTCCGTAGACGCGAGGACTGGGACCGGCCCAGTCGCGCCGCAGCCTGTGCGCCACGGCCGGCGGCGCGACTCTGAAGTTCGTCCACGTCTCACCTCGCTGCTGTTGCGAGCAATACGTAA
The error occurred at buffer index 1612084529while processing file block at offset 81604378624 to 83751862272
File offset at error location is 83216463153
File dump around this offset:
1612084479 84 T
1612084480 71 G
1612084481 65 A
1612084482 65 A
1612084483 71 G
1612084484 84 T
1612084485 84 T
1612084486 67 C
1612084487 71 G
1612084488 84 T
1612084489 67 C
1612084490 67 C
1612084491 65 A
1612084492 67 C
1612084493 71 G
1612084494 84 T
1612084495 67 C
1612084496 84 T
1612084497 67 C
1612084498 65 A
1612084499 67 C
1612084500 67 C
1612084501 84 T
1612084502 67 C
1612084503 71 G
1612084504 67 C
1612084505 84 T
1612084506 71 G
1612084507 67 C
1612084508 84 T
1612084509 71 G
1612084510 84 T
1612084511 84 T
1612084512 71 G
1612084513 67 C
1612084514 71 G
1612084515 65 A
1612084516 71 G
1612084517 67 C
1612084518 65 A
1612084519 65 A
1612084520 84 T
1612084521 65 A
1612084522 67 C
1612084523 71 G
1612084524 84 T
1612084525 65 A
1612084526 65 A
1612084527 10
1612084528 43 +
1612084529 10 Error occurred at this position.
1612084530 62 >


So is there a way to remove all of this specific read or just tweek the "+" character out of this read?



Thanks
S










share|improve this question









New contributor



sailajah vishwanathan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











An error occurred while processing read where a "+" symbol seems to hinder the sequence. This is the specif error that I got after i converted a multi line fasta to a single line fasta and used sed command to remove unwanted characters.



An error occurred processing read TTGTATGCAGCCCAAAAATACCTAAAATTTGGCTCGCGCGGAATGCTAATTCCACCGTTTACAGCCTCCCTTGGGTTGACGAGCCAAAAAAGTGGATCCTTCATGCATGTGTGCCCATGCTATCCTTTAATAGTAACCTTGTCGTTTATCTTAATACTCCTCCGACAACATAATAGTGATTTTACATGAAAAAGACGTGAGTCAACGTTGCGAATCCATTCCGTGACTGATTCGATCCTAGTTTCGTCGAGTTCTTCAATATCTATTTCGTGGCCACATGTTGAGCTATTTGTGACAAATTTGTCAAATTTGCGTCGCAGTCGGCCAGTCGAAACATATTTCGAGTAACTTTCGTGTTAAATATTTTCTCATTTTGATATTCATGCAGGTAGATCGAATCCCCAATATACCGTATAAAGTTATAGATCTACTCGGCTAACTCGGCTACGAATGTCAAAACCAAAGGTTAGAACACGATAGTTGTGCGAAATGGTCTCCTCAAAGAAAATTTCTAGATAATTACGTGCATGTACGTGTGAGCAGAAAACGTCATCAAAAATATCTAAATGTGAACATTGAATAGAGCTTTAAGCACTCGATACGAACAGTCATGGAAATGGAATCAAAACATCGAGCAACACATCTTTTTCATGTAGACCTTGGCTAAATTAAAGCATGATTAGATAGATCAAGAGGATTAAAAAAGATTGAACTACTTTAGTTAAACACAACATACATAATGTACATGTAACCAAACAACGGGACTTATGTTTCTCCCACATATTAGCGAAAGTGGAACTGTTAGCAACCAAACCATATGCAGAATTAGGCCCAAATCAGAGTTTCCACCTATACAACTTTCCAATGGGCCAAGTTGCAAGAGGGAAACACCCATGTGGGCACCAAACCTCCACAAAAGCAGCTCGTATGCGAGCTCTGCATGAACTTGGCCTAGCTCAGTGCAGCTAGCAAGCCAGCAGCGTAGGTCACTTCCATGTGGGTTCATATTCCCTGTACGTAGGGGCGACATATATGGAGCATGCTGCACCATGGACGAGGAGGATGGTGCTATAGCGAATTATGTGGCGGCATAGTGGGGGTGAGGGAGCAAGGCTGCACGAGTCGTTGTGGCTTGGCATGTCAGCCCCGACGTCGTGGGTGAAGGTTGATGCGGTTTCGCGAGGACTATGTGGTATGCATACGAGTGTGCGGTGGGGAAGGCTTTAGGTGGCCACACACATAGTGTTTTTATTGTGGCCACAGGCTTGAGGTAGAGAGTTTGTGATGTGGCAGGATGATAGGGTGGGATGAACGACCACCAACTCCCAAATAATTTTCCAATTAATAGCAAATATACAAAATTTCTTGCATACTCTCATTGTTTCAATGAATAATACTTCTTTTAAAAAAATCAGGCCAAGTAAAGTTTATCCAAGATGTTATATGAATATCTTATAAAAATATATTTCATGATCTTGTATCTAATGATACTACCTCCCGTCCAGATTCATAGGGTTGCGCGTATCCCTAGGTTGTAAATTTGAAAATTGATATAATATCGTTTGTATAATATAAAAAATATCATTAGAAAGAAAGTAGAACATACAAAGCTTTACTACATTGATATATATTTTTGTAATACGTATGTTGTTATCATTCTGGAATTTACGGCTTACAGTTTCTGTGTGAGTCTGTGGCCAGGACGAAAGTATTGATTTTTCATTTGCATGTTGATAACATTGAGATGGATTTTCTCCATCCTTACGTTATCTTCTCAATCTCTGCATCAGAGTTGGCTTTGGGCTCCTATACGGGAGGCGTCTGTTGGCCGTTTTAAGCGGAGCGGACAGGTGGCTCCGCATACCTTTTTTGGTGATCAGATTTGATGGGCTCGGCCCATTTACCGACCTGATGTTCTTATAGGATTTACCCTTACGAACAGCTGAGTCTTATCATCAGAAAATGATCAGCAGTTATCAATATGTTTAACTCCGTTGACCTAACGTATAGAAATATAAAATGGAATATTATATCGCGCTCTTTTGATCCTCAAAGGGCTTTTTTGGAAGGCCTATCCTCCGAGATCTAAGGCCGATCAATCGCGATCCCAGCAATCATTCCAGCCGCTGCGCATCCAGCAGAGACCTTCGAACCCTCCTCCGACGACCTGTCGGCCGCCGCCGGCGACTACCCTCCCTGGGTGATTTTCGACGAGCAGGAACGCCGCTATCGCCACAAGGAGGTCAATGATACCCCAGCCCGCTGACCCCAGGACGGTGGCGCACGCTCGCACCTCCACCGGCCACCCCGTCCAGTTTCCTTCCGCTTCGCCGCTCCCCGGCAGTTTCGTGTCTGGACCTCAGCGTCGACGACGGCTTGCTGGATGGGGTGGATATTATTTCCAGCGTCGTCGTGGCTCACGGCGATCCGTCCTCCTTACATATGATACTGTTACTGTACAAGTACGAGATCAACACGACCGACTATTTCGTCTACAATGCCGGCGACGGCGCCACGGCCGCCGTCGTTGTCGCTTCTCCCCTTGTTACATGGACGAGGAGGAACAGTCCCATGACAGACGGTACCGGTGACTATGAGTAGGAAACGCACTGGCATCCTGCGTCGCGGCGACGAGGAGTTGGTGGTGATGATGGACCTGAGGATGATGAACACTATTGCGTGCGATGATGATGGGTGGGGCAAAGCAGGTGGAAGGTGAGGTCGCGTACTCCGGTCGGGGAGTGGCAGTGGAAGGCCGAGGCGCTCCGGTCCGCCACGATGTGAGCAAAGGCGAGGTGACTTCGTACTGCTGGTGGTGGGTGTGGGAGACCGACCGGGTGATCCCCGTCGGCGACCGGTTCTTATACTGGGTAGACCTACACCGAGGCATCGTCTTCTCCGACGTGTCCGAAAGCCCCGATCTGAGGTTCGTGCCTCTGCCCGTGGAGCCTATCCTCCGTAGACGCGAGGACTGGGACCGGCCCAGTCGCGCCGCAGCCTGTGCGCCACGGCCGGCGGCGCGACTCTGAAGTTCGTCCACGTCTCACCTCGCTGCTGTTGCGAGCAATACGTAA
The error occurred at buffer index 1612084529while processing file block at offset 81604378624 to 83751862272
File offset at error location is 83216463153
File dump around this offset:
1612084479 84 T
1612084480 71 G
1612084481 65 A
1612084482 65 A
1612084483 71 G
1612084484 84 T
1612084485 84 T
1612084486 67 C
1612084487 71 G
1612084488 84 T
1612084489 67 C
1612084490 67 C
1612084491 65 A
1612084492 67 C
1612084493 71 G
1612084494 84 T
1612084495 67 C
1612084496 84 T
1612084497 67 C
1612084498 65 A
1612084499 67 C
1612084500 67 C
1612084501 84 T
1612084502 67 C
1612084503 71 G
1612084504 67 C
1612084505 84 T
1612084506 71 G
1612084507 67 C
1612084508 84 T
1612084509 71 G
1612084510 84 T
1612084511 84 T
1612084512 71 G
1612084513 67 C
1612084514 71 G
1612084515 65 A
1612084516 71 G
1612084517 67 C
1612084518 65 A
1612084519 65 A
1612084520 84 T
1612084521 65 A
1612084522 67 C
1612084523 71 G
1612084524 84 T
1612084525 65 A
1612084526 65 A
1612084527 10
1612084528 43 +
1612084529 10 Error occurred at this position.
1612084530 62 >


So is there a way to remove all of this specific read or just tweek the "+" character out of this read?



Thanks
S







linux






share|improve this question









New contributor



sailajah vishwanathan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.










share|improve this question









New contributor



sailajah vishwanathan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.








share|improve this question




share|improve this question








edited yesterday









cas

41.6k4 gold badges59 silver badges111 bronze badges




41.6k4 gold badges59 silver badges111 bronze badges






New contributor



sailajah vishwanathan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.








asked yesterday









sailajah vishwanathansailajah vishwanathan

1




1




New contributor



sailajah vishwanathan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.




New contributor




sailajah vishwanathan is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.


















  • looks like a linefeed (10) followed by a + (43) and then another LF. can you show the sed command (and any other commands) you used to transform the input?

    – cas
    yesterday








  • 1





    also, while i am no expert in fasta files or bioinformatics, I have done some work in the field (as a programmer/sysadmin, not as a bio scientist) and i can't help but think you'd be better off manipulating such files using tools like bioperl or biopython than "rolling your own" with awk or sed. If you can't decide between perl or python, look here For what bioinformatics tasks is Biopython more adapted than Bioperl?

    – cas
    yesterday








  • 1





    BTW, if you haven't run across them before, both bioinformatics.stackexchange.com AND biostars.org are worth checking out.

    – cas
    yesterday













  • The sed command i used was sed -e '/^[^>]/s/[^ATGCatgc]//g' <input.fasta> <output.fasta>

    – sailajah vishwanathan
    yesterday











  • And for converting my fastq files into a single line fasta file i used a Python script

    – sailajah vishwanathan
    yesterday



















  • looks like a linefeed (10) followed by a + (43) and then another LF. can you show the sed command (and any other commands) you used to transform the input?

    – cas
    yesterday








  • 1





    also, while i am no expert in fasta files or bioinformatics, I have done some work in the field (as a programmer/sysadmin, not as a bio scientist) and i can't help but think you'd be better off manipulating such files using tools like bioperl or biopython than "rolling your own" with awk or sed. If you can't decide between perl or python, look here For what bioinformatics tasks is Biopython more adapted than Bioperl?

    – cas
    yesterday








  • 1





    BTW, if you haven't run across them before, both bioinformatics.stackexchange.com AND biostars.org are worth checking out.

    – cas
    yesterday













  • The sed command i used was sed -e '/^[^>]/s/[^ATGCatgc]//g' <input.fasta> <output.fasta>

    – sailajah vishwanathan
    yesterday











  • And for converting my fastq files into a single line fasta file i used a Python script

    – sailajah vishwanathan
    yesterday

















looks like a linefeed (10) followed by a + (43) and then another LF. can you show the sed command (and any other commands) you used to transform the input?

– cas
yesterday







looks like a linefeed (10) followed by a + (43) and then another LF. can you show the sed command (and any other commands) you used to transform the input?

– cas
yesterday






1




1





also, while i am no expert in fasta files or bioinformatics, I have done some work in the field (as a programmer/sysadmin, not as a bio scientist) and i can't help but think you'd be better off manipulating such files using tools like bioperl or biopython than "rolling your own" with awk or sed. If you can't decide between perl or python, look here For what bioinformatics tasks is Biopython more adapted than Bioperl?

– cas
yesterday







also, while i am no expert in fasta files or bioinformatics, I have done some work in the field (as a programmer/sysadmin, not as a bio scientist) and i can't help but think you'd be better off manipulating such files using tools like bioperl or biopython than "rolling your own" with awk or sed. If you can't decide between perl or python, look here For what bioinformatics tasks is Biopython more adapted than Bioperl?

– cas
yesterday






1




1





BTW, if you haven't run across them before, both bioinformatics.stackexchange.com AND biostars.org are worth checking out.

– cas
yesterday







BTW, if you haven't run across them before, both bioinformatics.stackexchange.com AND biostars.org are worth checking out.

– cas
yesterday















The sed command i used was sed -e '/^[^>]/s/[^ATGCatgc]//g' <input.fasta> <output.fasta>

– sailajah vishwanathan
yesterday





The sed command i used was sed -e '/^[^>]/s/[^ATGCatgc]//g' <input.fasta> <output.fasta>

– sailajah vishwanathan
yesterday













And for converting my fastq files into a single line fasta file i used a Python script

– sailajah vishwanathan
yesterday





And for converting my fastq files into a single line fasta file i used a Python script

– sailajah vishwanathan
yesterday










0






active

oldest

votes














Your Answer








StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "106"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});






sailajah vishwanathan is a new contributor. Be nice, and check out our Code of Conduct.










draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f536774%2fhow-do-i-remove-a-specific-character-from-a-specific-read-in-a-fasta-file%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes








sailajah vishwanathan is a new contributor. Be nice, and check out our Code of Conduct.










draft saved

draft discarded


















sailajah vishwanathan is a new contributor. Be nice, and check out our Code of Conduct.













sailajah vishwanathan is a new contributor. Be nice, and check out our Code of Conduct.












sailajah vishwanathan is a new contributor. Be nice, and check out our Code of Conduct.
















Thanks for contributing an answer to Unix & Linux Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f536774%2fhow-do-i-remove-a-specific-character-from-a-specific-read-in-a-fasta-file%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Hudson River Historic District Contents Geography History The district today Aesthetics Cultural...

The number designs the writing. Feandra Aversely Definition: The act of ingrafting a sprig or shoot of one...

Ayherre Geografie Demografie Externe links Navigatiemenu43° 23′ NB, 1° 15′ WL43° 23′ NB, 1°...