JupyterHub logs too large (DNA sequences)

Hi

For the last few days I have been having problems with JupyterHub log files growing too large.
A user is working with DNA sequences, and I can’t find an option to disable those log entries in JupyterHub.
An example:

2022-06-21T11:35:22.509990+02:00 by0q4n jupyterhub[10122]: AAAAAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAAAEEEEEEEEEEEEEEEEEEEAEEEEEEE/EEEE/EEE
2022-06-21T11:35:22.510035+02:00 by0q4n jupyterhub[10122]: @NB552507:86:HGYH5AFX3:1:21204:17567:14352 2:N:0:TGGGAAAGTC+ATTGCATCCT
2022-06-21T11:35:22.510073+02:00 by0q4n jupyterhub[10122]: AGAGTAAATACCTTGTAAGACAAGCTGGGTCTACGGCATCAGACCAAGGTTTAGCAACAAGGAAGTATTTCAAA
2022-06-21T11:35:22.510110+02:00 by0q4n jupyterhub[10122]: +
2022-06-21T11:35:22.510147+02:00 by0q4n jupyterhub[10122]: AAAAAEEEEEEEEEEEEEE6EEEEEEEEAEEEEEEEEAEEEE6EEEEEE/6<EEEEEEEEEEEEEEEEEEEEEE
2022-06-21T11:35:22.510183+02:00 by0q4n jupyterhub[10122]: @NB552507:86:HGYH5AFX3:1:21204:22611:14366 2:N:0:TGGGAAAGTC+ATTGCATCCT
2022-06-21T11:35:22.510220+02:00 by0q4n jupyterhub[10122]: CTCCTGTTCATATCGTAACCTCTCTCCTCCTCATTGTCTTTCTTTCTTCCTGTCTCTTATACACATCTGACGCT
2022-06-21T11:35:22.510256+02:00 by0q4n jupyterhub[10122]: +
2022-06-21T11:35:22.510293+02:00 by0q4n jupyterhub[10122]: AA/AAEEEEEE6EEEEEEEEEEEEEEEEEEEEAEEEEE/EEAE</EE<<<EEEEE<A6A6AEE//EE6A<EAE/
2022-06-21T11:35:22.510329+02:00 by0q4n jupyterhub[10122]: @NB552507:86:HGYH5AFX3:1:21204:18020:14379 2:N:0:TGGGAAAGTC+ATTGCATCCT

Does anyone have a suggestion for what I could do?

I don’t know if this will help, but everything to the right of 'jupyterhub[10122]: ' in the block you posted is lines from a FASTQ file. Each record begins with a line starting with '@', followed by the sequence, a '+' line, and the quality string. Those files can be rather large, typically on the order of several Gb, if they come from high-throughput sequencing techniques.
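To check how much of a log really is FASTQ data, something like the following rough Python sketch could be used. It assumes the log lines have the same "timestamp host jupyterhub[pid]: payload" shape as the example above; the names `PREFIX`, `payload`, and `fastq_records` are made up for illustration and are not part of JupyterHub.

```python
import re

# Assumed prefix shape, modeled on the example lines in the question:
# "<timestamp> <host> jupyterhub[<pid>]: <payload>"
PREFIX = re.compile(r"^\S+ \S+ jupyterhub\[\d+\]: ?")


def payload(line):
    """Return the text after the 'jupyterhub[pid]: ' prefix, or None if the prefix is absent."""
    match = PREFIX.match(line)
    if match is None:
        return None
    return line[match.end():].rstrip("\n")


def fastq_records(lines):
    """Yield (header, sequence, quality) for each 4-line FASTQ record found in the payloads."""
    payloads = [p for p in (payload(line) for line in lines) if p is not None]
    i = 0
    while i + 3 < len(payloads):
        header, seq, plus, qual = payloads[i:i + 4]
        # A FASTQ record is: '@' header, sequence, '+' separator, quality string
        # of the same length as the sequence.
        if header.startswith("@") and plus.startswith("+") and len(seq) == len(qual):
            yield header, seq, qual
            i += 4
        else:
            i += 1
```

Running `sum(1 for _ in fastq_records(open(path)))` on a log file (with `path` pointing at it) would give a rough count of sequence records in the log. This only measures the problem; it does not tell you which program is writing the data.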

You may want to include an example of the names of these log files. Is there one or several? My concern would be that the log output is coming from one of the many, many types of software used to analyze such high-throughput sequencing data.
