Nebula k36 vs Gedmatch
#16
(10-12-2024, 09:02 AM)Genetics189291 Wrote:
(10-12-2024, 12:12 AM)miquirumba Wrote:
(10-11-2024, 11:47 PM)Genetics189291 Wrote: Does he have plans to add it in the future?

He has updated WGSE DEV a few weeks ago. He said that maybe it would be better to combine all SNPs (AADR, 1240K, HO and others) in the same template as allCombined.kit that's easier to program because all new sci raw data templates are subset of main combined tab.gz

Do you have the link for the aadr 1240k tab.gz file
same format as All_SNPs_GRCh37_ref.tab

https://drive.google.com/file/d/1tzHqeKV...sp=sharing

I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month
Reply
#17
(10-12-2024, 10:36 AM)miquirumba Wrote:
(10-12-2024, 09:02 AM)Genetics189291 Wrote:
(10-12-2024, 12:12 AM)miquirumba Wrote: He has updated WGSE DEV a few weeks ago. He said that maybe it would be better to combine all SNPs (AADR, 1240K, HO and others) in the same template as allCombined.kit that's easier to program because all new sci raw data templates are subset of main combined tab.gz

Do you have the link for the aadr 1240k tab.gz file
same format as All_SNPs_GRCh37_ref.tab

https://drive.google.com/file/d/1tzHqeKV...sp=sharing

I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month

Let me try this I’ll get back to you
Reply
#18
(10-12-2024, 10:36 AM)miquirumba Wrote:
(10-12-2024, 09:02 AM)Genetics189291 Wrote:
(10-12-2024, 12:12 AM)miquirumba Wrote: He has updated WGSE DEV a few weeks ago. He said that maybe it would be better to combine all SNPs (AADR, 1240K, HO and others) in the same template as allCombined.kit that's easier to program because all new sci raw data templates are subset of main combined tab.gz

Do you have the link for the aadr 1240k tab.gz file
same format as All_SNPs_GRCh37_ref.tab

https://drive.google.com/file/d/1tzHqeKV...sp=sharing

I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month

I combined and then sorted by chromosome position if you want the file I can give it to you.
Reply
#19
(10-13-2024, 01:13 AM)Genetics189291 Wrote:
(10-12-2024, 10:36 AM)miquirumba Wrote:
(10-12-2024, 09:02 AM)Genetics189291 Wrote: Do you have the link for the aadr 1240k tab.gz file
same format as All_SNPs_GRCh37_ref.tab

https://drive.google.com/file/d/1tzHqeKV...sp=sharing

I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month

I combined and then sorted by chromosome position if you want the file I can give it to you.
share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
Reply
#20
(10-13-2024, 11:21 AM)miquirumba Wrote:
(10-13-2024, 01:13 AM)Genetics189291 Wrote:
(10-12-2024, 10:36 AM)miquirumba Wrote: same format as All_SNPs_GRCh37_ref.tab

https://drive.google.com/file/d/1tzHqeKV...sp=sharing

I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month

I combined and then sorted by chromosome position if you want the file I can give it to you.
share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version

I’ll do it when I get home from work
Reply
#21
(10-13-2024, 11:21 AM)miquirumba Wrote:
(10-13-2024, 01:13 AM)Genetics189291 Wrote:
(10-12-2024, 10:36 AM)miquirumba Wrote: same format as All_SNPs_GRCh37_ref.tab

https://drive.google.com/file/d/1tzHqeKV...sp=sharing

I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month

I combined and then sorted by chromosome position if you want the file I can give it to you.
share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version

e

easyupload.io/wug627

That’s the download link let me know how it goes
Reply
#22
(10-13-2024, 05:50 PM)Genetics189291 Wrote:
(10-13-2024, 11:21 AM)miquirumba Wrote:
(10-13-2024, 01:13 AM)Genetics189291 Wrote: I combined and then sorted by chromosome position if you want the file I can give it to you.
share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
[/quo
[quote pid="36573" dateline="1728841807"]
e

easyupload.io/wug627

That’s the download link let me know how it goes

FILE NOT FOUND
File may be expired or removed by easyupload.io because of a policy violation
Reply
#23
(10-13-2024, 06:57 PM)miquirumba Wrote:
(10-13-2024, 05:50 PM)Genetics189291 Wrote:
(10-13-2024, 11:21 AM)miquirumba Wrote: share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
[/quo
[quote pid="36573" dateline="1728841807"]
e

easyupload.io/wug627

That’s the download link let me know how it goes

FILE NOT FOUND
File may be expired or removed by easyupload.io because of a policy violation

[/quote]
Let me reupload it somewhere else
Reply
#24
(10-13-2024, 06:57 PM)miquirumba Wrote:
(10-13-2024, 05:50 PM)Genetics189291 Wrote:
(10-13-2024, 11:21 AM)miquirumba Wrote: share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
[/quo
[quote pid="36573" dateline="1728841807"]
e

easyupload.io/wug627

That’s the download link let me know how it goes

FILE NOT FOUND
File may be expired or removed by easyupload.io because of a policy violation

[/quote]
It’s still showing for me weird I’ll upload it else where though
Reply
#25
(10-13-2024, 06:57 PM)miquirumba Wrote:
(10-13-2024, 05:50 PM)Genetics189291 Wrote:
(10-13-2024, 11:21 AM)miquirumba Wrote: share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
[/quo
[quote pid="36573" dateline="1728841807"]
e

easyupload.io/wug627

That’s the download link let me know how it goes

FILE NOT FOUND
File may be expired or removed by easyupload.io because of a policy violation

[/quote]
https://gofile.io/d/d2egCn

here is a new link
Reply
#26
(10-14-2024, 12:09 AM)Genetics189291 Wrote:
(10-13-2024, 06:57 PM)miquirumba Wrote:
(10-13-2024, 05:50 PM)Genetics189291 Wrote: [quote pid="36573" dateline="1728841807"]
e

easyupload.io/wug627

That’s the download link let me know how it goes

FILE NOT FOUND
File may be expired or removed by easyupload.io because of a policy violation
https://gofile.io/d/d2egCn

here is a new link
[/quote]
downloaded, thanks

3,314,072 SNPs with rsID, a good improvement. I'll check this week
Reply
#27
most of new SNPs are duplicates, only about 200,000 new SNPs running bcftools
0 number of records: 2495890
SN 0 number of no-ALTs: 1775780
SN 0 number of SNPs: 718483
SN 0 number of MNPs: 0
SN 0 number of indels: 1627

surely is the reason WGSE crashed trying to extract them
I'll remove duplicates this week and add Twist Bioscience 1.4M 100,000 extra SNPs
Reply
#28
(10-16-2024, 05:13 PM)miquirumba Wrote: most of new SNPs are duplicates, only about 200,000 new SNPs running bcftools
0 number of records: 2495890
SN 0 number of no-ALTs: 1775780
SN 0 number of SNPs: 718483
SN 0 number of MNPs: 0
SN 0 number of indels: 1627

surely is the reason WGSE crashed trying to extract them
I'll remove duplicates this week and add Twist Bioscience 1.4M 100,000 extra SNPs

What I tried to do is take the dnakitstudio template 1240k and replace it in the wgsextractfolder which worked but only issue is it caused too much no calls
Reply
#29
(10-19-2024, 08:23 PM)Genetics189291 Wrote:
(10-16-2024, 05:13 PM)miquirumba Wrote: most of new SNPs are duplicates, only about 200,000 new SNPs running bcftools
0 number of records: 2495890
SN 0 number of no-ALTs: 1775780
SN 0 number of SNPs: 718483
SN 0 number of MNPs: 0
SN 0 number of indels: 1627

surely is the reason WGSE crashed trying to extract them
I'll remove duplicates this week and add Twist Bioscience 1.4M 100,000 extra SNPs

What I tried to do is take the dnakitstudio template 1240k and replace it in the wgsextractfolder which worked but only issue is it caused too much no calls
There is no reason for a lot no calls, I am sure you are using dnakutStudio Build37 template running against your hg38 CRAM. Different reference genomes don't match many SNPs then you have a lot of no calls because you need a 1240k for hg38 reference
Reply
#30
(10-20-2024, 10:50 AM)miquirumba Wrote:
(10-19-2024, 08:23 PM)Genetics189291 Wrote:
(10-16-2024, 05:13 PM)miquirumba Wrote: most of new SNPs are duplicates, only about 200,000 new SNPs running bcftools
0 number of records: 2495890
SN 0 number of no-ALTs: 1775780
SN 0 number of SNPs: 718483
SN 0 number of MNPs: 0
SN 0 number of indels: 1627

surely is the reason WGSE crashed trying to extract them
I'll remove duplicates this week and add Twist Bioscience 1.4M 100,000 extra SNPs

What I tried to do is take the dnakitstudio template 1240k and replace it in the wgsextractfolder which worked but only issue is it caused too much no calls
There is no reason for a lot no calls, I am sure you are using dnakutStudio Build37 template running against your hg38 CRAM. Different reference genomes don't match many SNPs then you have a lot of no calls because you need a 1240k for hg38 reference

It’s still using a lift over, all the templates are hg37 it just lift overs my cram
When converting the file. Do you have a template for hg38?
Reply


Forum Jump:


Users browsing this thread: 1 Guest(s)