Posts: 224
Threads: 4
Joined: Feb 2024
(10-12-2024, 09:02 AM)Genetics189291 Wrote: (10-12-2024, 12:12 AM)miquirumba Wrote: (10-11-2024, 11:47 PM)Genetics189291 Wrote: Does he have plans to add it in the future?
He has updated WGSE DEV a few weeks ago. He said that maybe it would be better to combine all SNPs (AADR, 1240K, HO and others) in the same template as allCombined.kit that's easier to program because all new sci raw data templates are subset of main combined tab.gz
Do you have the link for the aadr 1240k tab.gz file same format as All_SNPs_GRCh37_ref.tab
https://drive.google.com/file/d/1tzHqeKV...sp=sharing
I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month
Posts: 503
Threads: 72
Joined: Nov 2023
Gender: Male
Ethnicity: Arab
(10-12-2024, 10:36 AM)miquirumba Wrote: (10-12-2024, 09:02 AM)Genetics189291 Wrote: (10-12-2024, 12:12 AM)miquirumba Wrote: He has updated WGSE DEV a few weeks ago. He said that maybe it would be better to combine all SNPs (AADR, 1240K, HO and others) in the same template as allCombined.kit that's easier to program because all new sci raw data templates are subset of main combined tab.gz
Do you have the link for the aadr 1240k tab.gz file same format as All_SNPs_GRCh37_ref.tab
https://drive.google.com/file/d/1tzHqeKV...sp=sharing
I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month
Let me try this I’ll get back to you
Posts: 503
Threads: 72
Joined: Nov 2023
Gender: Male
Ethnicity: Arab
(10-12-2024, 10:36 AM)miquirumba Wrote: (10-12-2024, 09:02 AM)Genetics189291 Wrote: (10-12-2024, 12:12 AM)miquirumba Wrote: He has updated WGSE DEV a few weeks ago. He said that maybe it would be better to combine all SNPs (AADR, 1240K, HO and others) in the same template as allCombined.kit that's easier to program because all new sci raw data templates are subset of main combined tab.gz
Do you have the link for the aadr 1240k tab.gz file same format as All_SNPs_GRCh37_ref.tab
https://drive.google.com/file/d/1tzHqeKV...sp=sharing
I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month
I combined and then sorted by chromosome position if you want the file I can give it to you.
Posts: 224
Threads: 4
Joined: Feb 2024
(10-13-2024, 01:13 AM)Genetics189291 Wrote: (10-12-2024, 10:36 AM)miquirumba Wrote: (10-12-2024, 09:02 AM)Genetics189291 Wrote: Do you have the link for the aadr 1240k tab.gz file same format as All_SNPs_GRCh37_ref.tab
https://drive.google.com/file/d/1tzHqeKV...sp=sharing
I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month
I combined and then sorted by chromosome position if you want the file I can give it to you. share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
Posts: 503
Threads: 72
Joined: Nov 2023
Gender: Male
Ethnicity: Arab
(10-13-2024, 11:21 AM)miquirumba Wrote: (10-13-2024, 01:13 AM)Genetics189291 Wrote: (10-12-2024, 10:36 AM)miquirumba Wrote: same format as All_SNPs_GRCh37_ref.tab
https://drive.google.com/file/d/1tzHqeKV...sp=sharing
I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month
I combined and then sorted by chromosome position if you want the file I can give it to you. share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
I’ll do it when I get home from work
Posts: 503
Threads: 72
Joined: Nov 2023
Gender: Male
Ethnicity: Arab
(10-13-2024, 11:21 AM)miquirumba Wrote: (10-13-2024, 01:13 AM)Genetics189291 Wrote: (10-12-2024, 10:36 AM)miquirumba Wrote: same format as All_SNPs_GRCh37_ref.tab
https://drive.google.com/file/d/1tzHqeKV...sp=sharing
I had problems to index with tabix, you could unzip and combine with All_SNPs_GRCh37_ref.tab to get all SNPs (maybe will work saved as All_SNPs_GRCh37_ref.tab)
I haven't time now to work in WGSE development, perhaps next month
I combined and then sorted by chromosome position if you want the file I can give it to you. share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
e
easyupload.io/wug627
That’s the download link let me know how it goes
Posts: 224
Threads: 4
Joined: Feb 2024
10-13-2024, 06:57 PM
(This post was last modified: 10-13-2024, 06:59 PM by miquirumba.)
(10-13-2024, 05:50 PM)Genetics189291 Wrote: (10-13-2024, 11:21 AM)miquirumba Wrote: (10-13-2024, 01:13 AM)Genetics189291 Wrote: I combined and then sorted by chromosome position if you want the file I can give it to you. share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
[/quo [quote pid="36573" dateline="1728841807"]
e
easyupload.io/wug627
That’s the download link let me know how it goes
FILE NOT FOUND
File may be expired or removed by easyupload.io because of a policy violation
Posts: 503
Threads: 72
Joined: Nov 2023
Gender: Male
Ethnicity: Arab
(10-13-2024, 06:57 PM)miquirumba Wrote: (10-13-2024, 05:50 PM)Genetics189291 Wrote: (10-13-2024, 11:21 AM)miquirumba Wrote: share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
[/quo [quote pid="36573" dateline="1728841807"]
e
easyupload.io/wug627
That’s the download link let me know how it goes
FILE NOT FOUND
File may be expired or removed by easyupload.io because of a policy violation
[/quote]
Let me reupload it somewhere else
Posts: 503
Threads: 72
Joined: Nov 2023
Gender: Male
Ethnicity: Arab
(10-13-2024, 06:57 PM)miquirumba Wrote: (10-13-2024, 05:50 PM)Genetics189291 Wrote: (10-13-2024, 11:21 AM)miquirumba Wrote: share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
[/quo [quote pid="36573" dateline="1728841807"]
e
easyupload.io/wug627
That’s the download link let me know how it goes
FILE NOT FOUND
File may be expired or removed by easyupload.io because of a policy violation
[/quote]
It’s still showing for me weird I’ll upload it else where though
Posts: 503
Threads: 72
Joined: Nov 2023
Gender: Male
Ethnicity: Arab
(10-13-2024, 06:57 PM)miquirumba Wrote: (10-13-2024, 05:50 PM)Genetics189291 Wrote: (10-13-2024, 11:21 AM)miquirumba Wrote: share the link to your combined tab file, please
I'll try to run in WGSE DEV Ubuntu version
[/quo [quote pid="36573" dateline="1728841807"]
e
easyupload.io/wug627
That’s the download link let me know how it goes
FILE NOT FOUND
File may be expired or removed by easyupload.io because of a policy violation
[/quote]
https://gofile.io/d/d2egCn
here is a new link
Posts: 224
Threads: 4
Joined: Feb 2024
(10-14-2024, 12:09 AM)Genetics189291 Wrote: (10-13-2024, 06:57 PM)miquirumba Wrote: (10-13-2024, 05:50 PM)Genetics189291 Wrote: [quote pid="36573" dateline="1728841807"]
e
easyupload.io/wug627
That’s the download link let me know how it goes
FILE NOT FOUND
File may be expired or removed by easyupload.io because of a policy violation https://gofile.io/d/d2egCn
here is a new link
[/quote]
downloaded, thanks
3,314,072 SNPs with rsID, a good improvement. I'll check this week
Posts: 224
Threads: 4
Joined: Feb 2024
most of new SNPs are duplicates, only about 200,000 new SNPs running bcftools
0 number of records: 2495890
SN 0 number of no-ALTs: 1775780
SN 0 number of SNPs: 718483
SN 0 number of MNPs: 0
SN 0 number of indels: 1627
surely is the reason WGSE crashed trying to extract them
I'll remove duplicates this week and add Twist Bioscience 1.4M 100,000 extra SNPs
Posts: 503
Threads: 72
Joined: Nov 2023
Gender: Male
Ethnicity: Arab
(10-16-2024, 05:13 PM)miquirumba Wrote: most of new SNPs are duplicates, only about 200,000 new SNPs running bcftools
0 number of records: 2495890
SN 0 number of no-ALTs: 1775780
SN 0 number of SNPs: 718483
SN 0 number of MNPs: 0
SN 0 number of indels: 1627
surely is the reason WGSE crashed trying to extract them
I'll remove duplicates this week and add Twist Bioscience 1.4M 100,000 extra SNPs
What I tried to do is take the dnakitstudio template 1240k and replace it in the wgsextractfolder which worked but only issue is it caused too much no calls
Posts: 224
Threads: 4
Joined: Feb 2024
(10-19-2024, 08:23 PM)Genetics189291 Wrote: (10-16-2024, 05:13 PM)miquirumba Wrote: most of new SNPs are duplicates, only about 200,000 new SNPs running bcftools
0 number of records: 2495890
SN 0 number of no-ALTs: 1775780
SN 0 number of SNPs: 718483
SN 0 number of MNPs: 0
SN 0 number of indels: 1627
surely is the reason WGSE crashed trying to extract them
I'll remove duplicates this week and add Twist Bioscience 1.4M 100,000 extra SNPs
What I tried to do is take the dnakitstudio template 1240k and replace it in the wgsextractfolder which worked but only issue is it caused too much no calls There is no reason for a lot no calls, I am sure you are using dnakutStudio Build37 template running against your hg38 CRAM. Different reference genomes don't match many SNPs then you have a lot of no calls because you need a 1240k for hg38 reference
Posts: 503
Threads: 72
Joined: Nov 2023
Gender: Male
Ethnicity: Arab
(10-20-2024, 10:50 AM)miquirumba Wrote: (10-19-2024, 08:23 PM)Genetics189291 Wrote: (10-16-2024, 05:13 PM)miquirumba Wrote: most of new SNPs are duplicates, only about 200,000 new SNPs running bcftools
0 number of records: 2495890
SN 0 number of no-ALTs: 1775780
SN 0 number of SNPs: 718483
SN 0 number of MNPs: 0
SN 0 number of indels: 1627
surely is the reason WGSE crashed trying to extract them
I'll remove duplicates this week and add Twist Bioscience 1.4M 100,000 extra SNPs
What I tried to do is take the dnakitstudio template 1240k and replace it in the wgsextractfolder which worked but only issue is it caused too much no calls There is no reason for a lot no calls, I am sure you are using dnakutStudio Build37 template running against your hg38 CRAM. Different reference genomes don't match many SNPs then you have a lot of no calls because you need a 1240k for hg38 reference
It’s still using a lift over, all the templates are hg37 it just lift overs my cram
When converting the file. Do you have a template for hg38?
|