FoldSeek

Message boards : Rosetta@home Science : FoldSeek

To post messages, you must log in.

AuthorMessage
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1991
Credit: 9,520,400
RAC: 12,860
Message 104770 - Posted: 10 Feb 2022, 7:53:43 UTC

Another intersting project: FoldSeek

Foldseek enables fast and sensitive comparisons of large structure sets. It reaches sensitivities similar to state-of-the-art structural aligners while being at least 20,000 times faster.

ID: 104770 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1991
Credit: 9,520,400
RAC: 12,860
Message 106689 - Posted: 1 Aug 2022, 14:58:07 UTC - in response to Message 104770.  

Today they released a new version to targhet AlphaFold UniProt Database
ID: 106689 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1991
Credit: 9,520,400
RAC: 12,860
Message 107903 - Posted: 29 Dec 2022, 11:45:18 UTC
Last modified: 29 Dec 2022, 11:45:29 UTC

Foldseek version 4 is out
ID: 107903 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1991
Credit: 9,520,400
RAC: 12,860
Message 108119 - Posted: 24 Feb 2023, 7:17:02 UTC

ID: 108119 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1991
Credit: 9,520,400
RAC: 12,860
Message 108184 - Posted: 17 Mar 2023, 14:41:12 UTC

ID: 108184 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1991
Credit: 9,520,400
RAC: 12,860
Message 108225 - Posted: 29 Mar 2023, 7:49:28 UTC

ID: 108225 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1991
Credit: 9,520,400
RAC: 12,860
Message 108430 - Posted: 30 Jun 2023, 7:13:22 UTC

ID: 108430 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1991
Credit: 9,520,400
RAC: 12,860
Message 108582 - Posted: 12 Sep 2023, 15:13:22 UTC

ID: 108582 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1991
Credit: 9,520,400
RAC: 12,860
Message 109242 - Posted: 11 May 2024, 6:10:16 UTC

Foldseek 9 is out
ID: 109242 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1991
Credit: 9,520,400
RAC: 12,860
Message 109276 - Posted: 23 May 2024, 9:22:53 UTC

Foldseek find similarities in Alphafold DB

A new algorithm called FoldSeek (van Kempen et al. 2024) efficiently clustered over 214 million protein structures from the AlphaFold database into only 2.30 million clusters, significantly reducing the data by almost 100 times! You can group the data points together to obtain only the non-repetitive, meaningful representative structures for each group. This reduction is possible due to the intrinsic patterns shared across many protein types.

Structural analysis poses a significant time challenge, contrasting with the swiftness of sequence analysis. The authors of FoldSeek highlight this disparity: “Searching with a single query structure through a database with 100 million protein structures would take the popular TM-align tool a month on one CPU core, and an all-versus-all comparison would take 10 millennia on a 1,000-core cluster. Sequence searching is four to five orders of magnitude faster: an all-versus-all comparison of 100 million sequences would take MMseqs2 only around a week on the same cluster” (van Kempen et al. 2024). In contrast, their algorithm, FoldSeek, drastically accelerates structural comparisons while maintaining sensitivity. This speed enhancement, achieved through the development of a 3Di alphabet, facilitates large-scale structural analysis. Unlike conventional methods like CLE, 3D-BLAST, and Protein Blocks, which describe the protein backbone, 3Di characterizes interactions between closely placed residue pairs in 3D space.

ID: 109276 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1991
Credit: 9,520,400
RAC: 12,860
Message 109731 - Posted: 16 Sep 2024, 8:33:22 UTC

Foldseek is now integrated in AFBD

The AlphaFold Protein Structure Database (AFDB), co-developed by Google DeepMind and EMBL-EBI, has significantly enhanced its search capabilities by integrating Foldseek into all its entries. Users can now perform structure-based searches against the Protein Data Bank (PDB), which houses experimentally determined structures, and the AlphaFold Database.

The PDB collection is updated weekly to incorporate the latest releases from the Worldwide Protein Data Bank, ensuring users can always access the most current entries for their searches. Additionally, searching against AlphaFold models involves using a curated AFDB collection clustered at 50% sequence identity, termed AFDB50. This optimisation reduces redundancy in the searches by focusing on searching against representatives of clusters with homologous proteins.

ID: 109731 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Rosetta@home Science : FoldSeek



©2024 University of Washington
https://www.bakerlab.org