<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-7853142166506564202</id><updated>2012-01-10T22:58:05.983-05:00</updated><category term='structural biology'/><category term='algorithm'/><category term='3DNA-related'/><category term='bioinformatics'/><category term='programming'/><category term='misc'/><title type='text'>Xiang-Jun's Corner</title><subtitle type='html'>Random thoughts, mostly on scientific issues</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><link rel='next' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default?start-index=101&amp;max-results=100'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>125</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-1548708909482804285</id><published>2011-10-30T23:12:00.000-04:00</published><updated>2011-10-30T23:12:05.745-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>3DNA base-mutation functionality for the study of pretein-DNA interactions</title><content type='html'>In my last blog post titled "&lt;a href="http://xiang-jun.blogspot.com/2011/10/mutatebases-new-3dna-tool-for-base.html"&gt;mutate_bases, a new 3DNA tool for in silico base mutations of 3D nucleic acid structures&lt;/a&gt;", I outlined the availability of the &lt;b&gt;mutate_bases&lt;/b&gt; program and some areas of its possible applications, including to "perform base-pair mutations in DNA-protein complexes".&lt;br /&gt;&lt;br /&gt;In the September 6, 2011 issue of &lt;i&gt;PNAS&lt;/i&gt; (vol. 108, no. 36), AlQuraishi and McAdamsa from Stanford University published a high-profile article "&lt;a href="http://www.pnas.org/content/108/36/14819.abstract"&gt;Direct inference of protein–DNA interactions using compressed sensing methods&lt;/a&gt;" (pp. 14819–14824). See also the thoughtful Commentary by Vijay Pande, "&lt;a href="http://www.pnas.org/content/108/36/14713.extract"&gt;(Compressed) sensing and sensibility&lt;/a&gt;". Essentially, by combining concepts from compressed sensing and statistical mechanics, AlQuraishi and McAdamsa have developed a novel approach to determine the energy potential of protein–DNA complexes, leading to "an impressive advance in predictive capability" -- ~90% accuracy compared with ~60% for the best-performing alternative computational methods. &lt;br /&gt;&lt;br /&gt;I am so glad to find that 3DNA (ref nos. 12 and 13) was used in the work: &lt;br /&gt;&lt;blockquote&gt;&lt;b&gt;&lt;i&gt;In silico mutagenesis was carried out using the 3DNA software package (12, 13), which maintains the backbone atoms of the DNA molecule, but replaces the base pair atoms in a way that is consistent with the backbone orientation in the crystal.&lt;/i&gt;&lt;/b&gt; (p.14821, middle of the right column)&lt;/blockquote&gt;&lt;br /&gt;Hopefully, the new and novel base mutation functionality in 3DNA will find more applications and be in wider use. Surely, I'll respond promptly to users' feedback and refine related 3DNA tools as necessary.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-1548708909482804285?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/1548708909482804285/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/10/3dna-base-mutation-functionality-for.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1548708909482804285'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1548708909482804285'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/10/3dna-base-mutation-functionality-for.html' title='3DNA base-mutation functionality for the study of pretein-DNA interactions'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-6850122961870449720</id><published>2011-10-25T21:45:00.002-04:00</published><updated>2011-10-30T21:59:33.326-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>mutate_bases, a new 3DNA tool for in silico base mutations of 3D nucleic acid structures</title><content type='html'>In response to repeated requests from 3DNA users over the years, I recently (well, a few months ago) &lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=1&amp;t=210&amp;p=632#p632"&gt;released a new utility program, &lt;b&gt;mutate_bases&lt;/b&gt;, to the 3DNA suite of software package&lt;/a&gt;. As its name implies, &lt;b&gt;mutate_bases&lt;/b&gt; can be used for mutating bases in a nucleic acid structure, with two key and unique features:&amp;nbsp;(1) the sugar-phosphate backbone conformation is untouched; (2) the base reference frame (position and orientation) is reserved, i.e., the mutated structure shares the same base-pair/step parameters as the original one.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;mutate_bases&lt;/b&gt; is a standalone ANSI C program, on a par with other major 3DNA programs (e.g., &lt;b&gt;find_pair&lt;/b&gt;, &lt;b&gt;analyze&lt;/b&gt;, &lt;b&gt;rebuild&lt;/b&gt;, &lt;b&gt;fiber&lt;/b&gt; etc). As seen from the help message below, it can be used for any nucleic-acid-containing structures (DNA, RNA, or their complexes) in PDB format:&lt;br /&gt;&lt;pre style="background-color: #eeeeee; margin: 10px; font-size: 9px;"&gt;------------------------------------------------------------------------&lt;br /&gt;Usage: mutate_bases mutinfo pdbfile outfile&lt;br /&gt;    'mutinfo' can contain upto 5 fields for each mutation:&lt;br /&gt;        [name=residue_name] [icode=insertion_code]&lt;br /&gt;        chain=chain_id seqnum=residue_number&lt;br /&gt;        mutation=residue_name&lt;br /&gt;    Alternatively, 'mutinfo' can be specified with the '-l' option&lt;br /&gt;        followed by a mutation file: '&lt;span class="Apple-style-span" style="color: red;"&gt;-l mutations.dat&lt;/span&gt;'&lt;br /&gt;    o The five fields per mutation can be in any order or CaSe&lt;br /&gt;    o Each field can be abbreviated to its first character&lt;br /&gt;    o Multiple mutations are separated by ';'&lt;br /&gt;    o Fields in [] (i.e., name and icode) are optional&lt;br /&gt;    o Mutation info should be QUOTED to be taken as one entry&lt;br /&gt;&lt;br /&gt;Examples:&lt;br /&gt;    mutate_bases 'c=a s=2 m=DA' 355d.pdb 355d_G2A.pdb&lt;br /&gt;    # mutate G2 in chain A of B-DNA 355d to A&lt;br /&gt;    mutate_bases 'c=a s=2 m=DA; c=B s=23 m=DT' 355d.pdb 355d_mutfile.pdb&lt;br /&gt;    # mutate base-pair G-C (C23 in chain B) to A-T&lt;br /&gt;        # also create file 'mutations.dat', see below&lt;br /&gt;    &lt;span class="Apple-style-span" style="color: red;"&gt;mutate_bases -l mutations.dat 355d.pdb 355d_mutfile.pdb&lt;br /&gt;        # 355d_mutfile.pdb and 355d_mutfile.pdb would be identical&lt;/span&gt;&lt;br /&gt;    mutate_bases 'c=A s=74 m=U' 1evv.pdb 1evv_C74U.pdb&lt;br /&gt;    # mutate C74 in chain A of tRNA 1evv to U&lt;br /&gt;------------------------------------------------------------------------&lt;br /&gt;&lt;/pre&gt;&lt;br /&gt;&lt;b&gt;mutate_bases&lt;/b&gt; is designed to solve the base mutation problem in a practical sense: robust and efficient, getting its job done and then out of the way. The program can have many possible applications: in addition to perform base-pair mutations in DNA-protein complexes, it should also prove handy RNA modeling and in providing initial structures for QM/MM/MD energy calculations, and in DNA/RNA modeling studies.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-6850122961870449720?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/6850122961870449720/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/10/mutatebases-new-3dna-tool-for-base.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6850122961870449720'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6850122961870449720'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/10/mutatebases-new-3dna-tool-for-base.html' title='mutate_bases, a new 3DNA tool for in silico base mutations of 3D nucleic acid structures'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5122676164353349673</id><published>2011-07-29T23:46:00.001-04:00</published><updated>2011-07-30T07:05:03.691-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Tracking 3DNA citations: Google scholar vs. Web of Science</title><content type='html'>Over the years, I have been &lt;a href="http://chem.rutgers.edu/~xiangjun/3DNA/"&gt;following citations to 3DNA on a regular basis&lt;/a&gt;, using both &lt;b&gt;Google Scholar&lt;/b&gt; and &lt;b&gt;Web of Science&lt;/b&gt;. From a personal perspective, following the citations has turned out to be an excellent way to keep myself informed of progress related to nucleic acid structures.&lt;br /&gt;&lt;br /&gt;Up to early this year, I had found 3DNA citations based on &lt;b&gt;Google Scholar&lt;/b&gt; to be significantly larger than those from &lt;b&gt;Web of Science&lt;/b&gt;. For example, for &lt;a href="http://nar.oxfordjournals.org/content/31/17/5108.full"&gt;the 2003 &lt;i&gt;Nucleic Acids Research&lt;/i&gt; (NAR) paper&lt;/a&gt;, the difference was well over one hundred. Over the past few months, however, I have noticed that citation numbers to the 2003 NAR paper fluctuating up and down in dozens. Moreover, I have been receiving significantly less email notifications to 3DNA citations from &lt;b&gt;Google Scholar&lt;/b&gt;.  Apparently, Google has been revising the algorithms for &lt;b&gt;Google Scholar&lt;/b&gt;, leading to more conservative citation numbers that are now comparable to those from &lt;b&gt;Web of Science&lt;/b&gt;. &lt;br /&gt;&lt;br /&gt;Here are the current citation numbers to the three 3DNA publications, from both &lt;b&gt;Google Scholar&lt;/b&gt; and &lt;b&gt;Web of Science&lt;/b&gt;:&lt;br /&gt;&lt;br /&gt;&lt;table style="border-spacing: 10px; border-width: 1px; background-color: #eee;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;th&gt;&lt;/th&gt;&lt;th&gt;Google Scholar&lt;/th&gt;&lt;th&gt;Web of Science&lt;/th&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;&lt;a href="http://nar.oxfordjournals.org/content/31/17/5108.full"&gt;2003 NAR&lt;/a&gt;&lt;/td&gt;&lt;td&gt;544&lt;/td&gt;&lt;td&gt;473&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;&lt;a href="http://www.nature.com/nprot/journal/v3/n7/full/nprot.2008.104.html"&gt;2008 &lt;i&gt;Nature Protocols&lt;/i&gt;&lt;/a&gt;&lt;/td&gt;&lt;td&gt;43&lt;/td&gt;&lt;td&gt;41&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;&lt;a href="http://nar.oxfordjournals.org/content/37/suppl_2/W240.full"&gt;2009 NAR (w3DNA)&lt;/a&gt;&lt;/td&gt;&lt;td&gt;11&lt;/td&gt;&lt;td&gt;13&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;It is interesting to note that for the w3DNA paper, &lt;b&gt;Web of Science&lt;/b&gt; gives a larger number (13) than &lt;b&gt;Google Scholar&lt;/b&gt;. A careful check of the citation result from &lt;b&gt;Web of Science&lt;/b&gt; shows that all of the 13 citations are legitimate journal articles. Clearly, &lt;b&gt;Web of Science&lt;/b&gt; is missing something in this case.&lt;br /&gt;&lt;br /&gt;Overall, I trust &lt;b&gt;Web of Science&lt;/b&gt; more, which is why I have been using it to compile &lt;a href="http://chem.rutgers.edu/~xiangjun/3DNA/citations.html"&gt;citations to the 2003 NAR paper&lt;/a&gt;. However, I use &lt;b&gt;Google Scholar&lt;/b&gt; more frequently for quick reference simply because it is free and easy to access.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5122676164353349673?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5122676164353349673/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/07/tracking-3dna-citations-google-scholar.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5122676164353349673'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5122676164353349673'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/07/tracking-3dna-citations-google-scholar.html' title='Tracking 3DNA citations: Google scholar vs. Web of Science'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5733618139724595966</id><published>2011-07-14T23:18:00.000-04:00</published><updated>2011-07-14T23:18:16.059-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>GpU dinucleotide platform, the smallest unit with key RNA structural features</title><content type='html'>Compared to DNA, RNA has three salient structural features: it contains ribose sugar, uracil, and is normally single-standed. The O2'(G)...O2P(U) H-bond stabilized &lt;a href="http://xiang-jun.blogspot.com/2010/04/whats-special-about-gpu-dinucleotide.html"&gt;GpU dinucleotide platform&lt;/a&gt; may turn out to be the smallest unit with all those RNA hallmarks (see Figure below).&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-ufXG8FOlpdk/Th-qUP2FTrI/AAAAAAAAAGw/-EvyEfl1ryQ/s1600/lux_GU.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/-ufXG8FOlpdk/Th-qUP2FTrI/AAAAAAAAAGw/-EvyEfl1ryQ/s320/lux_GU.png" width="230" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Firstly, it must have the guanosine ribose to form the O2'(G)...O2P(U) H-bond.&lt;br /&gt;&lt;br /&gt;Secondly, the methyl group in position 5 of thymine would cause steric clash with&amp;nbsp;guanosine, thus disrupting the N2(G)...O4(U) base-base H-bond to form the GpU dinucleotide platform.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-b7TLTsoVLsQ/Th-sxEud5WI/AAAAAAAAAG4/EtFEFG1EeBs/s1600/lux_GT.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/-b7TLTsoVLsQ/Th-sxEud5WI/AAAAAAAAAG4/EtFEFG1EeBs/s200/lux_GT.png" width="230" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Thirdly, a dinucleotide, by definition, is single-standed. The two H-bonds, plus the covalent linkage, makes the GpU platform extremely rigid (see&amp;nbsp;&lt;a href="http://nar.oxfordjournals.org/content/38/14/4868.full"&gt;Figure 1 of our 2010 NAR paper&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;Moreover, the GpU platform is directional: swapping the two bases while &lt;i&gt;keeping the sugar-phosphate backbone fixed&lt;/i&gt; does not allow for a base-base H-bond, thus no UpG dinucleotide platform.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/-_MXLoZuUqOg/Th-wfWxTbGI/AAAAAAAAAG8/3Hco3C1FdQA/s1600/lux_UG.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://1.bp.blogspot.com/-_MXLoZuUqOg/Th-wfWxTbGI/AAAAAAAAAG8/3Hco3C1FdQA/s320/lux_UG.png" width="230" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5733618139724595966?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5733618139724595966/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/07/gpu-dinucleotide-platform-smallest-unit.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5733618139724595966'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5733618139724595966'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/07/gpu-dinucleotide-platform-smallest-unit.html' title='GpU dinucleotide platform, the smallest unit with key RNA structural features'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/-ufXG8FOlpdk/Th-qUP2FTrI/AAAAAAAAAGw/-EvyEfl1ryQ/s72-c/lux_GU.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-9043189057635833215</id><published>2011-06-26T00:06:00.000-04:00</published><updated>2011-06-26T00:06:51.611-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>PDB v3.3 and partial atom occupancy</title><content type='html'>From the PDB mailing list, I know of the recent announcement "&lt;a href="http://www.wwpdb.org/news/news_2011.html#17-June-2011"&gt;PDB Archive Version 4.0 to be Released July 13, 2011&lt;/a&gt;". This "ambitious" review of the PDB archive has resulted in a new set of corrected files in ten categories, including biological assemblies, residual B factors, peptide inhibitors/antibiotics, polymers containing nonstandard polymer linkages, and partial occupancy etc. "These data reflect the wwPDB's continuing commitment to providing accurate and detailed data to users worldwide."&lt;br /&gt;&lt;br /&gt;I am interested in changes in the PDB format, and read the PDF document "&lt;a href="http://www.wwpdb.org/documentation/2011remediation_overview-061711.pdf"&gt;Description of Changes and Corrections for PDB July 2011 Remediation Release&lt;/a&gt;":&lt;br /&gt;&lt;blockquote&gt;PDB format files updated in this remediation release comply with PDB Format Version 3.30. PDBx and PDBML data files comply with the PDB Exchange Dictionary v.4.0, and PDBML XML Schema V4.0, respectively.&lt;/blockquote&gt;Specifically, I checked carefully the section on "Partial occupancy", which is quoted in full below:&lt;br /&gt;&lt;blockquote style="background-color: #eeeeee; border: 10px; margin: 0.5em;"&gt;&lt;u&gt;Problem&lt;/u&gt;&lt;br /&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: red;"&gt;In the 2009 remediation, occupancies were corrected in 490 X-ray&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: red;"&gt;and neutron entries. A mistake was made in 104 of these entries&lt;/span&gt;:&lt;br /&gt;for atoms with alternate conformer labels and with summed total&lt;br /&gt;occupancy less than 1.0, the occupancies were re-scaled as 1.0/n,&lt;br /&gt;where n is the number of conformers.&lt;br /&gt;&lt;br /&gt;&lt;u&gt;Approach&lt;/u&gt;&lt;br /&gt;&lt;br /&gt;The originally deposited occupancies of the affected atoms were&lt;br /&gt;restored and the remediation was then carried out properly, via: &lt;br /&gt;&lt;ul&gt;&lt;li&gt;Atoms with multiple conformations but identical coordinates  and B-values were merged and their occupancies were summed.&lt;/li&gt;&lt;li&gt;Atoms which now have (total) occupancies &amp;lt;= 1.0 were left  as deposited.&lt;/li&gt;&lt;li&gt;Atoms with (total) occupancies &amp;gt; 1 were rescaled  proportionally to a sum of 1.0&lt;/li&gt;&lt;/ul&gt;&lt;u&gt;Results&lt;/u&gt;&lt;br /&gt;&lt;br /&gt;The occupancies have been corrected in these entries.&lt;/blockquote&gt;&lt;br /&gt;This partial occupancy issue reminds me of an extensive and very informative discussion a few months back in CCP4BB, under the thread "&lt;a href="https://www.jiscmail.ac.uk/cgi-bin/webadmin?A1=ind1103&amp;amp;L=CCP4BB#160"&gt;what to do with disordered side chains&lt;/a&gt;" and its derivatives, about setting "zero" occupancy and/or high B values for PDB&amp;nbsp;ATOM/HETATM records in&amp;nbsp;disordered regions. Over the course of the discussion, &lt;a href="https://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind1104&amp;amp;L=CCP4BB&amp;amp;P=R17286"&gt;Frances Bernstein made the following comment&lt;/a&gt;:&lt;br /&gt;&lt;blockquote&gt;I am absolutely positive that there is software that does its voodoo on ATOM/HETATM records and pays absolutely no attention to anything beyond the x, y, z coordinates (i.e. beyond column 54).&lt;/blockquote&gt;3DNA does pay attention beyond column 54 (up to 80, actually) for ATOM/HETATM records, but internally it does not make use of the occupancy info. In future releases of 3DNA, I am planning to take occupancy/B-factor into consideration, probably through configurable parameters.&lt;br /&gt;&lt;br /&gt;Reading through the "&lt;a href="http://www.wwpdb.org/documentation/2011remediation_overview-061711.pdf"&gt;Description of Changes and Corrections for PDB July 2011 Remediation Release&lt;/a&gt;", I noticed cases of re-corrections of previous corrections in wwPDB remediation efforts. A concrete example is about&amp;nbsp;partial occupancy in&amp;nbsp;the 2009 remediation: among the 490 corrected X-ray and neutron entries, a mistake was made in 104 of them. As a side note, there was a post by&amp;nbsp;Morten Kjeldgaard, titled&amp;nbsp;"&lt;a href="https://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind1101&amp;amp;L=CCP4BB&amp;amp;P=R117859"&gt;PDB changes data in entries?&lt;/a&gt;" early this year in CCP4BB, where partial occupancy was used as an example.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-9043189057635833215?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/9043189057635833215/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/06/pdb-v33-and-partial-atom-occupancy.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/9043189057635833215'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/9043189057635833215'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/06/pdb-v33-and-partial-atom-occupancy.html' title='PDB v3.3 and partial atom occupancy'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2163424566079299787</id><published>2011-06-19T22:30:00.002-04:00</published><updated>2011-06-19T23:24:46.895-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Sugar pucker correlates with phosphorus-related distance</title><content type='html'>The sugar puckers in DNA/RNA structures are predominately in either C3'-endo or C2'-endo (see Figure below), corresponding to the A- or B-form conformation in a DNA duplex.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://www.chem.ucsb.edu/~jaegergroup/oldpage/images/lg_sugar_pucker.jpg" imageanchor="1" style="margin-left: 1em 0.35em;"&gt;&lt;img border="0" src="http://www.chem.ucsb.edu/~jaegergroup/oldpage/images/lg_sugar_pucker.jpg" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Recently, I (re-)read a few articles&amp;nbsp;related to the RNA backbone&amp;nbsp;by Jane Richardson &lt;i&gt;et al&lt;/i&gt;., including&lt;br /&gt;&lt;ul&gt;&lt;li&gt;"&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/14612579"&gt;RNA backbone is rotameric&lt;/a&gt;" (&lt;i&gt;PNAS&lt;/i&gt; 2003)&lt;/li&gt;&lt;li&gt;"&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/18192612"&gt;RNA backbone: consensus all-angle conformers and modular string nomenclature&lt;/a&gt;" (&lt;i&gt;RNA&lt;/i&gt; 2008)&lt;/li&gt;&lt;li&gt;"&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20057044"&gt;MolProbity: all-atom structure validation for macromolecular crystallography&lt;/a&gt;" (&lt;i&gt;Acta Crystallogr D Biol Crystallogr.&lt;/i&gt; 2010)&lt;/li&gt;&lt;li&gt;"&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20124702"&gt;PHENIX: a comprehensive Python-based system for macromolecular structure solution&lt;/a&gt;" (&lt;i&gt;Acta Crystallogr D Biol Crystallogr.&lt;/i&gt; 2010)&lt;/li&gt;&lt;/ul&gt;I somehow became interested in the correlation between sugar pucker and a simple distance parameter, as reported in these papers:&lt;br /&gt;&lt;blockquote&gt;C3'-endo and C2'-endo sugar puckers are highly correlated to the perpendicular distance between the C1'–N1/9 glycosidic bond vector and the following phosphate: &amp;gt; 2.9 Å for C3'-endo and &amp;lt; 2.9 Å for C2'-endo. (p.16 of the &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20057044"&gt;MolProbity&lt;/a&gt; paper)&lt;/blockquote&gt;&lt;br /&gt;Out of curiosity and to get a better understanding of this correlation, I played around with some sample cases both visually in RasMol and numerically. Overall, this is a simple geometric problem, i.e., the shortest distance from a point to a line in 3-dimensional space. Given below is the Octave/Matlab script for calculating the distances for G176 and U176 of PDB entry &lt;a href="http://www.pdb.org/pdb/explore/explore.do?structureId=1JJ2"&gt;1JJ2&lt;/a&gt; (the large ribosomal subunit of&amp;nbsp;&lt;i&gt;Haloarcula marismortui&lt;/i&gt;):&lt;br /&gt;&lt;pre style="background: #eee; margin: 0.5em; font-size: smaller;"&gt;function d = get_p3_nc_dist(P3, C1, N)&lt;br /&gt;    N_C1 = N - C1;                 # vector from N to C1'&lt;br /&gt;    nv_N_C1 = N_C1 / norm(N_C1);   # normalized vector&lt;br /&gt;    C1_P3 = P3 - C1;               # vector from C1 to P3&lt;br /&gt;    proj = dot(C1_P3, nv_N_C1);&lt;br /&gt;    d  = norm(C1_P3 - proj * nv_N_C1);&lt;br /&gt;end&lt;br /&gt;&lt;br /&gt;## G175 (1jj2)&lt;br /&gt;P3 = [70.104 112.366  44.586];&lt;br /&gt;C1 = [73.017 109.666  45.304];&lt;br /&gt;N = [74.445 109.380  45.288];&lt;br /&gt;d1 = get_p3_nc_dist(P3, C1, N)    # &lt;span class="Apple-style-span" style="color: red;"&gt;2.2 Å -- C2'-endo&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;## U176 (1jj2)&lt;br /&gt;P3 = [66.871 116.402  46.804];&lt;br /&gt;C1 = [68.213 112.454  49.279];&lt;br /&gt;N = [69.678 112.480  49.438];&lt;br /&gt;d2 = get_p3_nc_dist(P3, C1, N)    # &lt;span class="Apple-style-span" style="color: red;"&gt;4.6 Å -- C3'-endo&lt;/span&gt;&lt;br /&gt;&lt;/pre&gt;&lt;a href="http://xiang-jun.blogspot.com/2010/04/whats-special-about-gpu-dinucleotide.html"&gt;The GpU used in the above example forms a dinucleotide platform&lt;/a&gt;, where the sugar of G175 adopts a C2'-endo conformation, and that of U176 has C3'-endo. Indeed, the distance for the G175 nucleotide is 2.2 Å, less than 2.9 Å; whilst the value for U176 is 4.6 Å, greater than 2.9 Å.&lt;br /&gt;&lt;br /&gt;It is worth noting the above mentioned articles from Richardson &lt;i&gt;et al&lt;/i&gt;. are focused on RNA backbone, without paying attention to the base (pair) geometry. The Zp parameter, which quantifies the &lt;i&gt;z&lt;/i&gt;-coordinate of the phosphorus atom in the mean reference frame (see "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/10891271"&gt;A-form conformational motifs in ligand-bound DNA structures&lt;/a&gt;", &lt;i&gt;JMB&lt;/i&gt; 2000), can be easily adapted to the analysis of single stranded RNA structures. For example, the vertical distances of the 3' phosphorus atoms to the G175 and U176 base planes are 1.9 Å and 4.4 Å, respectively. &lt;br /&gt;&lt;br /&gt;Since base planes and the phosphorus atoms are the most accurately located entities in a given nucleic acid structure, the nucleotide-based Zp variant presumably would have some advantage over the distance from phosphorus to the glycosidic bond. Naturally, this Zp parameter will be added in future releases of 3DNA.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2163424566079299787?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2163424566079299787/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/06/sugar-pucker-correlates-with-phosphorus.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2163424566079299787'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2163424566079299787'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/06/sugar-pucker-correlates-with-phosphorus.html' title='Sugar pucker correlates with phosphorus-related distance'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-1221229463625068124</id><published>2011-06-11T00:10:00.000-04:00</published><updated>2011-06-11T00:10:36.661-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Conformation of the sugar ring in nucleic acid structures</title><content type='html'>The conformation of the five-membered sugar ring in DNA/RNA structure can be characterized using the five corresponding endocyclic torsion angles (see Figure below).&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://www.imb-jena.de/Piet/help/img.backbone/backbone-torsion.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="362" src="http://www.imb-jena.de/Piet/help/img.backbone/backbone-torsion.jpg" width="472" /&gt;&lt;/a&gt;&lt;/div&gt;i.e.,&lt;br /&gt;&lt;pre style="background-color: #eeeeee; margin: 10px 2em;"&gt;v0: C4'-O4'-C1'-C2'&lt;br /&gt;v1: O4'-C1'-C2'-C3'&lt;br /&gt;v2: C1'-C2'-C3'-C4'&lt;br /&gt;v3: C2'-C3'-C4'-O4'&lt;br /&gt;v4: C3'-C4'-O4'-C1'&lt;/pre&gt;Due to the ring constraint, the conformation can be characterized approximately by &lt;tt&gt;5 - 3 = 2&lt;/tt&gt; parameters. Using the concept of pseudorotation of the sugar ring, the two parameters are the amplitude (&lt;b&gt;τ&lt;/b&gt;&lt;sub&gt;&lt;b&gt;m&lt;/b&gt;&lt;/sub&gt;) and phase angle (&lt;i&gt;&lt;b&gt;P&lt;/b&gt;&lt;/i&gt;). &lt;br /&gt;&lt;br /&gt;One set of widely used formula to convert the five torsion angles to the pseudorotation parameters is due to Altona &amp;amp; Sundaralingam:&amp;nbsp;"Conformational Analysis of the Sugar Ring in Nucleosides and Nucleotides. A New Description Using the Concept of Pseudorotation" [&lt;i&gt;J. Am. Chem. Soc.&lt;/i&gt;, &lt;u&gt;1972&lt;/u&gt;, &lt;b&gt;94&lt;/b&gt;(23), pp 8205–8212]. As always, the concept is best illustrated with an example. Here I use the sugar ring of G4 (chain A) of the Dickerson-Drew dodecamer (&lt;a href="http://www.pdb.org/pdb/explore/explore.do?structureId=1BNA"&gt;1bna&lt;/a&gt;/&lt;a href="http://ndbserver.rutgers.edu/atlas/xray/structures/B/bdl001/bdl001.html"&gt;bdl001&lt;/a&gt;), with Matlab/Octave code:&lt;br /&gt;&lt;pre style="background-color: #eeeeee; font-size: smaller; margin: 12px;"&gt;# xyz coordinates of the sugar ring: G4 (chain A), 1bna/bdl001&lt;br /&gt;ATOM     63  C4'  DG A   4      21.393  16.960  18.505  1.00 53.00&lt;br /&gt;ATOM     64  O4'  DG A   4      20.353  17.952  18.496  1.00 38.79&lt;br /&gt;ATOM     65  C3'  DG A   4      21.264  16.229  17.176  1.00 56.72&lt;br /&gt;ATOM     67  C2'  DG A   4      20.793  17.368  16.288  1.00 40.81&lt;br /&gt;ATOM     68  C1'  DG A   4      19.716  17.901  17.218  1.00 30.52&lt;br /&gt;&lt;br /&gt;# endocyclic torsion angles:&lt;br /&gt;v0 = -26.7; v1 = 46.3; v2 = -47.1; v3 = 33.4; v4 = -4.4&lt;br /&gt;Pconst = sin(pi/5) + sin(pi/2.5)  &lt;span class="Apple-style-span" style="color: red;"&gt;# 1.5388&lt;/span&gt;&lt;br /&gt;P0 = atan2(v4 + v1 - v3 - v0, 2.0 * v2 * Pconst);  &lt;span class="Apple-style-span" style="color: red;"&gt;# 2.9034&lt;/span&gt;&lt;br /&gt;tm = v2 / cos(P0);  # &lt;span class="Apple-style-span" style="color: red;"&gt;amplitude: 48.469&lt;/span&gt;&lt;br /&gt;P = 180/pi * P0;  # &lt;span class="Apple-style-span" style="color: red;"&gt;phase angle: 166.35&lt;/span&gt; [P + 360 if P0 &amp;lt; 0]&lt;br /&gt;&lt;/pre&gt;The&amp;nbsp;Altona &amp;amp; Sundaralingam (1972)&amp;nbsp;pseudorotation parameters are what have been adopted in 3DNA.&amp;nbsp;The &lt;a href="http://nar.oxfordjournals.org/content/37/17/5917.full"&gt;Curves+ program&lt;/a&gt;, however, uses another set of formula due to Westhof &amp;amp; Sundaralingam: "A Method for the Analysis of Puckering Disorder in Five-Membered Rings: The Relative Mobilities of Furanose and Proline Rings and Their Effects on Polynucleotide and Polypeptide Backbone Flexibility" [&lt;i&gt;J. Am. Chem. Soc.&lt;/i&gt;, &lt;u&gt;1983&lt;/u&gt;, &lt;b&gt;105&lt;/b&gt;(4), pp 970–976]. The two sets of formula by&amp;nbsp;Altona &amp;amp; Sundaralingam (1972) and&amp;nbsp;Westhof &amp;amp; Sundaralingam (1983) give slightly different numerical values for the two&amp;nbsp;pseudorotation parameters &amp;nbsp;(amplitude&amp;nbsp;&lt;b&gt;τ&lt;/b&gt;&lt;sub&gt;&lt;b&gt;m&amp;nbsp;&lt;/b&gt;&lt;/sub&gt;and phase angle&amp;nbsp;&lt;i&gt;&lt;b&gt;P&lt;/b&gt;&lt;/i&gt;).&lt;br /&gt;&lt;br /&gt;Since &lt;a href="http://xiang-jun.blogspot.com/2009/08/curves-vs-3dna.html"&gt;3DNA and Curves+&lt;/a&gt; are currently the most commonly used programs for&amp;nbsp;conformational analysis of&amp;nbsp;nucleic acid structures, the subtle differences in&amp;nbsp;pseudorotation parameters may cause confusions for users who use both programs. With the same G4 (chain A, 1bna) sugar ring, here is the Matlab/Octave script showing how Curve+ calculates the&amp;nbsp;pseudorotation parameters: &lt;br /&gt;&lt;pre style="background-color: #eeeeee; font-size: smaller; margin: 12px;"&gt;# xyz coordinates of sugar ring G4 (chain A, 1bna/bdl001)&lt;br /&gt;&lt;br /&gt;# endocyclic torsion angles, same as above&lt;br /&gt;v0 = -26.7; v1 = 46.3; v2 = -47.1; v3 = 33.4; v4 = -4.4&lt;br /&gt;&lt;br /&gt;v = [v2, v3, v4, v0, v1]; # reorder them into vector v[]&lt;br /&gt;A = 0; B = 0;&lt;br /&gt;for i = 1:5&lt;br /&gt;    t = 0.8 * pi * (i - 1);&lt;br /&gt;    A += v(i) * cos(t);&lt;br /&gt;    B += v(i) * sin(t);&lt;br /&gt;end&lt;br /&gt;A *= 0.4;   # -48.476&lt;br /&gt;B *= -0.4;  # 11.516&lt;br /&gt;&lt;br /&gt;tm = sqrt(A * A + B * B);  # &lt;span class="Apple-style-span" style="color: red;"&gt;49.825&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;c = A/tm; s = B/tm;&lt;br /&gt;P = atan2(s, c) * 180 / pi;  # &lt;span class="Apple-style-span" style="color: red;"&gt;166.64&lt;/span&gt;&lt;br /&gt;&lt;/pre&gt;&lt;br /&gt;For this specific example, i.e., the sugar ring G4 (chain A, 1bna/bdl001), the pseudorotation parameters as calculated by 3DNA following&amp;nbsp;Altona &amp;amp; Sundaralingam (1972)&amp;nbsp;and Curves+ following&amp;nbsp;Westhof &amp;amp; Sundaralingam (1983) are as follows:&lt;br /&gt;&lt;br /&gt;&lt;pre style="background-color: #eeeeee; font-size: smaller; margin: 12px;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;amplitude (&lt;b&gt;τ&lt;/b&gt;&lt;sub&gt;&lt;b&gt;m&lt;/b&gt;&lt;/sub&gt;)     phase angle (&lt;i&gt;&lt;b&gt;P&lt;/b&gt;&lt;/i&gt;)&lt;br /&gt;3DNA        48.469             166.35&lt;br /&gt;Curves+     49.825             166.64&lt;br /&gt;&lt;/pre&gt;Needless to say, the differences are subtle, and few people will notice/bother&amp;nbsp;at all. For those who do care about such little details, however, hopefully this post will help you understand where the differences actually come from.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-1221229463625068124?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/1221229463625068124/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/06/conformation-of-sugar-ring-in-nucleic.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1221229463625068124'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1221229463625068124'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/06/conformation-of-sugar-ring-in-nucleic.html' title='Conformation of the sugar ring in nucleic acid structures'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-870562219879303657</id><published>2011-06-05T00:38:00.000-04:00</published><updated>2011-06-05T00:38:26.953-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>Lower case chain identifiers in PDB format</title><content type='html'>First formulated in early 1970s, the &lt;a href="http://www.wwpdb.org/documentation/format32/sect9.html"&gt;PDB format&lt;/a&gt; is rigid with fixed columns for designated contents in its &lt;a href="http://www.wwpdb.org/documentation/format32/sect9.html"&gt;ATOM/HETATM records&lt;/a&gt;. Specificlly, a single column, &lt;b&gt;#22&lt;/b&gt;, is assigned for the chain identifier (id). Traditionally, the 26 upper case letters of English alphabet (A-Z), space (i.e., '&lt;tt&gt;&amp;nbsp;&lt;/tt&gt;'), and the single digits (0-9) have been used as chain ids. Up until the ribosomal structures came up, I guess, those 26 + 1 + 10 = 37 characters had been sufficient for the chain ids.&lt;br /&gt;&lt;br /&gt;To the best of my knowledge, for&amp;nbsp;a long time, most PDB parsers assume upper case chain ids.&amp;nbsp;Indeed, 3DNA v1.5 automatically converts each ATOM/HETATM records to upper cases. The first time I became aware of lower case chain ids was when I saw a post in the 3DNA forum, titled "&lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=1&amp;amp;t=82"&gt;Small bug in find_pair&lt;/a&gt;", where a user reported the '&lt;b&gt;w&lt;/b&gt;' vs '&lt;b&gt;W&lt;/b&gt;' chain ids in PDB entry &lt;b&gt;1VSP&lt;/b&gt;. Then I refined 3DNA so that the case of chain ids can be preserved, through an &lt;i&gt;undocumented&lt;/i&gt; command line option (as a feature for internal testing purpose).&lt;br /&gt;&lt;br /&gt;My view to make 3DNA chain ids case-sensitive has been reinforced when I read the article "&lt;a href="http://nar.oxfordjournals.org/content/early/2011/05/19/nar.gkr368.full"&gt;Crystal structures of CGG RNA repeats with implications for fragile X-associated tremor ataxia syndrome&lt;/a&gt;" recently published in &lt;i&gt;Nucleic Acids Research&lt;/i&gt;. The &lt;i&gt;asymmetric unit&lt;/i&gt; of the unmodified CGG-repeats-containing duplex (G&lt;u&gt;CGGCGG&lt;/u&gt;C)&lt;sub&gt;2&lt;/sub&gt;, NDB entry &lt;a href="http://ndbserver.rutgers.edu/atlas/xray/structures/N/na1017/NA1017-asym.html"&gt;NA1017&lt;/a&gt; / PDB entry &lt;a href="http://www.rcsb.org/pdb/explore/explore.do?structureId=3R1C"&gt;3R1C&lt;/a&gt;, contains a total of &lt;b&gt;&lt;span class="Apple-style-span" style="color: red;"&gt;36&lt;/span&gt;&lt;/b&gt; chains: designated as A-Z, plus a-j. Without distinguishing cases of the chain ids, the 3DNA output would become quite confusing.&lt;br /&gt;&lt;br /&gt;Thus, in future releases of 3DNA, the default would be switched to preserve the case of chain ids. This chain id 'case' serves as an excellent example that&amp;nbsp;scientific&amp;nbsp;software products, unlike publications per se, are &lt;i&gt;not fixed&lt;/i&gt; but need continuous care and maintenance to meet the challenges of an evolving world.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-870562219879303657?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/870562219879303657/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/06/lower-case-chain-identifiers-in-pdb.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/870562219879303657'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/870562219879303657'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/06/lower-case-chain-identifiers-in-pdb.html' title='Lower case chain identifiers in PDB format'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2959519377328806070</id><published>2011-05-22T16:01:00.000-04:00</published><updated>2011-05-22T16:01:07.531-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>NAR's top ten articles</title><content type='html'>Recently, I noticed a new feature in the website of&amp;nbsp;&lt;i&gt;Nucleic Acids Research&lt;/i&gt; (NAR), i.e., its selection of top ten articles:&lt;br /&gt;&lt;blockquote&gt;NAR’s Top Ten Articles are updated monthly and show recent articles that have been most often accessed in HTML and PDF formats in the specified month.&lt;/blockquote&gt;In the age of information explosion with flood of scientific journals and articles, it is easy get lost. NAR's pick of &lt;a href="http://www.oxfordjournals.org/our_journals/nar/top_ten_articles.html"&gt;top ten&lt;/a&gt; and &lt;a href="http://www.oxfordjournals.org/our_journals/nar/featured_articles.html"&gt;featured articles&lt;/a&gt;&amp;nbsp;draws my attention to significant work I may otherwise overlook.&lt;br /&gt;&lt;br /&gt;The current top ten articles (March 2011) are all selected from 2009/2010 publications in 'Database', 'Methods online', and 'Survey and Summary'. I am browsing the 2009 article by Thomas LaFramboise, "&lt;a href="http://nar.oxfordjournals.org/content/37/13/4181.full"&gt;Single nucleotide polymorphism arrays: a decade of biological, computational and technological advances&lt;/a&gt;", to get a better understanding of SNPs.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2959519377328806070?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2959519377328806070/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/05/nars-top-ten-articles.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2959519377328806070'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2959519377328806070'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/05/nars-top-ten-articles.html' title='NAR&apos;s top ten articles'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-712428787410364740</id><published>2011-05-15T19:18:00.000-04:00</published><updated>2011-05-15T19:18:57.071-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Posts in the 3DNA forum reach 600</title><content type='html'>As of May 6, the total number of posts in the &lt;a href="http://3dna.rutgers.edu:8080/forum/"&gt;3DNA forum&lt;/a&gt; has reached 600. Created in March 2007, with my debut post titled &lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=3&amp;amp;t=31"&gt;"Welcome message from Xiang-Jun Lu"&lt;/a&gt;, the forum is now over four years old. Overall, the forum has served its purpose pretty well. In answering questions, I've been increasingly referring to the posts in the forum. As a concrete example, see the thread of a recent question "&lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=1&amp;amp;t=208"&gt;Base pair step parameters with a missing base pair&lt;/a&gt;".&lt;br /&gt;&lt;br /&gt;At less than three posts (about one question) per week on average, I've not felt too much stress in &lt;a href="http://xiang-jun.blogspot.com/2009/07/on-maintaining-3dna-forum.html"&gt;supporting the forum&lt;/a&gt; (and maintaining 3DNA) in my spare time. For the most part, I've enjoyed interacting with 3DNA users from everywhere in the world, and with diverse backgrounds. Following the &lt;a href="http://en.wikipedia.org/wiki/Unix_philosophy"&gt;Unix philosophy&lt;/a&gt; ("&lt;i&gt;Write programs that do one thing and do it well. Write programs to work together.&lt;/i&gt;"), 3DNA has proved to be robust and flexible in serving its ever-growing user community. As a matter of fact, few questions I received a couple of years ago were beyond my original consideration of the details while I wrote the code. It is this intimated knowledge of all the underlying algorithms and every bit of their implementations that allows me to answer users' questions quickly and concretely. &lt;br /&gt;&lt;br /&gt;As time passes by, however, it has become evident to me that 3DNA needs to be further refined and extended to meet the ever changing needs of its user community. For example, over the past few months, several questions asked in the 3DNA forum are directly relevant but clearly beyond 3DNA's current capabilities. While I'd be interested in implementing some of the requested functionality that make sense to me, doing so is certainly over my spare time limit. On the other hand, my increased understanding of nucleic acid structures and accumulated software expertise make it simply an issue of time and effort to move 3DNA to the next level, far beyond its current application scope and impact.&lt;br /&gt;&lt;br /&gt;With posts in the 3DNA forum reaching 600, and &lt;a href="http://scholar.google.com/scholar?hl=en&amp;amp;lr=&amp;amp;q=allintitle%3A3dna+author%3A%22XJ+Lu%22&amp;amp;btnG=Search"&gt;citations to 3DNA articles over 600&lt;/a&gt; (Google scholar), I am hopeful something good will happen to the 3DNA project. After all, 6 is a lucky number in traditional Chinese culture.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-712428787410364740?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/712428787410364740/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/05/posts-in-3dna-forum-reach-600.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/712428787410364740'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/712428787410364740'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/05/posts-in-3dna-forum-reach-600.html' title='Posts in the 3DNA forum reach 600'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-6889692214812899537</id><published>2011-05-15T18:38:00.002-04:00</published><updated>2011-05-15T19:17:30.847-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Fifty years of operon</title><content type='html'>In the latest issue of &lt;i&gt;Science&lt;/i&gt;, there is a one-page editorial titled "&lt;a href="http://www.sciencemag.org/content/332/6031/767.summary"&gt;The Birth of the Operon&lt;/a&gt;" by François Jacob, who won the Nobel Prize in Physiology or Medicine in 1965:&lt;br /&gt;&lt;blockquote&gt;What is the operon, whose 50th anniversary is being celebrated this week? The word heralded the discovery of how genes are turned on and off, and it launched the now-immense field of gene regulation. ... &lt;i&gt;&lt;b&gt;we cannot presume to know how new ideas will arise and where scientific research will lead&lt;/b&gt;&lt;/i&gt;.&lt;/blockquote&gt;In the next three paragraphs, Jacob provides an insightful and vivid description of his research related to the discovery of the "operon" – a structural gene-regulatory gene ensemble. In consonant with &lt;i&gt;&lt;b&gt;his comment on scientific discovery&lt;/b&gt;&lt;/i&gt;, he concludes:&lt;br /&gt;&lt;blockquote&gt;Our breakthrough was the result of “night science”: a stumbling, wandering exploration of the natural world that relies on intuition as much as it does on the cold, orderly logic of “day science.” In today’s vastly expanded scientific enterprise, obsessed with impact factors and competition, we will need much more night science to unveil the many mysteries that remain about the workings of organisms.&lt;/blockquote&gt;It is worth noting that the &lt;i&gt;&lt;a href="http://www.sciencedirect.com/science/journal/00222836"&gt;Journal of Molecular Biology&lt;/a&gt;&lt;/i&gt; (JMB) has recently published a special issue [&lt;b&gt;Volume&lt;/b&gt; 409, Issue 1, Pages 1-88 (27 May 2011)], titled "The Operon Model and its Impact on Modern Molecular Biology" with historical accounts and reviews to celebrate operon's 50th anniversary. It is because of this event that motivated me to read the Jacob and Monod 1961 JMB review article "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/13718526"&gt;Genetic regulatory mechanisms in the synthesis of proteins&lt;/a&gt;" – I have come across this paper so many times before, and should have definitely read it long ago!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-6889692214812899537?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/6889692214812899537/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/05/fifty-years-of-operon.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6889692214812899537'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6889692214812899537'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/05/fifty-years-of-operon.html' title='Fifty years of operon'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-1986985487931713514</id><published>2011-05-15T17:34:00.000-04:00</published><updated>2011-05-15T17:34:49.220-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Curves+ web server</title><content type='html'>Through Google Scholar, I become aware of the article online in &lt;i&gt;Nucleic Acids Research (NAR)&lt;/i&gt;, titled "&lt;a href="http://nar.oxfordjournals.org/content/early/2011/05/10/nar.gkr316.full"&gt;CURVES+ web server for analyzing and visualizing the helical, backbone and groove parameters of nucleic acid structures&lt;/a&gt;" by Richard Lavery's group:&lt;br /&gt;&lt;blockquote&gt;Curves+, a revised version of the Curves software for analyzing the conformation of nucleic acid structures, is now available as a web server. This version, which can be freely accessed at http://gbio-pbil.ibcp.fr/cgi/Curves_plus/, allows the user to upload a nucleic acid structure file, choose the nucleotides to be analyzed and after optionally setting a number of input variables, view the numerical and graphic results online or download files containing a set of helical, backbone and groove parameters that fully describe the structure. PDB format files are also provided for offline visualization of the helical axis and groove geometry.&lt;/blockquote&gt;The website looks quite streamlined, with required input information all in a single page, and the test page also ran smoothly. In less than two years following the publication of &lt;a href="http://nar.oxfordjournals.org/content/37/17/5917.full"&gt;Curves+&lt;/a&gt;, it is nice to see the Curves+ web server version available, making this analysis tool more readily available to the nucleic acids community.&lt;br /&gt;&lt;br /&gt;Nowadays, it seems safe (to the best of my knowledge) to say that only &lt;a href="http://xiang-jun.blogspot.com/2009/08/curves-vs-3dna.html"&gt;3DNA and Curves+&lt;/a&gt; conform to the 1999 &lt;a href="http://www-ibmc.u-strasbg.fr/upr9002/westhof/PDF/2001_WOlson_JMB.pdf"&gt;Tsukuba convention&lt;/a&gt; for the description of nucleic acid base-pair geometry, and each of them provides a web interface: &lt;a href="http://nar.oxfordjournals.org/content/37/suppl_2/W240.short"&gt;web 3DNA&lt;/a&gt; and &lt;a href="http://nar.oxfordjournals.org/content/early/2011/05/10/nar.gkr316.full"&gt;web Curves+&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-1986985487931713514?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/1986985487931713514/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/05/curves-web-server.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1986985487931713514'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1986985487931713514'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/05/curves-web-server.html' title='Curves+ web server'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-841280353144528705</id><published>2011-05-01T23:49:00.002-04:00</published><updated>2011-05-18T00:05:41.264-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Scientific journals on nucleic acids</title><content type='html'>In my knowledge, &lt;i&gt;&lt;a href="http://nar.oxfordjournals.org/"&gt;Nucleic Acids Research&lt;/a&gt;&lt;/i&gt; (NAR) is a highly respected scientific journal with a broad impact in the field of nucleic acids. Over the years, I have been browsing NAR webpage on a regular basis to keep myself up to date to the latest development in this area. It is thus no surprise that the initial &lt;a href="http://nar.oxfordjournals.org/content/31/17/5108.full"&gt;3DNA paper was submitted to and published in NAR in 2003&lt;/a&gt;. Among the &lt;a href="http://xiang-jun.blogspot.com/2011/03/3dna-citations-reach-over-500.html"&gt;500+ citations to that 3DNA paper&lt;/a&gt;, over 1/5 (100+) articles are from NAR itself (as an example, please see my January 22, 2011 blog post titled "&lt;a href="http://xiang-jun.blogspot.com/2011/01/three-structural-biology-papers-latest.html"&gt;Three structural biology papers in the latest issue of NAR cite 3DNA&lt;/a&gt;"). My latest contribution to NAR is &lt;a href="http://xiang-jun.blogspot.com/2010/04/whats-special-about-gpu-dinucleotide.html"&gt;the GpU story&lt;/a&gt;, which was actually selected as a featured article.&lt;br /&gt;&lt;br /&gt;Another related journal I am quite familiar with is &lt;a href="http://rnajournal.cshlp.org/"&gt;&lt;i&gt;RNA&lt;/i&gt;&lt;/a&gt;, a publication of the RNA society. As the &lt;a href="http://rnajournal.cshlp.org/site/misc/about.xhtml"&gt;"About" section&lt;/a&gt; of its webpage succinctly summarizes,&lt;br /&gt;&lt;blockquote&gt;&lt;i&gt;RNA&lt;/i&gt; serves as an international forum for publishing original reports on RNA research in the broadest sense. The journal aims to unify this field by cutting across established disciplinary lines and focusing on "RNA-centered" science.&lt;/blockquote&gt;&lt;i&gt;RNA&lt;/i&gt; currently has an impact factor (IF) of 5.198 (2009), slightly lower than NAR's 7.479. It is, nevertheless, a very decent journal in RNA-related research, and I frequently visit its website. As a side note, &lt;a href="http://xiang-jun.blogspot.com/2010/04/whats-special-about-gpu-dinucleotide.html"&gt;the GpU paper&lt;/a&gt; was initially submitted to &lt;i&gt;RNA&lt;/i&gt; for its RNA-specific content and as a way to diversify my publication spectrum (as mentioned above, 3DNA was initially published in NAR). Unfortunately, the GpU paper was rejected by the &lt;i&gt;RNA&lt;/i&gt; journal after two rounds of review, spanning over 6 months.&lt;br /&gt;&lt;br /&gt;Another journal closely related to &lt;i&gt;RNA&lt;/i&gt; (name wise) is called &lt;i&gt;&lt;a href="http://www.landesbioscience.com/journals/rnabiology/"&gt;RNA Biology&lt;/a&gt;&lt;/i&gt;, which even has a slightly higher IF of 5.56. Admittedly, I was not familiar with this journal at all. Browsing through its website, I am interested in seeing the journal's explicit policy to reconsider papers "rejected by high impact journals [CNS] for reasons of novelty and impact, rather than the importance of the study or the integrity of the data." By enclosing "the reviewers’ and/or editorial comments" from these high impact journals, "it is possible the article might be accepted [by &lt;i&gt;&lt;a href="http://www.landesbioscience.com/journals/rnabiology/"&gt;RNA Biology&lt;/a&gt;&lt;/i&gt;] based on its previous review. This will allow the urgent and competitive research to be published on the day of submission."&lt;br /&gt;&lt;br /&gt;I became aware of the journal &lt;a href="http://www.oxfordjournals.org/our_journals/dnares/about.html"&gt;&lt;i&gt;DNA Research&lt;/i&gt;&lt;/a&gt; quite recently through an email. From its website, "&lt;i&gt;DNA Research&lt;/i&gt; is an internationally peer-reviewed journal which aims at publishing papers of highest quality in broad aspects of DNA and genome-related research." The journal currently has an IF of 4.917. Browsing a couple of its online issues, I sense that the journal is more on genome- than structure-related research.&lt;br /&gt;&lt;br /&gt;While following up 3DNA citations recently, I noticed the paper titled "&lt;a href="http://www.sage-hindawi.com/journals/jna/2010/107289/"&gt;Insights into the Structures of DNA Damaged by Hydroxyl Radical: Crystal Structures of DNA Duplexes Containing 5-Formyluracil&lt;/a&gt;" by Tsunoda and Taknaka. It was published in the &lt;a href="http://www.sage-hindawi.com/journals/jna/"&gt;&lt;i&gt;Journal of Nucleic Acids&lt;/i&gt;&lt;/a&gt;, which I have never (but probably should have) heard of before. From its website, "&lt;i&gt;Journal of Nucleic Acids&lt;/i&gt; is a peer-reviewed, open access journal that publishes original research articles as well as review articles in all areas of nucleic acids." By virtue of this structure paper and its citation to 3DNA, I think the journal is surely of personal interest, and I have added it into my watch-list.&lt;br /&gt;&lt;br /&gt;To sum up, there are currently four scientific journals (I know of) that are devoted to nucleic acids:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;&lt;a href="http://nar.oxfordjournals.org/"&gt;Nucleic Acids Research&lt;/a&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;a href="http://rnajournal.cshlp.org/"&gt;RNA&lt;/a&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;a href="http://www.landesbioscience.com/journals/rnabiology/"&gt;RNA Biology&lt;/a&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.oxfordjournals.org/our_journals/dnares/about.html"&gt;&lt;i&gt;DNA Research&lt;/i&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.sage-hindawi.com/journals/jna/"&gt;Journal of Nucleic Acids&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;Do I still miss something? Please make your suggestion in the comment area.&lt;br /&gt;&lt;br /&gt;[revised on May 17, 2011 by adding &lt;i&gt;&lt;a href="http://www.landesbioscience.com/journals/rnabiology/"&gt;RNA Biology&lt;/a&gt;&lt;/i&gt;&lt;/li&gt;]&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-841280353144528705?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/841280353144528705/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/05/scientific-journals-on-nucleic-acids.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/841280353144528705'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/841280353144528705'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/05/scientific-journals-on-nucleic-acids.html' title='Scientific journals on nucleic acids'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2387146852380459313</id><published>2011-04-23T22:10:00.005-04:00</published><updated>2011-04-24T11:39:42.836-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Ebook "Gregory Petsko in Genome Biology: The first 10 years"</title><content type='html'>Over the years, I have read some of Gregory Petsko's monthly columns in &lt;a href="http://genomebiology.com/"&gt;Genome Biology&lt;/a&gt; while browsing the journal online, and I like his sensible and entertaining columns quite a bit. Recently, I became aware of the ebook from BioMed Central, "&lt;a href="http://blogs.openaccesscentral.com/blogs/bmcblog/entry/gregory_petsko_in_genome_biology"&gt;Gregory Petsko in Genome Biology: The first 10 years&lt;/a&gt;":&lt;br /&gt;&lt;blockquote&gt;Structural biologist Gregory Petsko has contributed a thought-provoking and entertaining monthly column to the scientific journal Genome Biology every month since its launch in 2000. To mark the 10th anniversary of Genome Biology this eBook brings together 10 years of Petsko's columns.&lt;/blockquote&gt;I downloaded the &lt;b&gt;epub&lt;/b&gt; version of the book, and googled around, trying to find a corresponding ebook reader for my MacBook Pro (Snow Leopard) – even though I have some ebooks in the generic PDF format, I am not that familiar with &lt;b&gt;epub&lt;/b&gt; or &lt;b&gt;mobi&lt;/b&gt;. I finally settled with &lt;a href="http://www.barnesandnoble.com/u/free-nook-apps/379002321/"&gt;NOOK for Mac from B&amp;amp;N&lt;/a&gt;. It turns out reading ebooks with specifically-desinged apps such as NOOK is quite a different, yet more enjoyable, experience than through a PDF reader.&lt;br /&gt;&lt;br /&gt;Now the ebook has become the top one in casual reading list. I am reading it from the very beginning, one column at a time, to have a &lt;i&gt;historical&lt;/i&gt; perspective. So far I found the columns indeed very "thought-provoking and entertaining".&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2387146852380459313?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2387146852380459313/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/04/ebook-of-gregory-petsko-in-genome.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2387146852380459313'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2387146852380459313'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/04/ebook-of-gregory-petsko-in-genome.html' title='Ebook &quot;Gregory Petsko in Genome Biology: The first 10 years&quot;'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-7239922663196298638</id><published>2011-04-08T23:46:00.001-04:00</published><updated>2011-04-08T23:52:08.706-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Tips and tricks from "The Geek Stuff"</title><content type='html'>As a devoted command line user, I am always interested in learning new tricks to make my life more enjoyable. Recently, I came across Ramesh Natarajan’s blog “&lt;a href="http://www.thegeekstuff.com/"&gt;The Geek Stuff&lt;/a&gt;” which is full of “instruction guides, how-to, troubleshooting tips and tricks on Linux, database, hardware, security and web” to help solve practical problems.&lt;br /&gt;&lt;br /&gt;For example, in the section “&lt;a href="http://www.thegeekstuff.com/best-of-the-blog/"&gt;Best of the Blog&lt;/a&gt;”, I recently benefitted quite a bit by reading the following posts:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.thegeekstuff.com/2009/06/expertise-in-google-search/"&gt;15 Awesome Google Search Tips and Tricks&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.thegeekstuff.com/2008/10/6-awesome-linux-cd-command-hacks-productivity-tip3-for-geeks/"&gt;6 Awesome Linux cd command Hacks&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.thegeekstuff.com/2009/09/the-ultimate-wget-download-guide-with-15-awesome-examples/"&gt;wget Tutorial: 15 Awesome Examples to Download Files from Internet&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;There are many other helpful tips/tricks as well; since I have bookmarked the site, I will surely come back!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-7239922663196298638?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/7239922663196298638/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/04/tips-and-tricks-from-geek-stuff.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7239922663196298638'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7239922663196298638'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/04/tips-and-tricks-from-geek-stuff.html' title='Tips and tricks from &quot;The Geek Stuff&quot;'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-4876804717591273818</id><published>2011-04-03T21:11:00.000-04:00</published><updated>2011-04-03T21:11:23.915-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Scripting in Ruby is fun</title><content type='html'>Over the years, I have played around with various scripting languages, including awk, bash, Perl, Python and Ruby. By far, I have enjoyed Ruby the most; nowadays, I write scripts nearly exclusively in Ruby.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://en.wikipedia.org/wiki/Ruby_(programming_language)"&gt;Created by Yukihiro "Matz" Matsumoto&lt;/a&gt; in Japan during the mid-1990s, Ruby became popular worldwide in mid-2000s, with the Rails web application framework. Indeed, I first dug into Ruby through Rails, and by reading David Black's book "&lt;a href="http://www.manning.com/black/"&gt;Ruby for Rails; &lt;i&gt;Ruby techniques for Rails developers&lt;/i&gt;&lt;/a&gt;". As an exercise, I implemented the current &lt;a href="http://3dna.rutgers.edu/"&gt;3DNA v2.0 website&lt;/a&gt; with Rails v1.x. Then I quickly realized that the rapidly evolving Rails framework was beyond my time and interest to follow. However, I did begin to appreciate Ruby's simplicity, consistency and expressiveness. Over the past few years, I have collected over a dozen Ruby-related (e)books, including "&lt;a href="http://www.amazon.com/Well-Grounded-Rubyist-David-Black/dp/1933988657"&gt;The Well-Grounded Rubyist&lt;/a&gt;" (David Black, covering v1.9), "&lt;a href="http://www.amazon.com/Ruby-Programming-Language-David-Flanagan/dp/0596516177"&gt;The Ruby Programming Language&lt;/a&gt;" (David Flanagan and Yukihiro Matsumoto), and "&lt;a href="http://www.amazon.com/Metaprogramming-Ruby-Program-Like-Pros/dp/1934356476"&gt;Metaprogramming Ruby: &lt;i&gt;Program Like the Ruby Pros&lt;/i&gt;&lt;/a&gt;" (Paolo Perrotta). Just as &lt;a href="http://xiang-jun.blogspot.com/2009/06/ansi-c-program-memory-check-with.html"&gt;my experience with (ANSI) C&lt;/a&gt;, I feel Ruby "wears well as one's experience with it grows" (K&amp;amp;R, in the preface of "The C Programming Language"). The better I know Ruby, the more I enjoy using it.&lt;br /&gt;&lt;br /&gt;I recently wrote &lt;a href="http://xiang-jun.blogspot.com/2011/01/ruby-scripts-for-3dna-analysis-of.html"&gt;two Ruby scripts for the analysis of molecular dynamics (MD) simulation trajectories using 3DNA&lt;/a&gt;. Honestly, I would not have bothered with Perl for the task (otherwise, it would have been done long time ago), given the sideline nature of my support of 3DNA. Yet, writing and refining the Ruby scripts (with help of &lt;a href="http://git-scm.com/"&gt;git&lt;/a&gt; and &lt;a href="http://rake.rubyforge.org/"&gt;rake&lt;/a&gt;) have turned out to be a pleasant experience. Another reason why scripting in Ruby is fun is due to its large, active and friendly user community; there are many user-contributed libraries (gems) that serve well of common programming needs. As an example, in the 3DNA-MD scripts, I took advantage of the elegant &lt;a href="http://trollop.rubyforge.org/"&gt;Trollop commandline option parser&lt;/a&gt; by William Morgan. I picked Trollop among many other choices because it is self-contained in a single file, simple to use, and "gets out of your way".&lt;br /&gt;&lt;br /&gt;In the Ruby community, exciting new developments are happening all the time. Recently, I was drawn to &lt;a href="https://github.com/wycats/thor"&gt;thor&lt;/a&gt;, "a simple and efficient tool for building self-documenting command line utilities". Over the past couple of years, I have browsed &lt;a href="http://www.sinatrarb.com/"&gt;Sinatra&lt;/a&gt; and &lt;a href="http://sequel.rubyforge.org/"&gt;Sequel&lt;/a&gt; – they also look brilliant! Of course, for bioinformatics, there is the &lt;a href="http://bioruby.open-bio.org/"&gt;BioRuby project&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt;Overall, in my experience, scripting in Ruby is fun and exciting. Are you a Rubist yet?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-4876804717591273818?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/4876804717591273818/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/04/scripting-in-ruby-is-fun.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4876804717591273818'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4876804717591273818'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/04/scripting-in-ruby-is-fun.html' title='Scripting in Ruby is fun'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-1604220782567654823</id><published>2011-03-26T23:42:00.000-04:00</published><updated>2011-03-26T23:42:08.311-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>DNA fiber models ABC</title><content type='html'>Among the &lt;a href="http://xiang-jun.blogspot.com/2009/10/fiber-models-in-3dna.html"&gt;55 fiber models available in 3DNA&lt;/a&gt;, the A-, B- and C-DNA types are the most generic – they can be built with bases A, C, G and T in any combination (see table below). Moreover, in addition to the well-known Arnott fiber models (#1, #4 and #7, all from calf thymus), there are newer variants from &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/11099379"&gt;van Dam &amp;amp; Levitt (#46 and #47)&lt;/a&gt; and &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/3271412"&gt;Premilat &amp;amp; Albiser (#53 to #55)&lt;/a&gt;.&lt;br /&gt;&lt;pre style="color: #666666; font-size: 10px;"&gt;&amp;nbsp;1   32.7   2.548  A-DNA (calf thymus)&lt;br /&gt; 4   36.0   3.375  B-DNA (calf thymus)&lt;br /&gt; 7   38.6   3.310  C-DNA (calf thymus)&lt;br /&gt;46   36.0   3.38   B-DNA (BI-type nucleotides)&lt;br /&gt;47   40.0   3.32   C-DNA (BII-type nucleotides)&lt;br /&gt;53  -38.7   3.29   C-DNA (depreciated)&lt;br /&gt;54   32.73  2.56   A-DNA [cf. #1]&lt;br /&gt;55   36.0   3.39   B-DNA [cf. #4]&lt;/pre&gt;As shown in Figure 9 of the &lt;a href="http://nar.oxfordjournals.org/content/31/17/5108.full"&gt;3DNA 2003 NAR paper&lt;/a&gt; (linked below), the A-, B- and C-DNA fiber models are all right-handed regular straight helices, yet each has distinguished features.&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://nar.oxfordjournals.org/content/31/17/5108/F9.large.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="331" src="http://nar.oxfordjournals.org/content/31/17/5108/F9.large.jpg" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;While I could easily envisioned &lt;a href="http://xiang-jun.blogspot.com/2009/05/fiber-analyze-and-rebuild-in-3dna.html"&gt;possible applications of the fiber models&lt;/a&gt;, especially &lt;a href="http://xiang-jun.blogspot.com/2010/04/c2b2-annual-retreat-dna-shape-matters.html"&gt;in connection with analysis and rebuilding routines in 3DNA&lt;/a&gt;, it was still a nice surprise to see a recent article by Gossett and Harvey, titled "&lt;a href="http://pubs.acs.org/doi/abstract/10.1021/nl103665z"&gt;Computational Screening and Design of DNA-Linked Molecular Nanowires&lt;/a&gt;" [&lt;i&gt;Nano Lett&lt;/i&gt;., 2011, &lt;b&gt;11&lt;/b&gt; (2), pp 604–608]. The abstract is quoted below:&lt;br /&gt;&lt;blockquote&gt;DNA can be used as a structural component in the process of making conductive polymers called nanowires. Accurate molecular models could lead to a better understanding of how to prepare these types of materials. Here we present a computational tool that allows potential DNA-linked polymer designs to be screened and evaluated. The approach involves an iterative procedure that adjusts the positions of DNA-linked monomers in order to obtain reasonable molecular geometry compatible with normal DNA conformations and with the properties of the polymer being formed. This procedure has been used to evaluate designs already reported experimentally, as well as to suggest a new design based on pyrrylene vinylene (PV) monomers.&lt;/blockquote&gt;In the article, 3DNA (&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/19474339"&gt;the web interface version w3DNA&lt;/a&gt;) was cited as follows:&lt;br /&gt;&lt;blockquote&gt;The selection of DNA structures is important because the DNA remains fixed throughout the procedure. To reduce the risk of an incorrect result, one should choose a subset of DNA structures that are in some sense representative of DNA conformational space. The DNA structures (A-, B-, and C-form DNA) were obtained using the Web 3DNA web server. We used a poly(dG)-poly(dC) sequence with ideal geometry for each DNA structure. A-DNA was constructed with rise = 2.548 Å and twist = 32.7˚ , B-DNA was constructed with rise = 3.375 Å and twist = 36.0 ˚, and C-DNA was constructed with rise = 3.310 Å and twist = 38.6  ˚.&lt;/blockquote&gt;Indeed, this is a novel application of fiber DNA ABC models!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-1604220782567654823?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/1604220782567654823/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/03/dna-fiber-models-abc.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1604220782567654823'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1604220782567654823'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/03/dna-fiber-models-abc.html' title='DNA fiber models ABC'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-883987410231995667</id><published>2011-03-20T19:26:00.000-04:00</published><updated>2011-03-20T19:26:45.368-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>3DNA citations reach over 500</title><content type='html'>On Friday, June 5, 2009, I blogged on the topic titled "&lt;a href="http://xiang-jun.blogspot.com/2009/06/3dna-citations-reach-over-300.html"&gt;3DNA citations reach over 300&lt;/a&gt;". At that time, I wrote (towards the end):&lt;br /&gt;&lt;blockquote&gt;I still remember that the number of citations to 3DNA was less than 150 nearly two years ago [~ summer 2007], when I started to wrote the first draft of our 2008 &lt;i&gt;Nature Protocols&lt;/i&gt; paper. Now it is more than doubled! &lt;b&gt;I would blog on this topic again when the number reaches 500&lt;/b&gt;.&lt;/blockquote&gt;When I checked &lt;a href="http://scholar.google.com/scholar?hl=en&amp;amp;lr=&amp;amp;q=allintitle%3A3dna+author%3A%22XJ+Lu%22&amp;amp;btnG=Search"&gt;Google scholar for 3DNA citations&lt;/a&gt; right now, the citation number is already over 500 for the initial 2003 3DNA NAR paper alone. Combined with the two direct follow-ups – the 2008 &lt;i&gt;Nature Protocols&lt;/i&gt; paper and the 2009 NAR web server paper – the three 3DNA publications have been cited a total of 550 times.&lt;br /&gt;&lt;br /&gt;Again, as noted in that blog post,&lt;br /&gt;&lt;blockquote&gt;In my opinion, some of 3DNA features are still (heavily) underused. Now that we have a sizable user community, 3DNA could only become better and would be more widely used. I have every reason to believe that in the not-so-distant-future, the citations to 3DNA would reach over 1000.&lt;/blockquote&gt;A decade after its initial humber release, 3DNA has been successfully applied to many real-world problems. As spare time permits, I have&amp;nbsp;&lt;a href="http://xiang-jun.blogspot.com/2009/07/on-maintaining-3dna-forum.html"&gt;actively maintained and continuously refined 3DNA&lt;/a&gt; based largely on users' feedbacks. Over the time, I also see clearly that 3DNA can be moved to the &lt;i&gt;next level&lt;/i&gt; both in functionality and usability to enjoy an even larger/broader impact.&lt;br /&gt;&lt;br /&gt;Now more than half-way through, it won't be long when citations to 3DNA reach 1000, and then beyond.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-883987410231995667?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/883987410231995667/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/03/3dna-citations-reach-over-500.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/883987410231995667'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/883987410231995667'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/03/3dna-citations-reach-over-500.html' title='3DNA citations reach over 500'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5820807880845539368</id><published>2011-03-13T23:21:00.001-04:00</published><updated>2011-03-13T23:36:18.386-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>Review article on NMR analysis of protein–DNA interactions by Milon et al</title><content type='html'>Through Google scholar, I became aware of a recent review article by Milon &lt;i&gt;et al&lt;/i&gt;., titled "&lt;a href="http://rsif.royalsocietypublishing.org/content/early/2011/03/08/rsif.2010.0543.full"&gt;Nuclear magnetic resonance analysis of protein–DNA interactions&lt;/a&gt;" in the journal &lt;i&gt;J. R. Soc. Interface&lt;/i&gt;:&lt;br /&gt;&lt;blockquote&gt;This review focuses on the experimental strategies currently employed to solve structures of protein–DNA complexes and to analyse their dynamics. It highlights how these approaches can help in understanding detailed molecular mechanisms of target recognition.&lt;/blockquote&gt;I browsed through the text to get myself more familiar with NMR the methodology and its applications in protein-DNA recognition. I was surprised that 3DNA was cited in the article, especially with respect to its unique &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;analyze/rebuild&lt;/span&gt; complementarity:&lt;br /&gt;&lt;blockquote&gt;In addition, several software programs have been developed to model DNA bending such as the 3DNA program, &lt;i&gt;which allows analysis of DNA structural parameters and enables it to be rebuilt with customized DNA models&lt;/i&gt; [&lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=3&amp;t=97"&gt;76&lt;/a&gt;]. Several Web servers have been created recently and provide interesting tools to analyse and rebuild DNA models [&lt;a href="http://xiang-jun.blogspot.com/2009/07/two-web-interfaces-to-3dna.html"&gt;77,78&lt;/a&gt;].&lt;/blockquote&gt;I am only wishing that 3DNA's neat features could be more widely recognized; hopefully I'd have the opportunity to further refine 3DNA and move it to the next level.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5820807880845539368?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5820807880845539368/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/03/review-article-on-nmr-analysis-of.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5820807880845539368'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5820807880845539368'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/03/review-article-on-nmr-analysis-of.html' title='Review article on NMR analysis of protein–DNA interactions by Milon et al'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-6126983001755041165</id><published>2011-03-05T23:24:00.000-05:00</published><updated>2011-03-05T23:24:47.675-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Retraction of scientific publications</title><content type='html'>Once in a while, I come across retraction notices of scientific publications in leading journals/magazines. Even for cases not directly related to my research areas, I normally browse through them. &lt;br /&gt;&lt;br /&gt;In the March 3, 2011 issue of &lt;i&gt;Nature&lt;/i&gt;, there is a retraction of the Letter "&lt;a href="http://www.nature.com/nature/journal/v434/n7030/full/nature03356.html"&gt;Mediation of pathogen resistance by exudation of antimicrobials from roots&lt;/a&gt;" [&lt;i&gt;Nature&lt;/i&gt; &lt;b&gt;434&lt;/b&gt;, 217–221 (2005)]. I am intrigued by the first sentence of the note:&lt;br /&gt;&lt;blockquote&gt;The authors wish to retract this Letter after a key reference by Walker &lt;i&gt;et al&lt;/i&gt;. (ref. 9 in this Letter) was retracted from the scientific literature.&lt;/blockquote&gt;&lt;br /&gt;It turns out that the 2003 Walter &lt;i&gt;et al&lt;/i&gt;.  &lt;i&gt;J. Agric. Food Chem.&lt;/i&gt; paper (withdrawn in October 2009) and the 2005 &lt;i&gt;Nature&lt;/i&gt; Letter were from the same group. Overall, it took ~6 years each for the two papers to be retracted. As of today, they have been cited 76 and 84 times respectively accordingly to Google scholar.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-6126983001755041165?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/6126983001755041165/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/03/retraction-of-scientific-publications.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6126983001755041165'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6126983001755041165'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/03/retraction-of-scientific-publications.html' title='Retraction of scientific publications'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-8570409036661092573</id><published>2011-02-27T18:23:00.000-05:00</published><updated>2011-02-27T18:23:02.166-05:00</updated><title type='text'>Evidences for transient Hoogsteen base pairs in canonical DNA duplex</title><content type='html'>In the February 24, 2011 issue of &lt;i&gt;Nature&lt;/i&gt;, there is an interesting article by Nikolova &lt;i&gt;et al.&lt;/i&gt;, titled "Transient Hoogsteen base pairs in canonical duplex DNA". Its main discovery is succinctly summarized in the abstract:&lt;br /&gt;&lt;blockquote&gt;By using nuclear magnetic resonance relaxation dispersion spectroscopy in concert with steered molecular dynamics simulations, we have observed transient sequence-specific excursions away from Watson–Crick base-pairing at CA and TA steps inside canonical duplex DNA towards &lt;i&gt;low-populated&lt;/i&gt; and &lt;i&gt;short-lived&lt;/i&gt; A•T and G•C Hoogsteen base pairs. The observation of Hoogsteen base pairs in DNA duplexes specifically bound to transcription factors and in damaged DNA sites implies that the DNA double helix intrinsically codes for excited state Hoogsteen base pairs as a means of expanding its structural complexity beyond that which can be achieved based on Watson–Crick base-pairing.&lt;/blockquote&gt;Geometrically, the &lt;a href="http://xiang-jun.blogspot.com/2010/03/hoogsteen-base-pair.html"&gt;Hoogsteen base pair&lt;/a&gt; is related to the Watson-Crick base pair by a 180-degree rotation about the glycosidic bond (N9–C1'). While the A•T Hoogsteen base pair is classic, the similar G•C+ Hoogsteen pair (with protonation of cytosine N3) is equally possible. The A•T and G•C Hoogsteen base pairs have two perfect H-bonds, so they are energetically stable. As for their existence in DNA duplex, the most direct evidence comes from the "trap" experiments (see Fig.3 of the paper). In the &lt;i&gt;News &amp; Views&lt;/i&gt; section, Honig and Rohs &lt;a href="http://www.nature.com/nature/journal/v470/n7335/full/470472a.html"&gt;provide a nice recap of the main point and implications of this work&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;As also observed in &lt;a href="http://xiang-jun.blogspot.com/2011/02/g-t-mismatch-with-perfect-watson-crick.html"&gt;another recent publication&lt;/a&gt;, "&lt;a href="http://www.pnas.org/content/108/5/1862.full"&gt;Replication infidelity via a mismatch with Watson–Crick geometry&lt;/a&gt;", the base sequence has a subtle role in influencing the base-pairing schemes, three-dimensional structures and biological functions of DNA. However, we should not forget that only the Watson-Crick base pairs, and to a less extent, the G-U wobble pair, have the correct symmetry to ensure a "regular" double helical structure.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-8570409036661092573?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/8570409036661092573/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/02/evidences-for-transient-hoogsteen-base.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8570409036661092573'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8570409036661092573'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/02/evidences-for-transient-hoogsteen-base.html' title='Evidences for transient Hoogsteen base pairs in canonical DNA duplex'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-4174353288192742265</id><published>2011-02-20T23:35:00.002-05:00</published><updated>2011-02-21T11:06:21.363-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Canned responses in gmail make it easy to send common messages</title><content type='html'>Through Gary Rosenzweig's &lt;a href="http://macmost.com/gmail-labs.html"&gt;MacMost Now video #509 "Gmail Labs"&lt;/a&gt; (January 28, 2011), I first heard of "Canned Responses" in Gmail Labs:&lt;br /&gt;&lt;blockquote&gt;Email for the truly lazy. Save and then send your common messages using a button next to the compose form.&lt;/blockquote&gt;This is a truly handy feature that I have long been waiting for! Yet even though I am aware of Gmail Labs and enabled quite a few experimental features a while ago, I've not been searching Gmail Labs for new features ever since.&lt;br /&gt;&lt;br /&gt;Over the past few weeks, I have found "Canned Responses" increasingly indispensable in my support of the 3DNA forum (as a sideline project):&lt;br /&gt;&lt;ol&gt;&lt;li&gt;When I &lt;a href="http://xiang-jun.blogspot.com/2009/07/on-maintaining-3dna-forum.html"&gt;activate a new 3DNA forum registration&lt;/a&gt;, I've always included a "standard" message to "make the forum policy upfront and explicit, in order to avoid misunderstandings or surprises." Previously, I had to copy-and-paste, e.g., from a specifically created text file or elsewhere. Surely, this worked, but I had felt intuitively that there must be a better way to get the job done. Well, that's exactly where "Canned Responses" fit in!&lt;/li&gt;&lt;li&gt;Over the past few months, I have been ever more bothered by spam registrations. So as a further filter, I have been sending the following enquiry message to each suspicious registration for activation:&lt;br /&gt;&lt;blockquote&gt;Thanks for your registration at &lt;a href="http://3dna.rutgers.edu:8080/forum"&gt;the 3DNA forum&lt;/a&gt;. Please tell me a little bit about yourself and elaborate on how 3DNA could be useful to your project; we would like to make the forum spam-free.&lt;br /&gt;&lt;br /&gt;See also "&lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=3&amp;amp;t=184"&gt;Further notes on forum registration and posting&lt;/a&gt;" -- you may not need to register.&lt;/blockquote&gt;Here once again, the "Canned Responses" feature makes my life much easier! Moreover, this step turns out to be extremely effective; a large percentage of registrations is filtered out at this final stage.&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;Are you using gmail? If so, you may also want to give "Canned Responses" a try.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-4174353288192742265?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/4174353288192742265/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/02/canned-responses-in-gmail-make-it-easy.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4174353288192742265'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4174353288192742265'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/02/canned-responses-in-gmail-make-it-easy.html' title='Canned responses in gmail make it easy to send common messages'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-8944187954162304442</id><published>2011-02-13T23:00:00.000-05:00</published><updated>2011-02-13T23:00:25.064-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Making data maximally available?</title><content type='html'>In the February 11, 2011 issue of &lt;i&gt;Science&lt;/i&gt; (Vol. 331 no. 6018 p. 649), there is an editorial, titled "&lt;a href="http://www.sciencemag.org/content/331/6018/649.summary"&gt;Making Data Maximally Available&lt;/a&gt;". Indeed, the issue contains a special section on "Dealing with Data".&lt;br /&gt;&lt;blockquote&gt;Science is driven by data. New technologies have vastly increased the ease of data collection and consequently the amount of data collected, while also enabling data to &lt;i&gt;&lt;b&gt;be independently mined and reanalyzed by others&amp;nbsp;&lt;/b&gt;&lt;/i&gt;... It is obvious that making data widely available is an essential element of scientific research.&lt;/blockquote&gt;Especially, I like the following two (proposed) new policies:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;To extended data access requirement "to include computer codes involved in the creation or analysis of data." If properly implemented/enforced, this policy could significantly increase the repeatability and assessment of published results. In my experience, I have observed too many times that secrets are hidden in the seemingly "little" subtle details.&lt;/li&gt;&lt;li&gt;"To produce a single list that combines references from the main paper and the SOM" (supporting online material) to "provide credit and reveal data sources more clearly". Potentially, this will also increase the &lt;a href="http://xiang-jun.blogspot.com/2010/11/under-citation-of-method-papers.html"&gt;citation of method papers&lt;/a&gt;.&lt;/li&gt;&lt;/ol&gt;Hopefully, other journals will follow &lt;i&gt;Science&lt;/i&gt;'s lead to make data maximally available, and to present data more transparently.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-8944187954162304442?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/8944187954162304442/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/02/making-data-maximally-available.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8944187954162304442'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8944187954162304442'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/02/making-data-maximally-available.html' title='Making data maximally available?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-6184317983335942282</id><published>2011-02-06T23:24:00.001-05:00</published><updated>2011-02-06T23:25:59.096-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>A G-T mismatch with perfect Watson-Crick geometry</title><content type='html'>In the February 1, 2011 issue of PNAS [108(5)], there is an interesting article "&lt;a href="http://www.pnas.org/content/108/5/1862.full"&gt;Replication infidelity via a mismatch with Watson–Crick geometry&lt;/a&gt;" by Bebenek &lt;i&gt;et al&lt;/i&gt;. They solved the Pol λ DL ternary complex (PDB id: &lt;a href="http://www.pdb.org/pdb/explore/explore.do?structureId=3PML"&gt;3PML&lt;/a&gt;) which has a G-T nascent mispair&amp;nbsp;in "perfect" Watson-Crick geometry (see&amp;nbsp;their Fig 4, linked below).&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://www.pnas.org/content/108/5/1862/F3.large.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="190" src="http://www.pnas.org/content/108/5/1862/F3.large.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;From an H-bonding (energitic) point of view, the G-T mispair (with three "H-bonds") can only be possible if G or T is in the rare enol tautomeric state, or is ionized. The pH dependence of single nucleotide disincorporation seems to be consistent with an ionized base pair. Note the G-T mispair is different from a &lt;a href="http://xiang-jun.blogspot.com/2009/11/how-shear-affects-twist-angle-of.html"&gt;Wobble pair in which G and T have a relative sheared motion&lt;/a&gt;&amp;nbsp;(see also Fig. 4C above).&lt;br /&gt;&lt;br /&gt;I am glad to find that 3DNA was used in deriving the parameters. By design, 3DNA should be able to identify such "unusual" mispair as easily as for a normal Watson-Crick pair. As noted in our 2008 3DNA &lt;i&gt;Nature Protocols&lt;/i&gt; paper,&lt;br /&gt;&lt;blockquote&gt;By taking advantage of the standard base reference frame and selected geometric features, the &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;find_pair&lt;/span&gt; program within 3DNA can identify &lt;i&gt;&lt;b&gt;all possible&lt;/b&gt;&lt;/i&gt; nucleic-acid base pairs, whether they are canonical Watson–Crick or noncanonical pairs and are made up of normal or modified bases, in any tautomeric form or protonation state. (p1217)&lt;/blockquote&gt;Moreover, 3DNA does notice and signify (wit a &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;*&lt;/span&gt; instead of the normal &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;-&lt;/span&gt;) the atypical H-bonding feature of the G-T mispair to draw further attentions. &lt;br /&gt;&lt;pre style="font-color: #eee; font-size: 9px;"&gt;5 T-*---g  [3]  O2 - N2  3.06  N3 * N1  2.97  O4 * O6  2.66&lt;br /&gt;&lt;/pre&gt;Hopefully, this example helps illustrate some of 3DNA's unique features that would hopefully be more widely recognized and applied.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-6184317983335942282?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/6184317983335942282/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/02/g-t-mismatch-with-perfect-watson-crick.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6184317983335942282'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6184317983335942282'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/02/g-t-mismatch-with-perfect-watson-crick.html' title='A G-T mismatch with perfect Watson-Crick geometry'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-1168768086138637412</id><published>2011-01-30T23:50:00.000-05:00</published><updated>2011-01-30T23:50:44.858-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Trial of scientific papers by blogs and tweets?</title><content type='html'>In the January 20, 2011 issue of &lt;i&gt;Nature&lt;/i&gt;, there is an interesting News Feature article by Apoorva Mandavilli, titled "&lt;a href="http://www.nature.com/news/2011/110119/full/469286a.html"&gt;Peer review: Trial by Twitter&lt;/a&gt;":&lt;br /&gt;&lt;blockquote&gt;Blogs and tweets are ripping papers apart within days of publication, leaving researchers unsure how to react.&lt;/blockquote&gt;Specifically, two widely publicized papers in &lt;i&gt;Science&lt;/i&gt;&amp;nbsp;are singled out: one is about the &lt;a href="http://www.sciencemag.org/content/early/2010/11/10/science.1190532"&gt;longevity genes&lt;/a&gt; identified through genome-wide association study (GWAS), published last July; another is a more recent one on &lt;a href="http://www.sciencemag.org/content/early/2010/12/01/science.1197258"&gt;arsenic bacteria&lt;/a&gt;, published last December. In each case, while "the popular media was trumpeting the finding, other researchers were taking to the web to criticize the paper’s methodology." Yet, the authors failed to hold up their claims in the papers.&lt;br /&gt;&lt;br /&gt;With great interest, I've been following the story on arsenic bacteria. I first noticed this work through &lt;i&gt;Science&lt;/i&gt; Podcast, and found the topic of "arsenic" life intriguing. So for general knowledge, I read carefully the abstract and browses through the text. One week later came the &lt;i&gt;Nature&lt;/i&gt; editorial "&lt;a href="http://www.nature.com/nature/journal/v468/n7326/full/468867a.html"&gt;Response required&lt;/a&gt;", and from which, I followed the link to Rosie Redfield's blog post "&lt;a href="http://rrresearch.blogspot.com/2010/12/arsenic-associated-bacteria-nasas.html"&gt;Arsenic-associated bacteria (NASA's claims)&lt;/a&gt;". I read the post, and many of the comments therein; while I do not understand many of the technical details, I had no difficulty in following her argument. The lead author of the arsenic bacteria paper, Dr. Felisa Wolfe-Simon, did &lt;a href="http://ironlisa.com/gfaj/GFAJquestions_Response_16Dec2010.pdf"&gt;respond to comments&lt;/a&gt; on December 16, 2010. &lt;i&gt;Science&lt;/i&gt; also published an interview with  Wolfe-Simon, titled "&lt;a href="http://www.sciencemag.org/content/330/6012/1734.summary"&gt;Discoverer Asks for Time, Patience Over Arsenic Bacteria Controversy&lt;/a&gt;". On the same day, Redfield was quick to write another blog post "&lt;a href="http://rrresearch.blogspot.com/2010/12/text.html"&gt;Comments on Dr. Wolfe-Simon's Response&lt;/a&gt;", which again has received many comments. So far, the story is still unfolding, and  &lt;i&gt;Science&lt;/i&gt;&amp;nbsp;has promised to publish technical comments and responses in early 2011.&lt;br /&gt;&lt;br /&gt;As a related topic, throughout the &lt;a href="http://www.ccp4.ac.uk/ccp4bb.php"&gt;CCP4bb&lt;/a&gt;, I noticed the letter to editor titled "&lt;a href="http://journals.iucr.org/d/issues/2010/09/00/be5150/index.html"&gt;Is too ‘creative’ language acceptable in crystallography?&lt;/a&gt;" by Alexander Wlodawer &lt;i&gt;et al&lt;/i&gt;. I agree fully with the authors that "While figures of speech are often useful and even educational, flashy titles combined with hyperbolae and imprecise language can mislead or deceive nonspecialist readers and should therefore be avoided."&lt;br /&gt;&lt;br /&gt;In the Internet Age, bloggers and tweeters clearly have an important role to play in the assessment of research findings. In scientific publications, what counts is not how much one claims, but to what extent one can hold up such claims. Solid work holds up over time and the scrutiny of peers.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-1168768086138637412?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/1168768086138637412/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/01/trial-of-scientific-papers-by-blogs-and.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1168768086138637412'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1168768086138637412'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/01/trial-of-scientific-papers-by-blogs-and.html' title='Trial of scientific papers by blogs and tweets?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5510242685117811355</id><published>2011-01-22T20:14:00.002-05:00</published><updated>2011-01-23T21:14:32.013-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Three structural biology papers in the latest issue of NAR cite 3DNA</title><content type='html'>While browsing the &lt;a href="http://nar.oxfordjournals.org/content/39/2.toc"&gt;latest 39(2) January 2011 issue&lt;/a&gt; of &lt;i&gt;Nucleic Acids Research&lt;/i&gt; (NAR), I found, to my great surprise, three papers that cite 3DNA. These papers, all under the "structural biology" section, are of interest to me from their titles and abstracts, so I downloaded the PDF versions and read through each of them. &lt;br /&gt;&lt;br /&gt;For this blog post, #100 by incidence, it would be intriguing to look into the context to see how 3DNA is cited.&lt;br /&gt;&lt;br /&gt;&lt;hr /&gt;"&lt;a href="http://nar.oxfordjournals.org/content/39/2/712"&gt;Asymmetric DNA recognition by the OkrAI endonuclease, an isoschizomer of BamHI&lt;/a&gt;" by Vanamee &lt;i&gt;et al.&lt;/i&gt; (Mount Sinai School of Medicine, and New England Biolabs):&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;Analysis of the stereochemical quality of the protein model and assignment of secondary structure were conducted with PROCHECK (13). DNA analysis was performed with 3DNA (14). Solvent-accessible surface areas were calculated in CNS with the algorithm of Lee and Richards employing a 1.4-Å probe(15). Figures were prepared using PyMOL (www.pymol.org). [p713, from bottom left to middle right]&lt;/blockquote&gt;&lt;br /&gt;&lt;hr /&gt;"&lt;a href="http://nar.oxfordjournals.org/content/39/2/744"&gt;DNA intercalation without flipping in the specific ThaI–DNA complex&lt;/a&gt;" by&amp;nbsp;Firczuk &lt;i&gt;et al&lt;/i&gt;. (Poland, Germany and UK):&lt;br /&gt;&lt;blockquote&gt;An oligoduplex with the correct sequence in standard B-DNA geometry was generated with the program 3DNA (44), and manually adjusted to fit the highly distorted DNA in the structure. ... The programs COOT (45), REFMAC (46) and CNS (47) were used for refinement. [p747, top left]&lt;/blockquote&gt;&lt;blockquote&gt;Analysis with the 3DNA software (44) shows that the intercalation increases the rise between base pairs to about 7 Å or approximately twice its usual value (Figure 5B). Phosphorus–phosphorus (Pn–Pn+1) distances in the DNA backbone are only mildly altered (values range from 5.6 to 7.0 Å). Instead, the extra height of the two CG steps comes at the expense of the twist, which is reduced from its usual value of about 36° (360°/10) to between 10 and 15°. A view  toward the major groove shows that the inner base pairs of the recognition sequence are strongly tilted (Figure 5). According to the 3DNA software (44), the first CG step has a negative tilt of about ~12°, which results in the oblique orientation of the following base pairs. The central GC step is characterized by a tilt close to 0°, reflecting the nearly parallel arrangement of the middle bases. Finally, the second CG step has a positive tilt of about 15°&amp;nbsp;which restores the standard orientation of the downstream base pairs. A side view of the DNA indicates a bend at the center of the recognition sequence which is primarily due to the positive ~12°&amp;nbsp;roll of the central GC step into the major groove (Table 1). The 3DNA program also indicates that the propeller twist is positive for the specifically recognized sequence, and (as expected for the standard B-DNA) negative for most of the flanking base pairs. [p749, top right]&lt;/blockquote&gt;&lt;blockquote&gt;Table 1. DNA distortion in complex with ThaI restriction endonuclease: all parameters were calculated with the 3DNA software (44). [p750, middle left]&lt;/blockquote&gt;&lt;br /&gt;&lt;hr /&gt;"&lt;a href="http://nar.oxfordjournals.org/content/39/2/767"&gt;On the molecular basis of uracil recognition in DNA: comparative study of T-A versus U-A structure, dynamics and open base pair kinetics&lt;/a&gt;" by Fadda and Pomès (Ireland and Canada): &lt;br /&gt;&lt;blockquote&gt;MD simulations were run with versions 3.3.3 up to 4.0.4 of the GROMACS software package (47,48).&lt;br /&gt;&lt;br /&gt;Structural parameters were determined with the 3DNA software package (51,52). The pymol (www .pymol.org) software package was used to generate figures. [p769, bottom right]&lt;/blockquote&gt;&lt;br /&gt;Established in 1974 and currently with an impact factor of 7.479, NAR has also been chosen by the Special Libraries Association as &lt;a href="http://www.oxfordjournals.org/our_journals/nar/sla100.html"&gt;one of the top 100 most influential journals in medicine and biology&lt;/a&gt; over the last 100 years. The citations by the three papers in the latest issue of NAR illustrate unambiguously 3DNA's big impact in structural biology.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5510242685117811355?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5510242685117811355/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/01/three-structural-biology-papers-latest.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5510242685117811355'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5510242685117811355'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/01/three-structural-biology-papers-latest.html' title='Three structural biology papers in the latest issue of NAR cite 3DNA'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-8855889200204093984</id><published>2011-01-19T23:44:00.001-05:00</published><updated>2011-10-16T23:17:31.791-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Ruby scripts for 3DNA analysis of molecular dynamics simulation trajectories</title><content type='html'>Over the years, I've been very pleased to see 3DNA's ever-increasing applications for the analysis of molecular dynamics (MD) simulation trajectories of nucleic acid structures. Among its other features, this illustrates that the command-line driven approach of&amp;nbsp;3DNA&amp;nbsp;makes it easily&amp;nbsp;integrable&amp;nbsp;into the MD analysis pipeline (with some scripting, of course).&lt;br /&gt;&lt;br /&gt;However, the lack of direct support of 3DNA to the ever more popular field of MD simulations has caused several obvious problems:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Repeated efforts – virtually every lab or even MD&amp;nbsp;practitioner could come up with an &lt;i&gt;ad hoc&lt;/i&gt;&amp;nbsp;scripting solution.&lt;/li&gt;&lt;li&gt;Hinderance to 3DNA's even wider adoption – new comer to the MD field, or &lt;a href="http://wiki.answers.com/Q/What_is_the_definition_of_bench_science"&gt;bench scientists&lt;/a&gt;&amp;nbsp;interested in dynamics simulations would be scared off.&lt;/li&gt;&lt;li&gt;Known issues with existing approaches – most predominately the unnecessary&amp;nbsp;&lt;i&gt;repetitive&lt;/i&gt; run of &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;find_pair&lt;/span&gt; to deduce base pairing information for each snapshot (model), which not only takes time, but more seriously some pairs could be missing due to melted out or distortion along the&amp;nbsp;trajectory.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;I've been following 3DNA's citations for years and I am well aware of the above issues: in addition to answering relevant questions in the 3DNA forum, I have blogged specifically on the topic a few times:&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2010/07/3dna-for-analysis-of-molecular-dynamics.html"&gt;3DNA for the analysis of molecular dynamics simulations&lt;/a&gt; [Saturday, July 24, 2010]&lt;/li&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2009/12/3dna-in-pccp-nucleic-acid-simulations.html"&gt;3DNA in the PCCP nucleic acid simulations themed issue&lt;/a&gt; [Sunday, December 6, 2009]&lt;/li&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2009/10/3dna-in-molecular-dynamics-simulations.html"&gt;3DNA in molecular dynamics simulations&lt;/a&gt; [Sunday, October 4, 2009]&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;Of course, I am in a unique position to help solve the problem. Indeed, for the past couple of years, I've been thinking of writing scripts to make life easier for MD practitioners who care to use 3DNA. However, due to my lack of experience in MD simulations, constraints of "spare" time (plus laziness), and a want of suitable collaborator, I've never found the incentive to get the job done. &lt;br /&gt;&lt;br /&gt;I finally decided to write some Ruby scripts to streamline the process of using 3DNA in MD simulations, after a recent question from Aneesh on "&lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=1&amp;amp;t=192"&gt;script for extracting data from 3DNA output file&lt;/a&gt;" in the 3DNA forum. After a few exchanges of views with Aneesh, and especially with Alpay's contribution of Python script and sample dataset, I've finished up two standalone yet connected Ruby scripts to analyze MD simulation trajectories with 3DNA and then extract various structural parameters. The details, including source code and test examples, are available in the 3DNA forum under "&lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=11&amp;amp;t=222"&gt;Ruby scripts for the analysis of MD simulation trajectories&lt;/a&gt;", in a newly created section titled "Molecular dynamics simulations". &lt;br /&gt;&lt;br /&gt;The sample file ("sample_md0.pdb") distributed with the current v0.1 of the scripts contains 21 snapshots (models, 0..20), separated by MODEL/ENDMDL pairs. While the sample is based on a trajectory file from AMBER, any MD simulation packages, or NMR ensembles, can be similarly handled as well.&lt;br /&gt;&lt;br /&gt;Now the ball is rolling. As time goes by, and with users' feedback, I will refine and expand the functionality of the scripts as necessary. I am confident to see more applications of 3DNA in the "dynamic" molecular simulation field.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-8855889200204093984?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/8855889200204093984/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/01/ruby-scripts-for-3dna-analysis-of.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8855889200204093984'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8855889200204093984'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/01/ruby-scripts-for-3dna-analysis-of.html' title='Ruby scripts for 3DNA analysis of molecular dynamics simulation trajectories'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-6830086759582590427</id><published>2011-01-09T22:46:00.000-05:00</published><updated>2011-01-09T22:46:26.250-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Open dictionary with command-control-d in Mac OS X</title><content type='html'>In a recent &lt;a href="http://macmost.com/"&gt;MacMost Newsletter&lt;/a&gt;, I came across the following handy trick in Mac OS X: while the cursor is over a word (not necessarily selected), one can press &lt;b&gt;command-control-d&lt;/b&gt; to open dictionary which pops up a little window with the word's definition. This works in &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;Safari&lt;/span&gt; and &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;Mail&lt;/span&gt;, but not in &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;Preview&lt;/span&gt; (unfortunately).&lt;br /&gt;&lt;br /&gt;Previously, when I need to check the definition of a word, I right-click on it and then follow the link "Look Up in Dictionary". This will launch or pop up &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;Dictionary&lt;/span&gt; with detailed information about the word (Dictionary/Thesaurus/wikipedia). &lt;br /&gt;&lt;br /&gt;The right-click method seems to integrate better with other Mac OS X applications. For example, it works with &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;Preview&lt;/span&gt; as well. However, now that I know it, I&amp;nbsp;sometimes&amp;nbsp;prefer the &lt;b&gt;command-control-d&lt;/b&gt; approach better; it is quick and non-obstructive.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-6830086759582590427?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/6830086759582590427/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/01/open-dictionary-with-command-control-d.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6830086759582590427'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6830086759582590427'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/01/open-dictionary-with-command-control-d.html' title='Open dictionary with command-control-d in Mac OS X'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-4773056464623231605</id><published>2011-01-09T22:07:00.000-05:00</published><updated>2011-01-09T22:07:08.159-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>IUPAC nucleotide symbols and their complements</title><content type='html'>Recently, I was interested in knowing the complements of all the IUPAC nucleotide symbols. As blogged previously, I am quite familiar with the "&lt;a href="http://xiang-jun.blogspot.com/2009/07/nucleotide-iupac-code.html"&gt;meaning of nucleotide IUPAC codes&lt;/a&gt;" (namely A/C/G/T, and R/Y/N etc). However, when I first check &lt;a href="http://www.geneinfinity.org/sp/sp_nucsymbols.html"&gt;the Gene Infinity website on nucleotide symbols&lt;/a&gt;, it still puzzled me for awhile to figure out the meaning of the DNA alphabet (with complements), as except below:&lt;br /&gt;&lt;pre style="background-color: #cccccc;"&gt;A  C  G  T    M  R  W  S  Y  K    B  D  H  V    N&lt;br /&gt;|  |  |  |    |  |  |  |  |  |    |  |  |  |    |&lt;br /&gt;T  G  C  A    K  Y  W  S  R  M    V  H  D  B    N&lt;br /&gt;&lt;/pre&gt;For example, some degenerated IUPAC symbols are complemented to themselves (e.g., W–W and S–S), while others are seemingly "hard" to apprehend (e.g., B–V and D–H).&lt;br /&gt;&lt;br /&gt;After thinking it for a bit, things begin to become clear. They are based on the complementarity of Watson-Crick base-pairs (A–T and G–C) and the meaning of each degenerated IUPAC nucleotide symbol. For example,&lt;br /&gt;&lt;ul&gt;&lt;li&gt;W represents A/T, meaning weak (with only two hydrogen-bonds). The complements of A/T are T/A respectively, which is W again.&lt;/li&gt;&lt;li&gt;B (not A) represents C/G/T, and their complements are G/C/A respectively, which is V (not U/T).&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;It is easy to verify that all other&amp;nbsp;complementary pairs follow exactly the same basic principle.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-4773056464623231605?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/4773056464623231605/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2011/01/iupac-nucleotide-symbols-and-their.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4773056464623231605'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4773056464623231605'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2011/01/iupac-nucleotide-symbols-and-their.html' title='IUPAC nucleotide symbols and their complements'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-543007900739903254</id><published>2010-12-13T19:52:00.000-05:00</published><updated>2010-12-13T19:52:14.973-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Extract images from PDF files using 'pdfimages'</title><content type='html'>Once in a while, I need to extract an image (or a portion thereof) from a PDF file. Usually, I open the PDF file using &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;'preview'&lt;/span&gt; or Adobe &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;'Acrobat Reader'&lt;/span&gt;, and take a screenshot which is then cropped to the desired slice. This manual method "works", albeit a bit tedious.&lt;br /&gt;&lt;br /&gt;Recently, I came across the handy command-line&amp;nbsp;&lt;a href="http://en.wikipedia.org/wiki/Pdfimages"&gt;utility program &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;'pdfimages'&lt;/span&gt;&lt;/a&gt;&amp;nbsp;which allows for automatic extraction of all images from a PDF file. The basic usage is simply:&lt;br /&gt;&lt;pre style="background-color: #eeeeee; margin: 1em;"&gt;pdfimages   yourfile.pdf   prefix&lt;br /&gt;&lt;/pre&gt;This will extract all images&amp;nbsp;contained in&amp;nbsp;&lt;span class="Apple-style-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;yourfile.pdf &lt;/span&gt;&lt;/span&gt;to files &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;prefix-001.ppm&lt;/span&gt;, &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;prefix-002.ppm&lt;/span&gt; etc. &lt;a href="http://manpages.ubuntu.com/manpages/lucid/man1/pdfimages.1.html"&gt;With option "-j", images in DCT format are saved as JPEG files&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-543007900739903254?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/543007900739903254/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/12/extract-images-from-pdf-files-using.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/543007900739903254'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/543007900739903254'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/12/extract-images-from-pdf-files-using.html' title='Extract images from PDF files using &apos;pdfimages&apos;'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-7461923893055439613</id><published>2010-12-05T23:53:00.000-05:00</published><updated>2010-12-05T23:53:26.958-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Nice blog on English writing by Lynn Gaertner-Johnston</title><content type='html'>A while back, I was somehow curious to figure out the differences between entitled vs. titled. A colleague referred me to &lt;a href="http://www.businesswritingblog.com/business_writing/2007/10/entitled-or-tit.html"&gt;a blog post&lt;/a&gt; on this topic by Lynn Gaertner-Johnston. After reading the post, it became clear that over the years I had made the same mistake numerous times. I then corrected nearly all occurrences of "entitled" to "titled" in my blog and 3DNA forum posts. Ever since, I have been following Lynn Gaertner-Johnston's blog, titled &lt;a href="http://www.businesswritingblog.com/"&gt;"Business Writing"&lt;/a&gt;, on a regular basis, and found it helpful to improve my English writing skills.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-7461923893055439613?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/7461923893055439613/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/12/nice-blog-on-english-writing-by-lynn.html#comment-form' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7461923893055439613'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7461923893055439613'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/12/nice-blog-on-english-writing-by-lynn.html' title='Nice blog on English writing by Lynn Gaertner-Johnston'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2792996256051061957</id><published>2010-11-28T22:23:00.001-05:00</published><updated>2010-11-28T22:26:57.428-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Under-citation of method papers?</title><content type='html'>Recently in the CCP4BB, there is an interesting thread with extensive discussions on "&lt;a href="https://www.jiscmail.ac.uk/cgi-bin/webadmin?A1=ind1011&amp;amp;L=CCP4BB#20"&gt;Citations in supplementary material&lt;/a&gt;". The original poster refers to the &lt;a href="http://journals.iucr.org/d/issues/2010/12/00/issconts.html"&gt;recent &lt;i&gt;Acta Cryst D&lt;/i&gt; editorial with the same title&lt;/a&gt; in which the authors&amp;nbsp;highlights the issue of under-citation to papers published in the International Union of Crystallography (IUCr) journals.&lt;br /&gt;&lt;br /&gt;The main point is that method papers are more likely to be cited in the supplementary materials only, which are not indexed by PubMed, Scopus, Web of Science or Google Scholar etc. As a result, they are statistically undercounted, and  "Journals and scientists that focus on publishing methodologically oriented papers are particularly affected." Specifically, through a survey of articles on protein or nucleic acid structure determination published in &lt;i&gt;Cell&lt;/i&gt;, &lt;i&gt;Nature&lt;/i&gt;, &lt;i&gt;Science&lt;/i&gt;, and &lt;i&gt;PNAS&lt;/i&gt; in 2009, the authors found that "almost half of all references to publications in IUCr journals end up&amp;nbsp;being published in the supplementary material only."&lt;br /&gt;&lt;br /&gt;The findings of the editorial resonate with my observations, and I cannot agree more with the authors that "in the end, methods need to be continuously developed and refined in order to ensure progress."&lt;br /&gt;&lt;br /&gt;On the other end of the spectrum, some highly influential method papers are heavily cited. As an extreme case, the large number of citations to the 2008 paper "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/18156677"&gt;A short history of SHELX&lt;/a&gt;" by George Sheldrick helps &lt;a href="https://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=CCP4BB;a9fc9ae3.1006"&gt;rocket up the impact factor of &lt;i&gt;Acta Crystallographica A&lt;/i&gt; by 20-fold to 49.9&lt;/a&gt; this year!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2792996256051061957?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2792996256051061957/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/11/under-citation-of-method-papers.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2792996256051061957'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2792996256051061957'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/11/under-citation-of-method-papers.html' title='Under-citation of method papers?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-1900439347051405868</id><published>2010-11-21T22:41:00.000-05:00</published><updated>2010-11-21T22:41:28.841-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Belorussian translation of 3DNA webpages</title><content type='html'>Recently, I communicated with Paul Bukhovko on the translation of 3DNA webpages &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;http://rutchem.rutgers.edu/~xiangjun/3DNA/&lt;/span&gt; into &lt;a href="http://en.wikipedia.org/wiki/Belarusian_language"&gt;Belorussian&lt;/a&gt;.&amp;nbsp;As the author of the original website, referred to as &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;http://rutchem.rutgers.edu/~olson/3DNA/&lt;/span&gt; (which is simply a soft link to the above URL) in &lt;a href="http://nar.oxfordjournals.org/content/31/17/5108"&gt;the 2003 3DNA NAR paper&lt;/a&gt;, I was very surprisingly pleased when Paul asked for permission to perform the translation, which I gladly granted.&lt;br /&gt;&lt;br /&gt;Regarding the process, Paul commented:&lt;br /&gt;&lt;blockquote&gt;Was a pleasure to translate this page! It's kinda fresh and related to my professional interests, so I thought - why not, if the author allows to do so.&lt;/blockquote&gt;&lt;br /&gt;The translated page is at URL:&amp;nbsp;&lt;a href="http://www.movavi.com/opensource/3DNA-be"&gt;http://www.movavi.com/opensource/3DNA-be&lt;/a&gt;. Interestingly, when I used Google Translate to &lt;a href="http://translate.google.com/translate?hl=en&amp;amp;sl=be&amp;amp;tl=en&amp;amp;u=http%3A%2F%2Fwww.movavi.com%2Fopensource%2F3DNA-be"&gt;convert the Belorussian version back to English&lt;/a&gt;, the outcome is pretty readable. In contrast, when&amp;nbsp;the original English version is&amp;nbsp;directly &lt;a href="http://translate.google.com/translate?hl=en&amp;amp;sl=en&amp;amp;tl=zh-CN&amp;amp;u=http%3A%2F%2Frutchem.rutgers.edu%2F~xiangjun%2F3DNA%2F"&gt;translated to Chinese&lt;/a&gt;, the result is beyond recognition!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-1900439347051405868?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/1900439347051405868/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/11/belorussian-translation-of-3dna.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1900439347051405868'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1900439347051405868'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/11/belorussian-translation-of-3dna.html' title='Belorussian translation of 3DNA webpages'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2802811622714256075</id><published>2010-11-14T23:26:00.003-05:00</published><updated>2011-02-21T12:07:07.578-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Proper labeling of O1P and O2P atoms in a phosphate group</title><content type='html'>Recently, a &lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=1&amp;amp;t=186"&gt;question on the 3DNA&amp;nbsp;&lt;span style="font-family: 'Courier New', Courier, monospace;"&gt;o1p_o2p&lt;/span&gt; utility program&lt;/a&gt; in the forum led me to reflect on the proper labeling of O1P/O2P atoms in a phosphate group. As is well-known, in DNA/RNA structures, the phosphate group (see figure below left) connected two neighboring nucleosides.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://biochem.co/wp-content/uploads/2008/07/sugarphosphatebackbone.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="200" src="http://biochem.co/wp-content/uploads/2008/07/sugarphosphatebackbone.png" width="178" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;a href="http://2.bp.blogspot.com/_CEuD25cqpCE/TOCjG64eB-I/AAAAAAAAAFU/VlBniJT-0HE/s1600/po4.png" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="191" src="http://2.bp.blogspot.com/_CEuD25cqpCE/TOCjG64eB-I/AAAAAAAAAFU/VlBniJT-0HE/s200/po4.png" width="200" /&gt;&lt;/a&gt;The two nonbridging oxygen&amp;nbsp;atoms of the phosphate group (the&amp;nbsp;horizontal&amp;nbsp;&lt;b&gt;O&lt;/b&gt;s in the O-P=O line, left) are named O1P and O2P in PDB files (see also figure to the right). Stereochemically, O1P and O2P are also designated as pro-R and pro-S oxygens, respectively.&lt;br /&gt;&lt;br /&gt;Presumably, the structural files in the PDB and NDB databases should be consistent and follow the standard nomenclature.&amp;nbsp;In practice, however, some entries in the NDB had mislabeled O1P/O2P atoms (e.g.,&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;adh026&lt;/span&gt;).&amp;nbsp;I first noticed this issue when I superposed the A-DNA&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;adh026&lt;/span&gt;&amp;nbsp;to its 3DNA rebuilt version with the sugar-phosphate backbone. I observed&amp;nbsp;an unreasonably large RMSD &lt;i&gt;only&lt;/i&gt; for the&amp;nbsp;octamer&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;adh026&lt;/span&gt;, while&amp;nbsp;the RMSDs were much smaller (as expected) for the B-DNA dodecamer&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;bdl084&lt;/span&gt;&amp;nbsp;and the 146-bp nucleosomal DNA in&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;pd0001&amp;nbsp;&lt;/span&gt;(see &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;$X3DNA/examples/analyze_rebuild&lt;/span&gt; distributed with 3DNA v2.0). Since the standard building blocks in 3DNA were applied consistently, I traced the cause of the large RMSD problem to the PDB file of&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;adh026&amp;nbsp;&lt;/span&gt;itself, and finally identified it was actually due to the mislabeling of the O1P/O2P atoms.&lt;br /&gt;&lt;br /&gt;The utility program &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;o1p_o2p&lt;/span&gt; was written specifically for the purpose of checking if the O1P/O2P atoms are properly labeled in a PDB file. In a&amp;nbsp;phosphate group, if O1P/O2P are correctly labeled, then following &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;O1P--&amp;gt;O2P--&amp;gt;O5'&lt;/span&gt; in a right-handed sense would point the thumb in the direction of O3' (see the figure up right). As always, how it actually works is best illustrated with an example. Shown below, the second phosphate in &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;adh026&lt;/span&gt; is used (as distributed with 3DNA), with GNU &lt;span style="font-family: 'Courier New', Courier, monospace;"&gt;octave&lt;/span&gt; script. Here the O1P/O2P atoms are mislabeled since &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;direction&lt;/span&gt; has a negative value. In contrast, for a properly labeled phosphate group, &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;direction&lt;/span&gt; should be positive.&lt;br /&gt;&lt;br /&gt;&lt;pre style="background-color: #eeeeee; font-size: 66%;"&gt;#ATOM      6  O3*   G A   1       8.396  -3.995  -1.948  1.00 30.86           O  &lt;br /&gt;#ATOM     20  P     G A   2       8.163  -3.069  -0.619  1.00 32.38           P  &lt;br /&gt;#ATOM     21  O1P   G A   2       7.401  -1.917  -1.218  1.00 32.09           O  &lt;br /&gt;#ATOM     22  O2P   G A   2       7.280  -3.934   0.195  1.00 34.05           O  &lt;br /&gt;#ATOM     23  O5*   G A   2       9.600  -2.800  -0.121  1.00 29.41           O  &lt;br /&gt;&lt;br /&gt;P   = [8.163  -3.069  -0.619]&lt;br /&gt;O1P = [7.401  -1.917  -1.218]&lt;br /&gt;O2P = [7.280  -3.934   0.195]&lt;br /&gt;O3  = [8.396  -3.995  -1.948]&lt;br /&gt;O5  = [9.600  -2.800  -0.121]&lt;br /&gt;&lt;br /&gt;O1P_to_O2P = O2P - O1P                    # -0.12100  -2.01700   1.41300&lt;br /&gt;O2P_to_O5 = O5 - O2P                      #  2.32000   1.13400  -0.31600&lt;br /&gt;O1P_O2P_O5 = cross(O1P_to_O2P, O2P_to_O5) # -0.96497   3.23992   4.54223&lt;br /&gt;P_to_O3 = O3 - P                          #  0.23300  -0.92600  -1.32900&lt;br /&gt;&lt;br /&gt;direction = dot(O1P_O2P_O5, P_to_O3)    # -9.2616 &amp;lt; 0: O1P/O2P mislabeld&lt;br /&gt;&lt;/pre&gt;&lt;br /&gt;The O1P/O2P labeling issue is just a little detail I came cross while developing 3DNA. Nevertheless, it serves as an excellent example of the subtleties &lt;strike&gt;&lt;span class="Apple-style-span" style="background-color: yellow;"&gt;subtitles&lt;/span&gt;&lt;/strike&gt; that should be taken care of in scientific programming.&lt;br /&gt;&lt;br /&gt;Please note that&amp;nbsp;as of 2008,&amp;nbsp;in the remediated PDB/NDB entry &lt;span style="font-family: 'Courier New', Courier, monospace;"&gt;adh026&lt;/span&gt;, the mislabeled O1P/O2P pair has been&amp;nbsp;correct. More generally, O1P/O2P atoms have now been renamed as OP1/OP2, respectively. 3DNA v2.0 takes care of such naming changes internally; for generated PDB files, however, 3DNA still adopts the conventional O1P/O2P labeling.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2802811622714256075?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2802811622714256075/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/11/proper-labeling-of-o1p-and-o2p-atoms-in.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2802811622714256075'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2802811622714256075'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/11/proper-labeling-of-o1p-and-o2p-atoms-in.html' title='Proper labeling of O1P and O2P atoms in a phosphate group'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_CEuD25cqpCE/TOCjG64eB-I/AAAAAAAAAFU/VlBniJT-0HE/s72-c/po4.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-7747360911761310684</id><published>2010-11-06T23:44:00.000-04:00</published><updated>2010-11-06T23:44:51.655-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Transparency in the peer-review process of scientific papers</title><content type='html'>In the Nov. 4, 2010 issue of &lt;i&gt;Nature&lt;/i&gt;, there is an interesting Comment, titled "&lt;a href="http://www.nature.com/nature/journal/v468/n7320/full/468029a.html"&gt;Transparency showcases strength of peer review&lt;/a&gt;", by Bernd Pulverer, head of scientific publications at the European Molecular Biology Organization and chief editor of &lt;i&gt;The EMBO Journal&lt;/i&gt;. In this article, Pulverer "reflects on his experience at &lt;i&gt;The EMBO Journal&lt;/i&gt; of publishing referees’ reports, authors’ responses and editors’ comments alongside papers."&lt;br /&gt;&lt;br /&gt;The peer-review process of scientific articles has traditionally been a "black box": (anonymous) reviewers' reports, editors' comments, and authors' responses – extremely valuable information in shaping the final form of published papers – are all hidden from public view. In the Internet era, technology (e.g., online space) is no longer an issue. Now &lt;i&gt;The EMBO Journal&lt;/i&gt; has led the way, and "the experience has been overwhelmingly positive." Hopefully, other leading journals (e.g., &lt;i&gt;Nature&lt;/i&gt; and &lt;i&gt;Science&lt;/i&gt;) would follow the example. Afterall, making the peer-review process transparent is an excellent mean to increase the accountability of science and scientific publications.&lt;br /&gt;&lt;br /&gt;Overall, this article is well-written, succinct and logical, and it touches an important topic in scientific publication. Over the past few days, I have read several "Review Process Files" accompanying papers I am interested in, e.g., "&lt;a href="http://www.nature.com/emboj/journal/v29/n15/extref/emboj2010139s2.pdf"&gt;Recognition of the amber UAG stop codon by release factor RF1&lt;/a&gt;", and found them highly revealing.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-7747360911761310684?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/7747360911761310684/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/11/transparency-in-peer-review-process-of.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7747360911761310684'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7747360911761310684'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/11/transparency-in-peer-review-process-of.html' title='Transparency in the peer-review process of scientific papers'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2702973143885470411</id><published>2010-10-30T22:42:00.001-04:00</published><updated>2010-10-30T22:43:20.738-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Publication of scientific programming code</title><content type='html'>Recently on the &lt;i&gt;Nature&lt;/i&gt; website, I read with great interest a news article, titled "Publish your computer code: it is good enough", by Nick Barnes, a professional software engineer:&lt;br /&gt;&lt;blockquote&gt;Freely provided working code — whatever its quality — improves programming and enables others to engage with your research&lt;/blockquote&gt;Clearly the author knows the "trade secret" in scientific programming. He lists several common reasons why scientists are reluctant to share their source code, and then provides his responses: &lt;br /&gt;&lt;ol&gt;&lt;li&gt;The code is low quality — "software in all trades is written to be good enough for the job intended". All software has bugs. Sharing code would help improve the code itself and advance the research field.&lt;/li&gt;&lt;li&gt;Not a common practice — this is going to change or is already changing.&lt;/li&gt;&lt;li&gt;Demand for support — "Nobody is entitled to demand technical support for freely provided code."&lt;/li&gt;&lt;li&gt;Intellectual property issue — The most value part "lies in your expertise", code not backed by skilled experts is called &lt;i&gt;&lt;b&gt;abandonware&lt;/b&gt;&lt;/i&gt;. (I cannot agree more with this point.)&lt;/li&gt;&lt;li&gt;Polishing code takes time/effort — not need to, just supply, as supplementary materials in a website, the &lt;i&gt;original&lt;/i&gt; code used in your publication.&lt;/li&gt;&lt;/ol&gt;As is evident from the many comments, this assay is well echoed by the community. As an active computational scientist for over a decade, I share mostly the same opinions. Essentially, the transparency of source code is to ensure &lt;i&gt;repeatability&lt;/i&gt; of scientific publications. In the field of computational biology (bioinformatics), it is &lt;a href="http://xiang-jun.blogspot.com/2009/09/on-reproducibility-of-scientific.html"&gt;virtually impossible to reproduce exactly a published figure/table without direct access to details&lt;/a&gt;, including the source code.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2702973143885470411?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2702973143885470411/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/10/publication-of-scientific-programming.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2702973143885470411'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2702973143885470411'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/10/publication-of-scientific-programming.html' title='Publication of scientific programming code'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-8204125533152962257</id><published>2010-10-23T00:04:00.004-04:00</published><updated>2011-03-26T23:50:39.709-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>Chi (χ) torsion angle characterizes base/sugar relative orientation</title><content type='html'>&lt;a href="http://xiang-jun.blogspot.com/2010/08/calculation-of-chi-torsion-angle-of.html"&gt;Except for pseudouridine&lt;/a&gt;, a nucleoside in DNA/RNA contains an N-glycosidic bond that connects the base to the sugar. The chi (χ) torsion angle, which characterizes the relative base/sugar orientation, is defined by O4'-C1'-N1-C2 for pyrimidines (C, T and U), and O4'-C1'-N9-C4 for purines (A and G).&lt;br /&gt;&lt;br /&gt;Normally (as in A- and B-form DNA/RNA duplex), χ falls into the ranges of +90° to +180°;&amp;nbsp; –90° to&amp;nbsp;–180° (or 90° to 270°), corresponding to the &lt;i&gt;anti&lt;/i&gt; conformation (Figure below, top). Occasionally, χ has values in the range of&amp;nbsp;–90° to +90°, referring to the &lt;i&gt;syn&lt;/i&gt; conformation (Figure below, bottom). Note that in left-handed Z-DNA with CG repeating sequence, the purine G is in&amp;nbsp;&lt;i&gt;syn&lt;/i&gt;&amp;nbsp;conformation whilst the pyrimidine C is &lt;i&gt;anti&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_CEuD25cqpCE/TMLXKha_XKI/AAAAAAAAAFM/AyNavjlmSMY/s1600/chi_anti_syn.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://3.bp.blogspot.com/_CEuD25cqpCE/TMLXKha_XKI/AAAAAAAAAFM/AyNavjlmSMY/s320/chi_anti_syn.png" width="248" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;Presumably, the χ-related &lt;i&gt;anti/&lt;/i&gt;&lt;i&gt;syn&lt;/i&gt; conformation is a very basic and simple concept. In essence, though, the N-glycosidic bond and the corresponding χ torsion angle illustrate that the base and sugar are two &lt;i&gt;separate&lt;/i&gt; entities, i.e. there is an internal degree of freedom between them. In this respect, it is worth noting that the Leontis-Westhod sugar edge for base-pair classification corresponds to the &lt;i&gt;anti&lt;/i&gt; form only. When a base is flipped over into the &lt;i&gt;syn&lt;/i&gt; conformation, &amp;nbsp;the "sugar edge", defined in connection with the minor (shallow) groove side of a nitrogenous bases, simply does not exist.&lt;br /&gt;&lt;br /&gt;Base-flipping (&lt;i&gt;anti/&lt;/i&gt;&lt;i&gt;syn&lt;/i&gt; conformation switch) is one of the factors associated with the two possible relative orientations in a base pair, characterized explicitly in 3DNA as of type M+N or M–N since&amp;nbsp;&lt;a href="http://nar.oxfordjournals.org/content/31/17/5108.full"&gt;the 2003 NAR paper&lt;/a&gt; (Figure 2, linked below). I reemphasized this distinction in our 2010 &lt;a href="http://nar.oxfordjournals.org/content/38/14/4868.full"&gt;GpU dinucleotide platform paper&lt;/a&gt; (in particular, see &lt;a href="http://nar.oxfordjournals.org/content/suppl/2010/03/11/gkq155.DC1/nar-02598-m-2009-File005.pdf"&gt;supplementary Figure S2&lt;/a&gt;). Unfortunately, this subtle (but crucial, in my opinion) point has never been taken seriously (or at all) by the RNA community, even with 3DNA's wide adoption. However, as people know 3DNA deeper/better and take RNA base-pair classification more rigorously, I have no doubt they will begin to appreciate the simplicity of this explicit distinction and the resultant full quantification of &lt;i&gt;each and every possible base pair&lt;/i&gt; using standard geometric parameters.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://nar.oxfordjournals.org/content/31/17/5108/F2.large.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://nar.oxfordjournals.org/content/31/17/5108/F2.large.jpg" width="291" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;On a related issue, current versions of 3DNA (v1.5 and v2.0) output only the χ torsion angle without providing the&amp;nbsp;&lt;i&gt;anti/&lt;/i&gt;&lt;i&gt;syn&lt;/i&gt; classification. This defect, and many others, will &lt;i&gt;hopefully&lt;/i&gt; be rectified in future releases of 3DNA.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-8204125533152962257?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/8204125533152962257/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/10/chi-torsion-angle-characterizes.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8204125533152962257'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8204125533152962257'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/10/chi-torsion-angle-characterizes.html' title='Chi (χ) torsion angle characterizes base/sugar relative orientation'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_CEuD25cqpCE/TMLXKha_XKI/AAAAAAAAAFM/AyNavjlmSMY/s72-c/chi_anti_syn.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-3374819749768899133</id><published>2010-10-15T23:08:00.000-04:00</published><updated>2010-10-15T23:08:31.985-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Improving the design of existing code by refactoring</title><content type='html'>Another software engineering book I read recently is "&lt;a href="http://www.amazon.com/Refactoring-Improving-Design-Existing-Code/dp/0201485672"&gt;Refactoring: Improving the Design of Existing Code&lt;/a&gt;" by Martin Fowler. According to the author,&lt;br /&gt;&lt;blockquote&gt;Refactoring is the process of changing a software system in such a way that it does not alter the external behavior of the code yet improve its internal structure. It is a disciplined way to clean up code that minimize the chances of introducing bugs. In essence when you refractor you are &lt;i&gt;&lt;b&gt;improving the design of the code after it has been written&lt;/b&gt;&lt;/i&gt;.&lt;/blockquote&gt;&lt;br /&gt;The book is &lt;i&gt;practical&lt;/i&gt; in nature; it not just explains the principles but provides a detailed account of over 70 commonly used refactorings. As vividly explained by the author in the first paragraph of Chapter 1, "Refactoring, a First Example", "it is with examples that I [the author] can see what is going on." This approach fits my style perfectly: to really understand a topic, I always find a worked example far more effective than general principles. While the examples in the book are illustrated in Java, the basic ideas can be applied well to other object-oriented or even procedural languages (such as C).&lt;br /&gt;&lt;br /&gt;Over the years since I left Dr. Olson's laboratory at Rutgers, I have been maintaining and&amp;nbsp;continuously&amp;nbsp;refining 3DNA.&amp;nbsp;I have taken each user's question as an opportunity to fix bugs and improve its design, thus making the code&amp;nbsp;more robust and efficient.&amp;nbsp;The majority of my efforts, as I now realize, is "refactoring" existing 3DNA code to make it easier to maintain and extend. Reading through this book gives me the chance to put my practices into a broader context. I will surely take advantage of some refactoring examples from the book for further refinements of 3DNA.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-3374819749768899133?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/3374819749768899133/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/10/improving-design-of-existing-code-by.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3374819749768899133'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3374819749768899133'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/10/improving-design-of-existing-code-by.html' title='Improving the design of existing code by refactoring'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2363016461018988635</id><published>2010-10-13T22:08:00.004-04:00</published><updated>2010-10-13T23:32:28.128-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>NSMB editorial: "Go figure"</title><content type='html'>In the October 2010 issue of &lt;span style="font-style:italic;"&gt;Nature Structural &amp;amp; Molecular Biology&lt;/span&gt; (NSMB, Vol. 17, No. 10) there is another interesting one-page editorial, titled "&lt;a href="http://www.nature.com/nsmb/journal/v17/n10/full/nsmb1010-1163.html"&gt;Go figure&lt;/a&gt;", which provides tips on how to make a scientific figure that may worth 1000 words:&lt;br /&gt;&lt;blockquote&gt;A picture may be worth a thousand words, but ensuring that those words make sense is important, especially in the context of a scientific figure. Here are some tips for making your figures count.&lt;/blockquote&gt;A recap of the tips is given below; by and large, they all follow conventional wisdom:&lt;div&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-weight:bold;"&gt;&lt;span style="font-style:italic;"&gt;General considerations&lt;/span&gt;&lt;/span&gt;: Each figure should make just one point and be self-explanatory.&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight:bold;"&gt;See guidelines&lt;/span&gt;. "At all stages, the figures should be clear and legible."&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight:bold;"&gt;How many figures?&lt;/span&gt; The figures should complement the Results section, and be included only necessary.&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight:bold;"&gt;How many panels?&lt;/span&gt; Better only one; multiple panels "should be logically connected."&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight:bold;"&gt;What’s in a label? &lt;/span&gt;Keep it succinct, but make the figure self-explanatory.&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight:bold;"&gt;Getting colorful.&lt;/span&gt; Use color wisely and constantly.&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight:bold;"&gt;A legendary figure.&lt;/span&gt; The figure legend should concise and informative.&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight:bold;"&gt;A model paper. &lt;/span&gt;Better have a figure (at the end) of the final model that conveys "the big picture". Honestly, I do not quite get this point.&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;As pointed out by the author, "These are just a few guidelines and suggestions for handling figures." Overall, "simplicity rules in scientific figures, as in life." I guess no one would argue with such general advices. However, it would be even more helpful to illustrate such points with concrete examples (I know that seems to be beyond the scope of a one-page editorial).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2363016461018988635?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2363016461018988635/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/10/nsmb-editorial-go-figure.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2363016461018988635'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2363016461018988635'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/10/nsmb-editorial-go-figure.html' title='NSMB editorial: &quot;Go figure&quot;'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-3985360235448733380</id><published>2010-09-30T23:42:00.001-04:00</published><updated>2010-10-07T19:39:36.768-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Further details of the DNA story revealed by Crick's lost correspondence</title><content type='html'>In the September 30 issue of &lt;i&gt;Nature&lt;/i&gt; (Vol. 467, pp519-524), there is an interesting account of "&lt;a href="http://www.nature.com/nature/journal/v467/n7315/full/467519a.html"&gt;The lost correspondence of Francis Crick&lt;/a&gt;" by Gann &amp;amp; Witkowski. The newly found letters, mostly between Crick and Wilkins, unveil further background information on the exciting DNA story. As the authors put it, "Strained relationships and vivid personalities leap off the pages."&lt;br /&gt;&lt;br /&gt;I read Watson's "The Double Helix" book a while ago, and overall I am quite familiar with the DNA story. Still, I found this account fascinating: it provides a "CAST LIST" in "The search for the structure of DNA" with photos (p521); and it succinctly summarizes the relationships among the key players. In science, no other story shows more dramatically the collaborative and competitive nature among scientists working on similar projects.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://upload.wikimedia.org/wikipedia/commons/8/8a/ABDNAxrgpj.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://upload.wikimedia.org/wikipedia/commons/8/8a/ABDNAxrgpj.jpg" width="222" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Franklin’s X-ray diffraction photograph 51 of B-form DNA (see figure above, from Wikipedia), with its unambiguous evidence that DNA was helical, proved crucial for Watson and Crick to determine the structure of DNA. Indeed, the Watson-Crick DNA model corresponds to the B-form DNA, with its base-pairs in the middle, parallel to each other and perpendicular to the linear helical axis. &lt;br /&gt;&lt;br /&gt;From this &lt;i&gt;Nature&lt;/i&gt; account, however, I noticed for the first time a subtle detail: when B-form DNA photograph 51 was shown to Watson by Wilkins in early 1953, Franklin also already had the A-form DNA diffraction pattern. According to the authors,&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;It was the A-structure diffraction pattern that had led Franklin away from believing that DNA, in that form at least, was helical, despite her already having produced the most persuasive helical pictures of the B structure — including photograph 51. The crystalline DNA gave better quality diffraction data, more suited to her painstaking, quantita- tive approach, and so she focused on the A form during 1952. It was at this time that she and Gosling made a handwritten, black edged funeral card announcing the death of “DNA Helix(crystalline)”.&lt;/blockquote&gt;&lt;br /&gt;When Crick had the opportunity to look the A-form DNA diffraction picture, on 5 June 1953, he wrote (to Wilkins):&lt;br /&gt;&lt;blockquote&gt;This is the first time I have had an opportunity for a detailed study of the picture of Structure A, and I must say I am glad I didn’t see it earlier, as it would have worried me considerably.&lt;/blockquote&gt;&lt;br /&gt;I am reading "&lt;a href="http://www.amazon.com/Blink-Power-Thinking-Without/dp/0316010669/"&gt;Blink: The Power of Thinking Without Thinking&lt;/a&gt;", a book by Malcolm Gladwell. The above case serves as a vivid example.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-3985360235448733380?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/3985360235448733380/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/09/further-details-of-dna-story-revealed.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3985360235448733380'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3985360235448733380'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/09/further-details-of-dna-story-revealed.html' title='Further details of the DNA story revealed by Crick&apos;s lost correspondence'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-4296206099810733770</id><published>2010-09-19T22:22:00.001-04:00</published><updated>2010-09-19T22:23:54.414-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>New NDB entries have IDs start with the prefix NA?</title><content type='html'>Recently, while reading the article titled "&lt;a href="http://pubs.acs.org/doi/abs/10.1021/cg1009048"&gt;Designing Triple Helical Fragments: The Crystal Structure of the Undecamer d(TGGCCTTAAGG) Mimicking T.AT Base Triplets&lt;/a&gt;" by &lt;i&gt;Van Hecke&lt;/i&gt; (&lt;i&gt;Crystal Growth &amp;amp; Design&lt;/i&gt;), I noticed the following: &lt;br /&gt;&lt;blockquote&gt;The atomic coordinates and structure factors have been deposited in the Protein Data Bank&lt;sup&gt;41&lt;/sup&gt; and Nucleic Acid Database&lt;sup&gt;42&lt;/sup&gt; (PDB and NDB entry codes 3L1Q and NA0392, respectively). &lt;/blockquote&gt;The NDB id NA0392 reminded me of an email communication I had with Dr. Olson early this year when she told me of the id change of new NDB entries. Occupied with other issues, I did not pay much attention to this point until recently.&lt;br /&gt;&lt;br /&gt;Over the years, NDB has established itself prominently as "a repository of three-dimensional structural information about nucleic acids". Traditionally, an NDB id has some associated "meanings", with noticeably exceptions, as discussed one year ago in my blog post "&lt;a href="http://xiang-jun.blogspot.com/2009/07/pdb-id-vs-ndb-id.html"&gt;PDB id vs NDB id&lt;/a&gt;". Thus, it is not surprising that NDB decided to make changes in naming new ids. However, I cannot find any announcement on the id change in the NDB website; a Google search on "NDB id" did not uncover anything new either. Luckily, NDB provides search by release date ("Released Since"). After a few tries, I traced that the new id policy began to be implemented from around March 2010. &lt;br /&gt;&lt;br /&gt;The new NDB id convention appears to be &lt;b&gt;NA&lt;/b&gt; (presumably standing for for Nucleic Acid) followed by 4 digits. As more entries available, it is not that hard to imagine that the number of digits must be expanded to 5, 6, or even more. Naturally, PDB id, with a fixed 4-character length (up to now), is (far) more consistent than the NDB id is.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-4296206099810733770?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/4296206099810733770/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/09/new-ndb-entries-have-ids-start-with.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4296206099810733770'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4296206099810733770'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/09/new-ndb-entries-have-ids-start-with.html' title='New NDB entries have IDs start with the prefix NA?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2564370758345963461</id><published>2010-09-12T23:32:00.002-04:00</published><updated>2010-09-13T08:27:26.482-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>"Code Complete", a practical handbook of software construction</title><content type='html'>Over the summer, I read quite a few books on C programming, and more generally on &lt;i&gt;software construction&lt;/i&gt;. Among them, the book titled &lt;a href="http://www.amazon.com/Code-Complete-Practical-Handbook-Construction/dp/0735619670"&gt;"Code Complete" (2nd edition) by Steve McConnell&lt;/a&gt; is the most comprehensive: with over 900 pages, it certainly serves as "a practical handbook of software construction".&lt;br /&gt;&lt;br /&gt;I have been writting scientific software applications for over twenty years, using a variety of programming languages. Gradually, writing code &lt;i&gt;per se&lt;/i&gt; is taking less time; nowadays by far the most time-consuming parts have increasingly become (1) to understand a scientific topic thoroughly in order to implement it in new code, and (2) to maintain/refine/adapt existing codebase to meet the needs of its user community (e.g., 3DNA). It is for the benefit of the later that I've found the "Code Complete" book very helpful.&lt;br /&gt;&lt;br /&gt;As an example, following the book's recommendations (1) "Use boolean variables to simplify complicated tests" (pp301-2) and (2) "Simplify complicated tests with boolean function calls" (p359), I've easily improved the readability of some parts of my code.&lt;br /&gt;&lt;br /&gt;Overall, the book is well-written and full of practical advices; I enjoyed reading it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2564370758345963461?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2564370758345963461/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/09/code-complete-practical-handbook-of.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2564370758345963461'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2564370758345963461'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/09/code-complete-practical-handbook-of.html' title='&quot;Code Complete&quot;, a practical handbook of software construction'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-3910276760520407136</id><published>2010-09-05T22:20:00.009-04:00</published><updated>2010-09-07T17:08:22.758-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>Identification of C-H...N/O H-honds using 3DNA</title><content type='html'>Recently, I came across an interesting article by Kiliszek &lt;span style="font-style: italic;"&gt;et al&lt;/span&gt;., titled "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20702420"&gt;Atomic resolution structure of CAG RNA repeats: structural insights and implications for the trinucleotide repeat expansion diseases&lt;/a&gt;". In addition to its biological implications, this paper uses 3DNA to deal with non-canonical-base-pair-containing helical structures in a sensible way:&lt;br /&gt;&lt;blockquote&gt;The helical parameters were calculated using 3DNA (29). Sequence-independent measures were used, based on vectors connecting the C1' atoms of the paired residues, to avoid computational artefacts arising from non-canonical base pairing.&lt;/blockquote&gt;Another significant point, which is the focus of this post, is the observation that "All the adenosines are in the anti-conformation and &lt;span style="font-weight: bold;"&gt;the &lt;span style="font-style: italic;"&gt;only&lt;/span&gt; interaction within each A-A pair is a &lt;span style="font-style: italic;"&gt;single C2-H2...N1 hydrogen bond&lt;/span&gt;&lt;/span&gt;." (Figure below) Given the 0.95 Å ultrahigh resolution of structure &lt;a href="http://www.rcsb.org/pdb/explore/explore.do?structureId=3NJ6"&gt;3nj6&lt;/a&gt;/&lt;a href="http://ndbserver.rutgers.edu/atlas/xray/structures/N/na0608/na0608.html"&gt;na0608&lt;/a&gt;, it is likely that this type of A-A is real.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://2.bp.blogspot.com/_CEuD25cqpCE/TIVpatcMrRI/AAAAAAAAAEc/R6lE2kj-_pA/s1600/3nj6_A4_A7.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img alt="" border="0" id="BLOGGER_PHOTO_ID_5513929226380422418" src="http://2.bp.blogspot.com/_CEuD25cqpCE/TIVpatcMrRI/AAAAAAAAAEc/R6lE2kj-_pA/s400/3nj6_A4_A7.png" style="cursor: pointer; display: block; height: 161px; margin: 0px auto 10px; text-align: center; width: 400px;" /&gt;&lt;/a&gt;&lt;br /&gt;The &lt;span style="font-family: 'courier new';"&gt;find_pair&lt;/span&gt; program in the currently distributed versions of 3DNA (v2.0 and before), however, does &lt;span style="font-weight: bold;"&gt;not&lt;/span&gt; identify this A-A pair (thus the corresponding NDB list of "&lt;a href="http://ndbserver.rutgers.edu/atlas/xray/structures/N/na0608/NA0608-bpsp.html"&gt;Base Pair Step Parameters&lt;/a&gt;" is incomplete) for the following two reasons:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;&lt;span style="font-style: italic;"&gt;No&lt;/span&gt; H-bond exists between N/O base atoms – currently a requirement for a base-pair.&lt;/li&gt;&lt;li&gt;By default, only N/O atoms are used in defining H-bonds (see tag &lt;span style="font-family: 'courier new';"&gt;hb_atoms&lt;/span&gt; in file &lt;span style="font-family: 'courier new';"&gt;"misc_3dna.par"&lt;/span&gt;). Nevertheless, by adding C as a possible atom in forming H-bond, and manually editing &lt;span style="font-family: 'courier new';"&gt;find_pair&lt;/span&gt; generated input file to &lt;span style="font-family: 'courier new';"&gt;analyze&lt;/span&gt;, 3DNA structural parameters can be calculated as usual.&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;I have updated  &lt;span style="font-family: 'courier new';"&gt;find_pair&lt;/span&gt; in 3DNA to identify such C-H...N/O H-bond automatically (see below for entry 3nj6). Upon further refinements and validations, future releases of 3DNA will have this functionality available.&lt;br /&gt;&lt;br /&gt;&lt;pre style="background: #eee; font-size: 10px;"&gt;    1   95  #    1 | ...1&amp;gt;A:...1_:[..G]G-----C[..C]:..10_:A&amp;lt;...2&lt;br /&gt;    2   94  #    2 | ...1&amp;gt;A:...2_:[..G]G-----C[..C]:...9_:A&amp;lt;...2&lt;br /&gt;    3   93  #    3 | ...1&amp;gt;A:...3_:[..C]C-----G[..G]:...8_:A&amp;lt;...2&lt;br /&gt;    4   92  #    4 | ...1&amp;gt;A:...4_:[..A]A-**--A[..A]:...7_:A&amp;lt;...2&lt;br /&gt;    5   91  #    5 | ...1&amp;gt;A:...5_:[..G]G-----C[..C]:...6_:A&amp;lt;...2&lt;br /&gt;    6   90  #    6 | ...1&amp;gt;A:...6_:[..C]C-----G[..G]:...5_:A&amp;lt;...2&lt;br /&gt;    7   89  #    7 | ...1&amp;gt;A:...7_:[..A]A-**--A[..A]:...4_:A&amp;lt;...2&lt;br /&gt;    8   88  #    8 | ...1&amp;gt;A:...8_:[..G]G-----C[..C]:...3_:A&amp;lt;...2&lt;br /&gt;    9   87  #    9 | ...1&amp;gt;A:...9_:[..C]C-----G[..G]:...2_:A&amp;lt;...2&lt;br /&gt;   10   86  #   10 | ...1&amp;gt;A:..10_:[..C]C-----G[..G]:...1_:A&amp;lt;...2&lt;br /&gt;##### Criteria: 4.00  0.00  15.00  2.50  65.00  4.50  7.50   [ O N C]&lt;br /&gt;##### 2 non-Watson-Crick base-pairs, and 1 helix (0 isolated bps)&lt;br /&gt;##### Helix #1 (10): 1 - 10&lt;br /&gt;&lt;/pre&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-3910276760520407136?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/3910276760520407136/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/09/identification-of-c-hno-h-honds-using.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3910276760520407136'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3910276760520407136'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/09/identification-of-c-hno-h-honds-using.html' title='Identification of C-H...N/O H-honds using 3DNA'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_CEuD25cqpCE/TIVpatcMrRI/AAAAAAAAAEc/R6lE2kj-_pA/s72-c/3nj6_A4_A7.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-7599894979206990406</id><published>2010-09-05T21:19:00.001-04:00</published><updated>2010-09-05T21:51:36.257-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>The Jmol paper: a paradigm shift in crystallographic visualization</title><content type='html'>From the Jmol mailing list, I noticed Prof. Robert Hanson's announcement titled "&lt;a href="http://sourceforge.net/mailarchive/message.php?msg_name=AANLkTinpyA45%3D2%3Dbz%2BLx06brPc44b1N_3rCKRNi9CCkz%40mail.gmail.com"&gt;Jmol and crystallography: the paper&lt;/a&gt;". Finally, an "official" &lt;a href="http://journals.iucr.org/j/issues/2010/05/02/kk5066/index.html"&gt;paper has been published on Jmol&lt;/a&gt;, an increasing popular molecular visualization tool.&lt;br /&gt;&lt;br /&gt;While the paper is categorized under "teaching and education" and summarized as below:&lt;br /&gt;&lt;blockquote&gt;Recent advances in molecular and crystallographic visualization methods are allowing instructors unprecedented opportunities to enhance student learning using virtual models within a familiar web-browser context. In step with these advances, the latest versions of the Jmol molecular visualization applet offer capabilities that hold potential for revolutionizing the way students learn about symmetry, uncertainty and the overall enterprise of molecular structure determination.&lt;/blockquote&gt;Jmol is certainly as useful for research as other commonly used molecular visualization tools, such as PyMOL and RasMol. Specifically, Jmol applets have become dominated in web-based applications/resources (including RCSB PDB). In fact, the HTML version of the paper itself makes extensive use of Jmol applet for interactive manipulation of over 20 figures.&lt;br /&gt;&lt;br /&gt;Over the past couple of years, I have witnessed the many new features added to Jmol, driven by an active and friendly user community. I have been particularly impressed by Prof. Hanson's quick and to-the-point responses to users' requests for new features and bug reports, including to my request for Jmol's "&lt;a href="http://www.mail-archive.com/jmol-users@lists.sourceforge.net/msg11733.html"&gt;Support of 'alchemy' format for rectangular schematic base-pair geometry&lt;/a&gt;". Overall, the paper presents a nice overview of some of Jmol's new functionality, and could become highly cited.&lt;br /&gt;&lt;br /&gt;Reading through the paper, I was just a bit surprised that PyMOL is &lt;b&gt;not&lt;/b&gt; mentioned. Nowadays, Jmol and PyMOL are apparently the top two molecular graphics software tools, with common and complementary functionality.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-7599894979206990406?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/7599894979206990406/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/09/jmol-paper-paradigm-shift-in.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7599894979206990406'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7599894979206990406'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/09/jmol-paper-paradigm-shift-in.html' title='The Jmol paper: a paradigm shift in crystallographic visualization'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-580216293767165702</id><published>2010-08-21T22:50:00.008-04:00</published><updated>2010-08-23T11:37:57.434-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>Calculation of the chi (χ) torsion angle of pseudouridines in RNA structures</title><content type='html'>In RNA structures, chi (χ) is defined as the torsion angle about the N-&lt;a href="http://en.wikipedia.org/wiki/Glycosidic_bond"&gt;glycosidic bond&lt;/a&gt; between the ribose sugar and the canonical A/C/G/U bases or their modified variants. Specifically, for pyrimidines (C and U), χ is defined by O4'-C1'-N1-C2; and for purines (A and G) by O4'-C1'-N9-C4. See the following figure from the IMB Jena Image Library:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.imb-jena.de/Piet/help/img.backbone/backbone-torsion.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 472px; height: 362px;" src="http://www.imb-jena.de/Piet/help/img.backbone/backbone-torsion.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Pseudouridine (5-ribosyluracil, PSU) was the &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/10902565"&gt;first identified modified nucleoside in RNA and is the most abundant&lt;/a&gt;. PSU is unique in that it has a C-glycosidic bond instead of the N-glycosidic bond common to all other nucleosides, canonical or modified. It thus poses a problem as to how to calculate the χ torsion angle: should it be O4'-C1'-C5-C4 reflecting the actual glycosidic bond connection, or should the conventional definition O4'-C1'-N1-C2 still be applied literally? As a concrete example, the figure below shows the (slightly) different numerical values, as given by the two definitions, for PSU 6 on chain A of PDB/NDB entry &lt;a href="http://ndbserver.rutgers.edu/atlas/xray/structures/A/ar0093/ar0093.html"&gt;3cgp/ar0093&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_CEuD25cqpCE/THHrzY7FqRI/AAAAAAAAAEE/MdvFcAKHH1E/s1600/3cgp_psu6.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 281px; height: 336px;" src="http://2.bp.blogspot.com/_CEuD25cqpCE/THHrzY7FqRI/AAAAAAAAAEE/MdvFcAKHH1E/s400/3cgp_psu6.png" alt="" id="BLOGGER_PHOTO_ID_5508443087346968850" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Needless to say, definition of the χ torsion angle of PSU in RNA structures is a very subtle/minor point, and I am not aware of any discussion on this issue in literature (I'd appreciate your sharing of related information in the comment). In 3DNA, PSU is identified explicitly, and χ is defined by O4'-C1'-C5-C4. In NDB and a couple of other tools (I've played with), χ for PSU is defined by O4'-C1'-N1-C2. Again using &lt;a href="http://ndbserver.rutgers.edu/atlas/xray/structures/A/ar0093/ar0093.html"&gt;3cgp/ar0093&lt;/a&gt; (figure above) as an example, 3DNA gives -162.7°, whilst NDB gives -163.9°. Hopefully, this post will help clarify a confusion for those who care about such little details.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-580216293767165702?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/580216293767165702/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/08/calculation-of-chi-torsion-angle-of.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/580216293767165702'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/580216293767165702'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/08/calculation-of-chi-torsion-angle-of.html' title='Calculation of the chi (χ) torsion angle of pseudouridines in RNA structures'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_CEuD25cqpCE/THHrzY7FqRI/AAAAAAAAAEE/MdvFcAKHH1E/s72-c/3cgp_psu6.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2527119631373696939</id><published>2010-08-14T22:13:00.005-04:00</published><updated>2010-08-15T13:55:31.657-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>PDB format, how many variants are there?</title><content type='html'>In the field of structural biology of macromolecules (proteins and nucleic acids), the &lt;a href="http://www.wwpdb.org/docs.html"&gt;PDB format&lt;/a&gt; is still the primary one, even though it has shown its age/limitations, and mmCIF and PDBML have been introduced as alternatives.&lt;br /&gt;&lt;br /&gt;In 3DNA, PDB is currently the only accepted format for the analysis routines (&lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;analyze/cehs&lt;/span&gt;), the primary format for rebuilt structures (&lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;rebuild/fiber&lt;/span&gt;), and various other output structure files (e.g., &lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;bestpairs.pdb&lt;/span&gt; etc.). By and large, 3DNA follows &lt;a href="http://www.wwpdb.org/docs.html"&gt;the PDB format documented at the RCSB website&lt;/a&gt;. In my maintaining and supporting of 3DNA and the forum over the years , I have come across other variants of the PDB format. Software packages customize PDB in subtle, sometimes substantial, ways that cause problems with 3DNA. Some examples follow:&lt;ul&gt;&lt;li&gt;Recently, a user tried to analyze an RNA duplex using w3DNA, but found that "&lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=1&amp;amp;t=175"&gt;the server seems not to recognize the file&lt;/a&gt;". It turned out that the so-called PDB format does not have residue name/number and chain id aligned in columns properly. For example, among other issues, the chain id in the problematic PDB is not at column #22 as specified by the standard.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;A couple of months back, another user asked how to handle &lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=1&amp;amp;t=167"&gt;AMBER-introduced ribonucleotide abreviations (RA/RC/RG/RU)&lt;/a&gt;. Compared to the above "PDB" variant, the AMBER PDB is well-behavored, and can be easily recognized by 3DNA after introducing corresponding one letter matches in file &lt;span style="font-weight:bold;"&gt;baselist.dat&lt;/span&gt;. Interestingly, a couple of years ago, DA/DC/DG/DT was introduced into the PDB format to distinguish deoxyribonucleotide (DNA) from ribonucleotide A/C/G/U (RNA). The AMBER convention, by using RA/RC/RG/RU for RNA, appears to be more symmetric.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;From my (limited) experience in helping a 3DNA user several ago, I know that CHARMM uses three-letter residue names for DNA (e.g., ADE/CYT/GUA/THY). There is actually a short Perl utility program (&lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;x3dna2charmm_pdb&lt;/span&gt;) in 3DNA for converting 3DNA generated PDB file to that recognized by CHARMM.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;As noted in another post, "&lt;a href="http://xiang-jun.blogspot.com/2009/05/pdb-atom-coordinates-record.html"&gt;PDB ATOM coordinates record&lt;/a&gt;", &lt;a href="http://openbabel.org/wiki/Main_Page"&gt;Babel/Open Babel&lt;/a&gt; converted PDB files (mostly of small molecules) do not follow atom naming conventions.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;As widely used and standard as PDB format is, there are still many variants in common use. Thus, when you meet a problem related to a PDB file, it helps to check if it conforms to &lt;a href="http://www.wwpdb.org/docs.html"&gt;the standard&lt;/a&gt;. With 3DNA, I've always tried to incorporate as many PDB variants as possible, so users can run 3DNA on them transparently.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2527119631373696939?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2527119631373696939/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/08/pdb-format-how-many-variants-are-there.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2527119631373696939'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2527119631373696939'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/08/pdb-format-how-many-variants-are-there.html' title='PDB format, how many variants are there?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5017536651132258409</id><published>2010-08-07T19:34:00.008-04:00</published><updated>2011-03-20T11:04:35.603-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>Calculation of DNA bending angle</title><content type='html'>Once in a while, I receive the question (via email or on the 3DNA forum) on how to calculate &lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=1&amp;amp;t=174"&gt;DNA bending angle&lt;/a&gt;, mostly associated with DNA-protein complexes. This question certainly qualifies as an FAQ; Indeed, I did consider writing an entry on it in the 3DNA website. However, I've hesitated to do so because the topic is more subtle and complicated that it appears.&lt;br /&gt;&lt;br /&gt;On its face, an angle is defined by two vectors; let's call them &lt;span style="font-weight:bold;"&gt;a&lt;/span&gt; and &lt;span style="font-weight:bold;"&gt;b&lt;/span&gt;, and if each is normalized, then the angle (in degrees) between them is: &lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;acos(dot(a, b)) * 180/pi&lt;/span&gt;. Geometrically, after moving the tails of the vectors into the same position (e.g., origin), the heads would normally define a plane, unless &lt;span style="font-weight:bold;"&gt;a&lt;/span&gt; and &lt;span style="font-weight:bold;"&gt;b&lt;/span&gt; are strictly parallel (0°) or anti-parallel (180°).&lt;br /&gt;&lt;br /&gt;DNA structures are three-dimensional, normally far more complicated than a single number can quantify. The concept of DNA bending angle, as I understand it, is only applicable to DNA structures with two relatively straight fragments (as in CAP-DNA complexes). Under such situations,  one can fit a least-squares (ls) linear helical axis to each of the two fragments, and calculate the angle between them. Towards this end, 3DNA outputs the following section when it judges that the input structure is not strongly curved. Using 355d/bdl084, which is distributed with 3DNA, as an example:&lt;br /&gt;&lt;pre style="background-color: #eee; border-width: 1px;"&gt;Global linear helical axis defined by equivalent C1' and RN9/YN1 atom pairs                                    &lt;br /&gt;Deviation from regular linear helix: 3.30(0.52)&lt;br /&gt;Helix:    -0.127  -0.275  -0.953&lt;br /&gt;HETATM 9998  XS    X X 999      17.536  25.713  25.665&lt;br /&gt;HETATM 9999  XE    X X 999      12.911  15.677  -9.080&lt;br /&gt;Average and standard deviation of helix radius:&lt;br /&gt;P: 9.42(0.82), O4': 6.37(0.85),  C1': 5.85(0.86)&lt;/pre&gt;Where the &lt;span style="font-weight:bold;"&gt;Helix:&lt;/span&gt; line gives the normalized vector along the "best-fit" helical axis. The two HETATM records provides the two end points on the helix, and they are directly related to the &lt;span style="font-weight:bold;"&gt;Helix:&lt;/span&gt; line by a simple equation. Following the above example,  we have (Octave/Matlab code):&lt;br /&gt;&lt;code style="background-color: #eee; border-width: 1px;"&gt;&lt;br /&gt;XE = [12.911  15.677  -9.080];&lt;br /&gt;XS = [17.536  25.713  25.665];&lt;br /&gt;&lt;br /&gt;dd = XE - XS&lt;br /&gt;%   -4.6250  -10.0360  -34.7450&lt;br /&gt;&lt;br /&gt;Helix = dd / norm(dd)&lt;br /&gt;%  -0.12685  -0.27526  -0.95296  ==&gt; [-0.127  -0.275  -0.953]&lt;br /&gt;&lt;/code&gt;&lt;br /&gt;With the two HETATM records,  one can easily add them into the original PDB file to display the helical axis using a molecular graphics programs (e.g., RasMol, Jmol, PyMol). Moreover, the two helix vectors can be used to reorient the original PDB structure into a view so that one helical fragment lies along the x-axis, and the other in the xy-plane. As documented in detail in recipes #4 on "Automatic identification of double-helical regions in a DNA–RNA junction" (&lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=3&amp;amp;t=97"&gt;2008 3DNA &lt;i&gt;Nature Protocols&lt;/i&gt; paper&lt;/a&gt;), "The chosen view allows for &lt;span style="font-style:italic;"&gt;easy visualization and protractor measurement&lt;/span&gt; of the overall bending angle between the two relatively straight helices."&lt;br /&gt;&lt;br /&gt;The following points are well worth noting:&lt;ul&gt;&lt;li&gt;The ls fitting procedure used in 3DNA follows SCHNArP, which was based on the algorithm in the well-known NewHelix program, maintained by Dr. Richard Dickerson unto the 1990s. While fitting a global linear helical axis to strongly curved DNA structures clearly makes no much sense with derived parameters (NewHelix itself has been replaced by FreeHelix, also from Dickerson), I do believe it is meaningful to fit a linear helix with relatively straight DNA fragment. That's why I have kept this functionality in SCHNArP and 3DNA; 3DNA bending angle calculation serves as an example illustrating the point &lt;span class="Apple-style-span"   style="  line-height: 21px; font-family:sans-serif;font-size:11.6667px;"&gt;–&lt;/span&gt; it provides an "intuitive" way for biologists to understand how the bending angle is calculated; it can actually be measured directly.&lt;/li&gt;&lt;li&gt;Instead of directly ls-fitting a linear helical axis with 3DNA, one can alternatively superimpose a regular fiber model into the DNA fragment, and then derive the straight helical axis from the fitted coordinates. The two approaches normally gives slightly different numerical values, as would be expected.&lt;/li&gt;&lt;li&gt;Overall, bending angle is (at most) an approximate measure of DNA curvature. In my opinion, the concept is only applicable for comparing a set of structures, each with two relatively straight fragments. Even in such cases, the relative spatial relationship between two helical fragments are more complicated than a simple (bending) angle could quantify. Specially, it is not meaningful to make a selling point in slight differences of DNA bending angles.&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5017536651132258409?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5017536651132258409/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/08/calculation-of-dna-bending-angle.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5017536651132258409'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5017536651132258409'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/08/calculation-of-dna-bending-angle.html' title='Calculation of DNA bending angle'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-4060156990067054094</id><published>2010-08-06T21:08:00.004-04:00</published><updated>2010-08-07T19:33:43.361-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Genomic code for nucleosome positioning?</title><content type='html'>Over the past few years, the research on nucleosome positioning has been a hot topic: Segal &lt;i&gt;et al.&lt;/i&gt;, through a series of high-profile publications, have proposed that "the intrinsic nucleosome sequence preferences contribute substantially to nucleosome organization and chromatin function &lt;span style="font-style:italic;"&gt;in vivo&lt;/span&gt;" (i.e., there exists a "genomic code") and developed bioinformatics tools to predict patterns of nucleosome positions based on genomic DNA sequence. However, as is clear from the title of a paper by &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/19620965"&gt;Zhang &lt;span style="font-style:italic;"&gt;et al.&lt;/span&gt;&lt;/a&gt;,  "Intrinsic histone-DNA interactions are not the major determinant of nucleosome positions &lt;span style="font-style:italic;"&gt;in vivo&lt;/span&gt;", it is highly debatable if the nucleosome sequence preferences is strong enough to be called a "code", in the canonical sense as for the genetic code.&lt;br /&gt;&lt;br /&gt;While I am (generally) aware of the controversy about the "genomic code", my understanding of the issue has become much clearer after reading the correspondences between both sides published in the &lt;a href="http://www.nature.com/nsmb/journal/v17/n8/index.html"&gt;&lt;span style="font-weight:bold;"&gt;17(8)&lt;/span&gt;, August 2010&lt;/a&gt; issue of Nature Structural &amp;amp; Molecular Biology [NSMB]:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;"Nucleosome sequence preferences influence in vivo nucleosome organization" (pp918 - 920) by Kaplan … and Segal&lt;/li&gt;&lt;li&gt;"Evidence against a genomic code for nucleosome positioning" &lt;span style="font-style:italic;"&gt;Reply to “Nucleosome sequence preferences influence in vivo nucleosome organization”&lt;/span&gt; (pp920 - 923) by Zhang … and Struhl&lt;/li&gt;&lt;li&gt;"A preoccupied position on nucleosomes" (p923) by Pugh&lt;/li&gt;&lt;/ul&gt;Reading carefully through the text, it is clear how important the little details could be: 20-bp vs 40-bp windows? How significant the preference should be to be claimed as a "code"? Pugh clarifies another source of confusion, in the terminology of nucleosome ‘occupancy’ vs ‘positioning’. In my understanding, the dispute on such details itself speaks strongly against the existence of "genomic code".&lt;br /&gt;&lt;br /&gt;It should be noted that early work by &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/3912515"&gt;Drew &amp;amp; Travers (JMB85)&lt;/a&gt; and &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/3806678"&gt;Satchwell et al. (JMB86)&lt;/a&gt; provided evidence that the &lt;span style="font-style:italic;"&gt;&lt;span style="font-weight:bold;"&gt;rotational&lt;/span&gt;&lt;/span&gt; positioning of DNA about the histone octamer is regulated by DNA sequence-dependent bending preferences. &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/7816643"&gt;Goodsell &amp;amp; Dickerson&lt;/a&gt; quantified the Satchwell trinucleotide model by assigning a set of roll angles to predict DNA bending from base sequence. This bending model, along with five others, was implemented by &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/9356256"&gt;SCHNArP&lt;/a&gt; for building approximate 3D DNA structures (with fixed values for slide, twist etc other parameters). However, this functionality is dropped out from 3DNA, simply because of the arbitrariness/subtleties in picking up the parameters.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-4060156990067054094?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/4060156990067054094/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/08/genomic-code-for-nucleosome-positioning.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4060156990067054094'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4060156990067054094'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/08/genomic-code-for-nucleosome-positioning.html' title='Genomic code for nucleosome positioning?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-7210985140170271621</id><published>2010-08-01T21:22:00.007-04:00</published><updated>2010-08-01T23:51:47.809-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>Principle axes of inertia as a mean to reorient a molecule</title><content type='html'>In structural biology, it is often helpful to reorient a molecule in a specific view for easy visualization and comparison. With no &lt;span style="font-style: italic;"&gt;a priori&lt;/span&gt;  knowledge of a feature, the (objective) principle axes of inertia can be used as a mean to automatically reset a structure into its most extended orientation (and two other orthogonal views). In 3DNA, &lt;span style="font-family:courier new;"&gt;rotate_mol&lt;/span&gt; has such a functionality which is explored by &lt;span style="font-family:courier new;"&gt;blocview&lt;/span&gt; to generate the &lt;a href="http://xiang-jun.blogspot.com/2009/06/blocview-simple-effective-visualization.html"&gt;characteristic, simplified molecular images&lt;/a&gt; adopted by the RCSB PDB and NDB.&lt;br /&gt;&lt;br /&gt;Recently, the thread (initiated by Dr. Pascal Auffinger) at the 3DNA forum titled "&lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=1&amp;amp;t=170"&gt;rotate_mol ambiguous orientations&lt;/a&gt;"   has led me to think the topic deeper. As always, the issue is best illustrated with an example. The file &lt;a href="http://3dna.rutgers.edu:8080/data/355d_A6_T19.pdb"&gt;355d_A6_T19.pdb&lt;/a&gt; contains the coordinates of the base pair A6-T19 of PDB entry 355d. Let xyz be a variable in Octave (Matlab) containing the coordinates of all atoms in file 355d_A6_T19.pdb, the following m-file provides the detailed procedure:&lt;br /&gt;&lt;pre style="background: none repeat scroll 0% 0% rgb(238, 238, 238);"&gt;xyz = [&lt;br /&gt;   13.954  12.080  15.071&lt;br /&gt;   13.092  11.246  15.953&lt;br /&gt;   14.769  11.453  14.020&lt;br /&gt;   13.056  13.202  14.406&lt;br /&gt;   12.184  13.965  15.222&lt;br /&gt;   11.308  14.846  14.376&lt;br /&gt;   12.033  15.991  13.896&lt;br /&gt;   10.654  14.188  13.168&lt;br /&gt;    9.262  14.480  13.256&lt;br /&gt;   11.373  14.820  11.974&lt;br /&gt;   11.810  16.185  12.511&lt;br /&gt;   13.051  16.742  11.953&lt;br /&gt;   14.253  16.097  11.802&lt;br /&gt;   15.197  16.853  11.305&lt;br /&gt;   14.585  18.084  11.106&lt;br /&gt;   15.066  19.322  10.605&lt;br /&gt;   16.329  19.520  10.209&lt;br /&gt;   14.198  20.357  10.535&lt;br /&gt;   12.938  20.162  10.946&lt;br /&gt;   12.370  19.048  11.444&lt;br /&gt;   13.259  18.034  11.497&lt;br /&gt;   14.865  29.619   5.710&lt;br /&gt;   14.172  30.844   5.281&lt;br /&gt;   16.107  29.695   6.515&lt;br /&gt;   13.846  28.701   6.510&lt;br /&gt;   12.684  28.174   5.864&lt;br /&gt;   11.909  27.311   6.826&lt;br /&gt;   12.626  26.090   7.129&lt;br /&gt;   11.584  27.960   8.173&lt;br /&gt;   10.189  27.833   8.395&lt;br /&gt;   12.409  27.166   9.184&lt;br /&gt;   12.544  25.808   8.521&lt;br /&gt;   13.716  24.979   8.867&lt;br /&gt;   13.496  23.688   9.310&lt;br /&gt;   12.381  23.202   9.467&lt;br /&gt;   14.631  22.971   9.551&lt;br /&gt;   15.943  23.388   9.393&lt;br /&gt;   16.852  22.613   9.632&lt;br /&gt;   16.107  24.759   8.938&lt;br /&gt;   17.488  25.309   8.745&lt;br /&gt;   14.995  25.475   8.711 ]&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(255, 0, 0);"&gt;cov_mtx = cov(xyz)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;% cov_mtx =&lt;br /&gt;%     3.4094    2.3860   -1.4222&lt;br /&gt;%     2.3860   33.7089  -15.7175&lt;br /&gt;%    -1.4222  -15.7175    7.8786&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(255, 0, 0);"&gt;[v, lambda] = eig(cov_mtx)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;% v =&lt;br /&gt;%    0.094830   0.992835  -0.072709&lt;br /&gt;%    0.419520  -0.106092  -0.901525&lt;br /&gt;%    0.902779  -0.054989   0.426575&lt;br /&gt;&lt;br /&gt;% lambda =&lt;br /&gt;%     0.42535          0          0&lt;br /&gt;%           0    3.23319          0&lt;br /&gt;%           0          0   41.33836&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(255, 0, 0);"&gt;x = v(:, 2);&lt;/span&gt;&lt;br /&gt;&lt;span style="color: rgb(255, 0, 0);"&gt;y = v(:, 3);&lt;/span&gt;&lt;br /&gt;&lt;span style="color: rgb(255, 0, 0);"&gt;z = cross(x, y);&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;% z =&lt;br /&gt;%   -0.094830&lt;br /&gt;%   -0.419520&lt;br /&gt;%   -0.902779&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(255, 0, 0);"&gt;num = size(xyz, 1);&lt;/span&gt;&lt;br /&gt;&lt;span style="color: rgb(255, 0, 0);"&gt;xyz2 = (xyz - ones(num, 1) * mean(xyz)) * [x y z]&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;% xyz2 =&lt;br /&gt;%     1.148136   10.032849   -0.514731&lt;br /&gt;%     0.332293   11.223635   -0.879359&lt;br /&gt;%     2.081609   10.090517    0.619843&lt;br /&gt;%    .................................&lt;br /&gt;%     2.277813   -4.170312   -0.501254&lt;br /&gt;%     3.601180   -4.848890   -0.688714&lt;br /&gt;%     1.110301   -4.831784   -0.491249&lt;/pre&gt;Among &lt;span style="font-style: italic;"&gt;other subtitles&lt;/span&gt;, note that:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;lambda 3 has the largest value (41.33836), i.e., with the biggest variance, and its corresponding eigenvector is set as the vertical &lt;span style="font-style: italic;"&gt;y&lt;/span&gt;-axis.&lt;/li&gt;&lt;li&gt;lambda 2 shows intermediate variation (3.23319), and its corresponding eigenvector is set as the horizontal &lt;span style="font-style: italic;"&gt;x&lt;/span&gt;-axis.&lt;/li&gt;&lt;li&gt;The &lt;span style="font-style: italic;"&gt;z&lt;/span&gt;-axis is derived as the cross-product of &lt;span style="font-style: italic;"&gt;x&lt;/span&gt;- and &lt;span style="font-style: italic;"&gt;y&lt;/span&gt;-axes to ensure a right-handed frame. Note that it is the &lt;span style="font-weight: bold;"&gt;opposite&lt;/span&gt; of eigenvector 1.&lt;/li&gt;&lt;li&gt;The original xyz coordinates are moved to its geometric center [&lt;span style="font-weight: bold;"&gt;xyz - ones(num, 1) * mean(xyz)&lt;/span&gt;], and then projected onto the preferred frame to get the transformed coordinates (&lt;span style="font-weight: bold;"&gt;xyz2&lt;/span&gt;) with the molecule in the most extended view (vertical then horizontal).&lt;/li&gt;&lt;/ol&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-7210985140170271621?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/7210985140170271621/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/08/principle-axes-of-inertia-as-mean-to.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7210985140170271621'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7210985140170271621'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/08/principle-axes-of-inertia-as-mean-to.html' title='Principle axes of inertia as a mean to reorient a molecule'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-798644565356726847</id><published>2010-07-24T22:06:00.006-04:00</published><updated>2010-07-25T00:24:46.127-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>3DNA for the analysis of molecular dynamics simulations</title><content type='html'>To my surprise and delight, 3DNA has been increasingly used in the analysis of trajectories of molecular dynamics (MD) simulations of nucleic acid structures.  While I have heard of CHARM/AMBER for a long time and know of the basic principles of MD simulations, I have never performed any simulations (&lt;span style="font-weight: bold;"&gt;yet&lt;/span&gt;). Over the years, I have been mostly interested in methods development and algorithms implementation for better understanding of nucleic acids structures, and have always used NDB/PDB entries for my tests/applications. Thus &lt;a href="http://xiang-jun.blogspot.com/2009/08/curves-vs-3dna.html"&gt;unlike Curves+, 3DNA lacks direct support for the analysis of MD trajectories&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;From the MD-related questions I have received over emails and on the 3DNA forum, I sense that users do not have that much of a problem or inconvenience in applying 3DNA for the task. There is one issue that does stand out, however, i.e.,  the use of &lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;find_pair&lt;/span&gt; for each and every structure in the trajectories to prepare an input file that contains base-pairing information for &lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;analyze&lt;/span&gt; to calculate various parameters. As an example, a recent article by Ricci &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt;,  "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20614923"&gt;Molecular Dynamics of DNA: Comparison of Force Fields and Terminal Nucleotide Definitions&lt;/a&gt;" (&lt;span style="font-style: italic;"&gt;J Phys Chem&lt;/span&gt; B. 2010 Jul 8. [Epub ahead of print]), says that:&lt;br /&gt;&lt;blockquote&gt;It is noteworthy that, because the DNA structure endured strong deformations in these simulations, 3DNA could not perform the analysis of the angles for all the 5000 snapshots. Actually, for system 3 the calculation of the backbone parameters could be performed in only 59% of the snapshots, whereas in system 4 only 18% of the configurations allowed this calculation.&lt;/blockquote&gt;Dr. Netz, the corresponding author, confirmed to me that:&lt;br /&gt;&lt;blockquote&gt;I don't think this is a 3DNA problem, but is rather an effect of the distortion caused by the problems in parameterization of the GROMOS forcefield.&lt;/blockquote&gt;Essentially, though, the problem lies in the coupled use of &lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;find_pair/analyze&lt;/span&gt;; for strongly deformed DNA structures along the simulation trajectories, some base pairs melt out, and are thus beyond the recognition of &lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;find_pair&lt;/span&gt;. On the other hand, one does &lt;span style="font-weight: bold;"&gt;not&lt;/span&gt; need to run &lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;find_pair&lt;/span&gt; for each structure; simply prepare a &lt;span style="font-weight: bold;"&gt;template&lt;/span&gt; input file for &lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;analyze&lt;/span&gt;, with the help of &lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;find_pair&lt;/span&gt; and probably some manual editing, then all the structures can be analyzed, no matter how distorted they are. Furthermore, saving the &lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;find_pair&lt;/span&gt; step should speed up the analysis of thousands of structures, typical of MD simulations.&lt;br /&gt;&lt;br /&gt;In fact, 3DNA (from v1.5) has a Perl script named &lt;span class="Apple-style-span"  style="font-family:'courier new';"&gt;nmr_strs&lt;/span&gt; that implements the above idea in a &lt;span style="font-style: italic;"&gt;preliminary&lt;/span&gt; form. The script is intended to serve as a starting point for the analysis of multiple similar structures, in an NMR ensemble or from MD simulations. See 3DNA &lt;a href="http://3dna.rutgers.edu:8080/dokuwiki/doku.php?id=documentation:nmr_strs"&gt;wiki page on nmr_strs&lt;/a&gt; for details.&lt;br /&gt;&lt;br /&gt;In the long run, a better integration between 3DNA and AMBER or other MD packages would solve this problem from the upstream, thus making users' life (downstream) much easier. Until that happens, I hope this post could help clarify the issue somewhat in this important application area of 3DNA. As always, I greatly value your comments.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-798644565356726847?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/798644565356726847/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/07/3dna-for-analysis-of-molecular-dynamics.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/798644565356726847'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/798644565356726847'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/07/3dna-for-analysis-of-molecular-dynamics.html' title='3DNA for the analysis of molecular dynamics simulations'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-4640042767404134913</id><published>2010-07-16T22:07:00.011-04:00</published><updated>2010-07-16T23:59:12.319-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>PDB fingerprints make salient features of PDB entries obvious at a glance</title><content type='html'>In the past couple of days, I have followed with great interest the thread "&lt;a href="http://www.ebi.ac.uk/pdbe-apps/widgets/pdbprints"&gt;PDBprints&lt;/a&gt; - salient, at-a-glance info about PDB entries" in CCP4BB. The idea behind PDBprints (short for PDB finger prints) is very simple: now with over 66 thousands entries, no one could possibly be intimately familiar with many of the structures in PDB. Yet "it would be very useful to have a way of obtaining at-a-glance information about the salient features of one or more PDB entries." In this scheme, key features (e.g., experimental method, species, protein/NA/ligand components, published or not) are represented by stylized icons called PDBlogos; a set of which organized in a specific order gives PDBprints.&lt;br /&gt;&lt;br /&gt;The &lt;a href="https://www.jiscmail.ac.uk/cgi-bin/webadmin?A1=ind1007&amp;L=CCP4BB#43"&gt;initial announcement&lt;/a&gt;, posted on Thursday by Gerard Kleywegt (the current Director of &lt;a href="http://www.ebi.ac.uk/pdbe/"&gt;PDBe&lt;/a&gt;), has been well-received by the community. I am very impressed by the quality of the (over a dozen) feedbacks on icons coloring for presence/absence of a feature, and the proper icons for species etc. On the other hand, I am surprised by the quick and concrete &lt;a href="https://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind1007&amp;L=CCP4BB&amp;F=&amp;S=&amp;P=185973"&gt;response Gerard posted today&lt;/a&gt;, addressing the various issues.&lt;br /&gt;&lt;br /&gt;Over the years, I have taken PDB/NDB mostly as data repositories; I regularly download (update) nucleic-acid containing structures for local analyses. Among the three wwPDB sites (RCSB, PDBe and PDBj), I've only visited the RCSB site on a regular basis. Now I find more features are available from PDBe, which I will browse more in the future.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-4640042767404134913?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/4640042767404134913/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/07/pdb-fingerprints-make-salients-features.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4640042767404134913'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4640042767404134913'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/07/pdb-fingerprints-make-salients-features.html' title='PDB fingerprints make salient features of PDB entries obvious at a glance'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-8647240286052789157</id><published>2010-07-03T23:46:00.003-04:00</published><updated>2010-07-04T00:43:03.103-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>In 3DNA, analyze and rebuild are two sides of the same coin</title><content type='html'>Recently, I noticed the following two papers:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;"&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20123729"&gt;A measure of bending in nucleic acids structures applied to A-tract DNA&lt;/a&gt;" by Lankas &lt;span style="font-style: italic;"&gt;et al&lt;/span&gt;. (2010), &lt;span style="font-style: italic;"&gt;Nucleic Acids Res.,&lt;/span&gt;&lt;span style="font-weight: bold;"&gt;38(10)&lt;/span&gt;, 3414–3422.&lt;/li&gt;&lt;li&gt;"&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20496935"&gt;The chirality of DNA: elasticity cross-terms at base-pair level including A-tracts and the influence of ionic strength&lt;/a&gt;" by Noy and Golestanian (2010), &lt;span style="font-style: italic;"&gt;J. Phys. Chem. B&lt;/span&gt;, &lt;span style="font-weight: bold;"&gt;114&lt;/span&gt;, 8022–8031.&lt;/li&gt;&lt;/ul&gt;They both cite 3DNA, where the combined usage of its &lt;span style="font-family:courier new;"&gt;analyze&lt;/span&gt;/&lt;span style="font-family:courier new;"&gt;rebuild&lt;/span&gt; components has played a significant role.&lt;br /&gt;&lt;br /&gt;When I first had access to the &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/7666417"&gt;CEHS scheme&lt;/a&gt;, I was immediately attracted by its mathematical rigor which allows for complete reversibility in DNA structural analysis and model rebuilding. Historically, CEHS was the initial seed of the SCHNAaP/SCHNArP programs, and &lt;span style="font-family:courier new;"&gt;analyze/rebuild&lt;/span&gt; in 3DNA were directly derived from them.&lt;br /&gt;&lt;span style="font-family:monospace;"&gt;&lt;br /&gt;&lt;/span&gt;Frequently, I think of &lt;span style="font-family:courier new;"&gt;analyze/rebuild&lt;/span&gt; as two sides of the same coin: starting from a nucleic acid structure (e.g., a DNA double helix), the &lt;span style="font-family:courier new;"&gt;analyze&lt;/span&gt; program gives a set of base-pair (propeller, buckle etc) and step (roll, slide etc) parameters. The structural parameters can be used to rebuild the structure, which is virtually identical in base geometry (i.e., without taking consideration of the sugar-phosphate backbone) to the original structure. Conversely, analyzing  the rebuilt structure again gives exactly the same set of structural parameters describing the relative base geometry.&lt;br /&gt;&lt;br /&gt;Reversibility is essentially a simple concept, nevertheless a very powerful one. Over the years, I am glad to see more people are taking advantage of this 3DNA unique feature in addressing real-world problems related to nucleic acid structures. I am confident to see more such applications.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-8647240286052789157?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/8647240286052789157/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/07/in-3dna-analyze-and-rebuild-are-two.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8647240286052789157'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8647240286052789157'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/07/in-3dna-analyze-and-rebuild-are-two.html' title='In 3DNA, analyze and rebuild are two sides of the same coin'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5511635566980096195</id><published>2010-06-27T23:20:00.002-04:00</published><updated>2010-06-28T00:12:05.274-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>Get all torsion angles in a nucleic acid structure</title><content type='html'>In the field of nucleic acid structure analysis, a commonly calculated set of parameters is the torsion angles. Specifically, they include the main chain and chi (χ) torsion angles defined as follows:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;α: O3'(i-1)-P-O5'-C5'&lt;/li&gt;&lt;li&gt;β: P-O5'-C5'-C4'&lt;/li&gt;&lt;li&gt;γ: O5'-C5'-C4'-C3'&lt;/li&gt;&lt;li&gt;δ: C5'-C4'-C3'-O3'&lt;/li&gt;&lt;li&gt;ε: C4'-C3'-O3'-P(i+1)&lt;/li&gt;&lt;li&gt;ζ: C3'-O3'-P(i+1)-O5'(i+1)&lt;/li&gt;&lt;li&gt;χ: for pyrimidines (Y, i.e., T, U, C): O4'-C1'-N1-C2; for purines (R, i.e., A, G): O4'-C1'-N9-C4&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;A related set of parameters characterizes the sugar conformation:&lt;ul&gt;&lt;li&gt;ν0: C4'-O4'-C1'-C2'&lt;/li&gt;&lt;li&gt;ν1: O4'-C1'-C2'-C3'&lt;/li&gt;&lt;li&gt;ν2: C1'-C2'-C3'-C4'&lt;/li&gt;&lt;li&gt;ν3: C2'-C3'-C4'-O4'&lt;/li&gt;&lt;li&gt;ν4: C3'-C4'-O4'-C1'&lt;/li&gt;&lt;li&gt;tm: amplitude of pseudorotation of the sugar ring&lt;/li&gt;&lt;li&gt;&lt;span style="font-style:italic;"&gt;P&lt;/span&gt;:  phase angle of pseudorotation of the sugar ring&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;div&gt;These torsion angles are clearly defined and are readily available from various informatics programs. Not surprisingly, 3DNA also provides a complete set of DNA/RNA backbone torsions, calculated &lt;span style="font-style:italic;"&gt;robustly&lt;/span&gt; and &lt;span style="font-style:italic;"&gt;efficiently&lt;/span&gt;. The key is the "&lt;span style="font-family:'courier new';"&gt;&lt;b&gt;-s&lt;/b&gt;&lt;/span&gt;" option of &lt;span style="font-family:'courier new';"&gt;find_pair&lt;/span&gt;" program, which is nevertheless little used, mostly because it is not the default.&lt;/div&gt;&lt;br /&gt;&lt;div&gt;Using the &lt;span style="font-style:italic;"&gt;Haloarcula marismortui&lt;/span&gt; 50S large ribosomal subunit as an example (&lt;span style="font-family:'courier new';"&gt;1jj2&lt;/span&gt;), the output file &lt;span style="font-family:'courier new';"&gt;1jj2.outs &lt;/span&gt; from the following command contains all the above mentioned main chain and sugar conformational parameters:&lt;pre&gt;find_pair -s 1jj2.pdb stdout | analyze&lt;/pre&gt;&lt;/div&gt;Please see the Jena "&lt;a href="http://www.imb-jena.de/Piet/help/backbone.html"&gt;Nucleic acid backbone parameters&lt;/a&gt;" website for a diagram (based on Saenger's book) defining the various backbone torsion angles.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5511635566980096195?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5511635566980096195/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/get-all-torsion-angles-in-nucleic-acid.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5511635566980096195'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5511635566980096195'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/get-all-torsion-angles-in-nucleic-acid.html' title='Get all torsion angles in a nucleic acid structure'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-8566628529466110582</id><published>2010-06-20T15:48:00.004-04:00</published><updated>2010-06-20T16:29:18.613-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Subscribing to mailing lists in daily digest mode</title><content type='html'>I am currently on quite a few mailing lists that are relevant to my research topics in a broad sense, including &lt;a href="http://wiki.jmol.org/index.php/Mailing_Lists"&gt;Jmol&lt;/a&gt;, &lt;a href="https://lists.sdsc.edu/mailman/listinfo.cgi/pdb-l"&gt;pdb-l&lt;/a&gt; and &lt;a href="http://www.ccp4.ac.uk/ccp4bb.php"&gt;CCP4&lt;/a&gt;. Over the years, I've found it very handy to keep informed of a field by following in its (major) mailing list. However, I could easily get lost with so many posts each day on some active lists. So subscribing to the daily digest mode, provided by many mailing list management software, has become a norm. Thus, I would receive only one (or a few) email(s) from a mailing list. I could then easily browse through the subject lines to decide if to read further of a specific topic.&lt;br /&gt;&lt;br /&gt;Among the three lists mentioned above, Jmol is quite active with several core contributors, most impressively from Prof. Robert Hanson. The PDB mailing list is of unbelievably low volume, with less than a post per day. CCP4 bullet board is overall the most well-organized; I am surprised by the knowledge-base from the posts there – it is a great resource in structural biology.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://archive.ambermd.org/"&gt;AMBER mailing list&lt;/a&gt; and &lt;a href="http://www.ccl.net/"&gt;Computational Chemistry List (CCL)&lt;/a&gt; are the other two lists I subscribed to before. However, I could not get into the daily digest mode; soon my mail box was flooded by many unrelated posts so I had to un-subscribe from them.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-8566628529466110582?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/8566628529466110582/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/subscribing-mailing-lists-in-daily.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8566628529466110582'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8566628529466110582'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/subscribing-mailing-lists-in-daily.html' title='Subscribing to mailing lists in daily digest mode'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5668284367778031236</id><published>2010-06-20T13:07:00.005-04:00</published><updated>2010-06-20T16:34:42.843-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Journal impact factor and individual researcher h-index</title><content type='html'>The &lt;a href="http://www.nature.com/nature/journal/v465/n7300/index.html"&gt;June 17, 2010 issue of &lt;span style="font-style: italic;"&gt;Nature&lt;/span&gt;&lt;/a&gt; (Vol. &lt;span style="font-weight: bold;"&gt;465&lt;/span&gt;, No. 7300) has extensive discussions on assessing the impact (influence/significance) of a journal or an individual researcher using quantitative metrics. Not surprisingly, there is no consensus. Over the past 20 years, the field of &lt;span style="font-style: italic;"&gt;bibliometrics&lt;/span&gt; (&lt;span style="font-style: italic;"&gt;scientometrics&lt;/span&gt;) has seen a ten-fold explosion in publications. In fact, the title of the editorial is "Assessing assessment"; it boils down to whether we should use a quantitative metric, a combination of several such metrics, or which metrics to choose among the so many possibilities.&lt;br /&gt;&lt;br /&gt;Among the various view points, I agree more with David Pendlebury, Citation Analyst of Thomson Reuters. "No one enjoys being measured — unless he or she comes out on top." "Importantly, publication-based metrics provide an objective counterweight ... to bias of many kinds." "there are dangers [to] put too much faith in them. A quantitative profile should always be used to foster discussion, rather than to end it."&lt;br /&gt;&lt;br /&gt;Among the many currently available metrics, it seems fair to say:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;impact factor&lt;/span&gt; (&lt;span style="font-weight: bold;"&gt;IF&lt;/span&gt;), introduced in 1963, is the most important one to measure the impact of a journal. Nowadays, it is common to see the IF of a journal at its home page. For example, currently &lt;span style="font-style: italic;"&gt;Nucleic Acids Research&lt;/span&gt; has an IF of 6.878. However, as emphasized by Anthony van Raan, director of the Centre for Science and Technology Studies at Leiden University in the Netherlands: "You should never use the journal impact factor to evaluate research performance for an article or for an individual — that is a mortal sin." Indeed, "In 2005, 89% of Nature’s impact factor was&lt;br /&gt;generated by 25% of the articles."&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;h-index&lt;/span&gt;, &lt;a href="http://www.pnas.org/content/102/46/16569.full"&gt;introduced in 2005 by Hirsch&lt;/a&gt;, is currently the most influential metric to quantify the productivity and impact of an individual researcher. An h-index is "defined as the number of papers with citation number ≥h". It "gives an estimate of the importance, significance, and broad impact of a scientist’s &lt;span style="font-style: italic;"&gt;cumulative&lt;/span&gt; research contributions." I found it very informative to read the original, 4-page long PNAS paper to understand clearly why h-index is defined that way and to be aware of some of its caveats. &lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;number of citations&lt;/span&gt; is undoubtedly the most objective metric to measure the impact of a publication.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;Overall, it is easy to come up with a quantitative measure; the point is what that number really means. Along the line, it is of crucial importance to know exactly how a metric is calculated. No metric could be perfect; however, a metric, defined transparently and applied consistently, is more objective and convincing than other means.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5668284367778031236?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5668284367778031236/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/journal-impact-factor-and-individual-h.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5668284367778031236'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5668284367778031236'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/journal-impact-factor-and-individual-h.html' title='Journal impact factor and individual researcher h-index'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-6609107083980212604</id><published>2010-06-11T23:11:00.008-04:00</published><updated>2010-06-12T00:27:07.995-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>3DNA citations from mainland Chinese researchers</title><content type='html'>Over the past couple of weeks, I have been quite &lt;span style="font-style: italic;"&gt;surprised&lt;/span&gt; to notice the following four papers, by researchers from mainland China, that cite 3DNA:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Feng &lt;span style="font-style: italic;"&gt;et al&lt;/span&gt;. "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20512977"&gt;Crystal structure of the crenarchaeal conserved chromatin protein Cren7 and double-stranded DNA complex&lt;/a&gt;." &lt;span style="font-style: italic;"&gt;Protein Sci&lt;/span&gt;. 2010 Jun;&lt;span style="font-weight: bold;"&gt; 19&lt;/span&gt;(6):1253-7.&lt;/li&gt;&lt;li&gt;Zhang &lt;span style="font-style: italic;"&gt;et al&lt;/span&gt;. "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20345658"&gt;Structural insights into the interaction of the crenarchaeal chromatin protein Cren7 with DNA&lt;/a&gt;." &lt;span style="font-style: italic;"&gt;Mol Microbiol.&lt;/span&gt; 2010 Mar 16. [Epub ahead of print]&lt;/li&gt;&lt;li&gt;Li &lt;span style="font-style: italic;"&gt;et al&lt;/span&gt;. "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20508957"&gt;Molecular dynamics studies of the 3D structure and planar ligand binding of a quadruplex dimer&lt;/a&gt;." &lt;span style="font-style: italic;"&gt;J Mol Model.&lt;/span&gt; 2010 May 29. [Epub ahead of print]&lt;/li&gt;&lt;li&gt;Chen &lt;span style="font-style: italic;"&gt;et al&lt;/span&gt;. "Theoretical studies on the stacking interactions between methylated and unmethylated DNA bases." &lt;span style="font-style: italic;"&gt;ACTA CHIMICA SINICA&lt;/span&gt; (in Chinese) &lt;span style="font-weight: bold;"&gt;68&lt;/span&gt;(8):739-46. Published: Apr. 28, 2010. &lt;/li&gt;&lt;/ul&gt;For more ten years, this is the first time (to the best of my knowledge) that 3DNA has been cited by scientists from China, and it so happens that four come in a row. I am so happy to see that structural biology is shaping up in China. Surely, I would expect more &lt;a href="http://scholar.google.com/scholar?hl=en&amp;amp;lr=&amp;amp;q=allintitle%3A3dna+author%3A%22XJ+Lu%22&amp;amp;btnG=Search"&gt;3DNA-citing papers&lt;/a&gt; from China to come in the following years.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-6609107083980212604?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/6609107083980212604/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/3dna-citations-from-mainland-chinese.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6609107083980212604'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6609107083980212604'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/3dna-citations-from-mainland-chinese.html' title='3DNA citations from mainland Chinese researchers'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2014473206258921789</id><published>2010-06-11T22:25:00.006-04:00</published><updated>2010-06-11T23:08:10.561-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>block_atom, a little used 3DNA utility script</title><content type='html'>Following my previous post, "&lt;a href="http://xiang-jun.blogspot.com/2010/06/how-is-base-pair-rectangular-block.html"&gt;How is a base-pair rectangular block defined in 3DNA?&lt;/a&gt;", I feel it is in order to mention a tiny Perl script named &lt;span style="font-family:courier new;"&gt;block_atom&lt;/span&gt; that seems to have been little used by the 3DNA user community.&lt;br /&gt;&lt;br /&gt;The basic idea of &lt;span style="font-family:courier new;"&gt;block_atom&lt;/span&gt; is to get an ALCHEMY data file which combines both atomic and block representations of nucleic-acid-containing structure in PDB format. The ALCHEMY file can then be displayed &lt;span style="font-style: italic;"&gt;interactively&lt;/span&gt; using &lt;a href="http://xiang-jun.blogspot.com/2009/06/rasmol-264-by-roger-sayle.html"&gt;RasMol (v2.6.4, by its original author)&lt;/a&gt; (or Jmol).&lt;br /&gt;&lt;br /&gt;Again, the point is best illustrated with an example. Here I am using &lt;a href="http://ndbserver.rutgers.edu/atlas/xray/structures/B/bdl084/bdl084.html"&gt;355d&lt;/a&gt;, the famous Drew-Dickerson B-DNA dodecamer solved at a high resolution by Williams and colleagues. Let the input PDB file be &lt;span style="font-family:courier new;"&gt;355d.pdb&lt;/span&gt;, then the output file is automatically named &lt;span style="font-family:courier new;"&gt;355d.alc&lt;/span&gt;, which can be displayed using RasMol with the options '&lt;span style="font-weight: bold;font-family:courier new;"&gt;-alchemy -noconnect&lt;/span&gt;'.&lt;br /&gt;&lt;pre style="background-color: rgb(238, 238, 238);"&gt;block_atom 355d.pdb&lt;br /&gt;rasmol -alchemy -noconnect 355d.alc&lt;br /&gt;&lt;/pre&gt;One view of RasMol rendered image is as below, which shows more clearly the minor-groove (colored blue) of the DNA duplex and the various intra-base-pair deformations (e.g., propeller and buckle):&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_CEuD25cqpCE/TBL3JMju-mI/AAAAAAAAAD8/_GavtMptdjU/s1600/3dna_355d.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 230px; height: 400px;" src="http://3.bp.blogspot.com/_CEuD25cqpCE/TBL3JMju-mI/AAAAAAAAAD8/_GavtMptdjU/s400/3dna_355d.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5481715433825303138" /&gt;&lt;/a&gt;I initially wrote this utility script mostly for verification purpose, i.e., to make sure that the base reference frames are defined properly. For those who want to know more about 3DNA, having a look of the script (which is &lt;b&gt;&lt;i&gt;tiny&lt;/i&gt;&lt;/b&gt;) and understanding how it works could be a good exercise.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2014473206258921789?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2014473206258921789/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/blockatom-little-used-3dna-utility.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2014473206258921789'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2014473206258921789'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/blockatom-little-used-3dna-utility.html' title='block_atom, a little used 3DNA utility script'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_CEuD25cqpCE/TBL3JMju-mI/AAAAAAAAAD8/_GavtMptdjU/s72-c/3dna_355d.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-7702354986205215797</id><published>2010-06-05T22:27:00.009-04:00</published><updated>2010-06-06T00:31:36.456-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>How is a base-pair rectangular block defined in 3DNA?</title><content type='html'>One of 3DNA's unique features is the base-pair (and base) rectangular blocks, as shown in the figure below. Since such a schematic was  initialized by Calladine and Drew in their &lt;a href="http://www.sciencedirect.com/science/book/9780121550899"&gt;Understanding DNA&lt;/a&gt; book, I normally refer it as the Calladine-Drew style representation.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_CEuD25cqpCE/TAsSzlzMR5I/AAAAAAAAADs/ox2vS2ioPAg/s1600/3dna_blocks.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 256px;" src="http://4.bp.blogspot.com/_CEuD25cqpCE/TAsSzlzMR5I/AAAAAAAAADs/ox2vS2ioPAg/s400/3dna_blocks.png" alt="" id="BLOGGER_PHOTO_ID_5479494049155794834" border="0" /&gt; &lt;/a&gt;&lt;br /&gt;By &lt;span style="font-style: italic;"&gt;default&lt;/span&gt;, a base-pair [BP, (a)] has a dimension of &lt;span style="font-family:courier new;"&gt;10x4.5x0.5&lt;/span&gt; Å;  a purine [R, (b) left] &lt;span style="font-family:courier new;"&gt;4.5x4.5x0.5&lt;/span&gt; Å; a pyrimidine [Y, (b) right] &lt;span style="font-family:courier new;"&gt;3x4.5x0.5&lt;/span&gt;  Å; and a &lt;span style="font-style: italic;"&gt;mean&lt;/span&gt; base [M, (c)], which is exactly half of the base-pair, &lt;span style="font-family:courier new;"&gt;5x4.5x0.5&lt;/span&gt; Å.&lt;br /&gt;&lt;br /&gt;The blocks are put into specific files: &lt;span style="font-family:courier new;"&gt;Block_BP.alc&lt;/span&gt;, &lt;span style="font-family:courier new;"&gt;Block_R.alc&lt;/span&gt;, and &lt;span style="font-family:courier new;"&gt;Block_Y.alc&lt;/span&gt; respectively. To use M for R and Y, one just needs to copy file &lt;span style="font-family:courier new;"&gt;Block_M.alc&lt;/span&gt; to overwrite &lt;span style="font-family:courier new;"&gt;Block_R.alc&lt;/span&gt; and &lt;span style="font-family:courier new;"&gt;Block_Y.alc&lt;/span&gt; in the current directory or the system directory (&lt;span style="font-family:courier new;"&gt;$X3DNA/config/&lt;/span&gt;). These blocks are used in the rebuilding and visualization components of 3DNA (see my blog post "&lt;a href="http://xiang-jun.blogspot.com/2009/06/blocview-simple-effective-visualization.html"&gt;blocview: a simple, effective visualization tool for nucleic acid structures&lt;/a&gt;").&lt;br /&gt;&lt;br /&gt;The blocks are stored in the &lt;span style="font-family:courier new;"&gt;ALCHEMY&lt;/span&gt; format for easy specification of the nodes and edges, and since ALCHEMY is a format supported by RasMol (and Jmol). As an example, &lt;span style="font-family:courier new;"&gt;Block_BP.alc&lt;/span&gt; has the following content:&lt;pre style="background-color: rgb(204, 204, 204); font-size: smaller;"&gt;   12 ATOMS,    12 BONDS&lt;br /&gt;    1 N      -2.2500   5.0000   0.2500&lt;br /&gt;    2 N      -2.2500   0.5000   0.2500&lt;br /&gt;    3 N      -2.2500   0.5000  -0.2500&lt;br /&gt;    4 N      -2.2500   5.0000  -0.2500&lt;br /&gt;    5 C       2.2500   3.5000   0.2500&lt;br /&gt;    6 C       2.2500   0.5000   0.2500&lt;br /&gt;    7 C       2.2500   0.5000  -0.2500&lt;br /&gt;    8 C       2.2500   3.5000  -0.2500&lt;br /&gt;    9 C      -2.2500   5.0000   0.2500&lt;br /&gt;   10 C      -2.2500   0.5000   0.2500&lt;br /&gt;   11 C      -2.2500   0.5000  -0.2500&lt;br /&gt;   12 C      -2.2500   5.0000  -0.2500&lt;br /&gt;    1     1     2&lt;br /&gt;    2     2     3&lt;br /&gt;    3     3     4&lt;br /&gt;    4     4     1&lt;br /&gt;    5     5     6&lt;br /&gt;    6     6     7&lt;br /&gt;    7     7     8&lt;br /&gt;    8     5     8&lt;br /&gt;    9     9     5&lt;br /&gt;   10    10     6&lt;br /&gt;   11    11     7&lt;br /&gt;   12    12     8&lt;/pre&gt;Astute viewers may observe that nodes 1-4 are identified as N (nitrogen) and have exactly the same coordinates as 9-12 (C, carbon). This is a trick so RasMol can display the the minor groove edge in a different color than the other five sides of the rectangular, as shown in the following figure:&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_CEuD25cqpCE/TAsb6k4mQpI/AAAAAAAAAD0/bJ-Oj5YZ18w/s1600/3dna_Block_BP.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 387px; height: 75px;" src="http://2.bp.blogspot.com/_CEuD25cqpCE/TAsb6k4mQpI/AAAAAAAAAD0/bJ-Oj5YZ18w/s400/3dna_Block_BP.png" alt="" id="BLOGGER_PHOTO_ID_5479504064773767826" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Note that the rectangular is &lt;span style="font-style: italic; font-weight: bold;"&gt;preset&lt;/span&gt; in the standard base reference frame. Thus the nodes have &lt;span style="font-style: italic;"&gt;y&lt;/span&gt;-coordinates of +5 Å and -5 Å along the long edge of the base pair, and &lt;span style="font-style: italic;"&gt;x&lt;/span&gt;-coordinates of +2.25 Å and -2.25 Å  along the short edge.&lt;br /&gt;&lt;br /&gt;As an extra bonus of storing the blocks in external ALCHEMY text files, the dimensions of the blocks are be readily changed. For example, the &lt;span style="font-style: italic;"&gt;depth&lt;/span&gt; of a block (&lt;span style="font-style: italic;"&gt;z&lt;/span&gt;-coordinates) can be easily increased from 0.5 to 1.0 Å to make it thicker (see &lt;span style="font-family: courier new;"&gt;$X3DNA/config/Block_BP1.alc&lt;/span&gt;). Moreover, the blocks do not need to be rectangular either – see &lt;span style="font-family: courier new;"&gt;$X3DNA/config/block/Block_R_nr.alc&lt;/span&gt; for an example.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-7702354986205215797?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/7702354986205215797/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/how-is-base-pair-rectangular-block.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7702354986205215797'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7702354986205215797'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/06/how-is-base-pair-rectangular-block.html' title='How is a base-pair rectangular block defined in 3DNA?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_CEuD25cqpCE/TAsSzlzMR5I/AAAAAAAAADs/ox2vS2ioPAg/s72-c/3dna_blocks.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5554303919969711425</id><published>2010-05-28T22:22:00.003-04:00</published><updated>2010-05-28T23:33:18.909-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Naming conventions: RasMol, PyMOL and Jmol</title><content type='html'>Over the years, I've used the following three molecular graphics programs the most:  RasMol, PyMOL and Jmol. It is interesting to note their different naming conventions: while their names all end with m-o-l (for molecule, I believe), the cases (used in convention) are different: &lt;span style="font-weight: bold;"&gt;Mol&lt;/span&gt; as in RasMol, &lt;span style="font-weight: bold;"&gt;MOL&lt;/span&gt; as in PyMOL, and &lt;span style="font-weight: bold;"&gt;mol&lt;/span&gt; as in Jmol.&lt;br /&gt;&lt;br /&gt;For &lt;a href="http://jmol.sourceforge.net/"&gt;Jmol&lt;/a&gt;, there is actually an FAQ item "&lt;a href="http://jmol.sourceforge.net/faqs/index.en.html#How%20do%20you%20write%20Jmol?"&gt;How do you write Jmol?&lt;/a&gt;" that reads "Capital J, lower case mol. Please do not write it any other way." For &lt;a href="http://www.pymol.org/"&gt;PyMOL&lt;/a&gt;, I am not aware of a specification on its name – I know that's the "official name" from its home page. Regarding RasMol, that's the most common way to name it, even though the title of its 1995 publication is "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/7482707"&gt;RASMOL: biomolecular graphics for all.&lt;/a&gt;"&lt;br /&gt;&lt;br /&gt;As for the prefix in the names, J in Jmol stands for Java; Py in PyMOL for Python; and Ras in RasMol could well be the initials of Roger A. Sayle, the original author of the program.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5554303919969711425?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5554303919969711425/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/05/naming-conventions-rasmol-pymol-and.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5554303919969711425'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5554303919969711425'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/05/naming-conventions-rasmol-pymol-and.html' title='Naming conventions: RasMol, PyMOL and Jmol'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-9174306629546231522</id><published>2010-05-23T23:08:00.007-04:00</published><updated>2010-05-26T11:59:55.724-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>MODEL/ENDMDL records in PDB X-ray crystal structures</title><content type='html'>Over the years, I have always been using the PDB format for nucleic acid structures. Naturally I've thought that I know the format well, especially the "&lt;a href="http://www.wwpdb.org/documentation/format32/sect9.html"&gt;Coordinate Section&lt;/a&gt;." According to the PDB documentation,&lt;br /&gt;&lt;blockquote&gt;MODEL/ENDMDL records are used  only when more than one structure is  presented in the entry, as is often the  case with NMR entries.&lt;br /&gt;&lt;/blockquote&gt; In my experience, I have always connected the MODEL/ENDMDL pair &lt;span style="font-style: italic;"&gt;only&lt;/span&gt; with the different models in an NMR entry. However, I was recently bitten by a subtlety in PDB format that is related to the MODEL/ENDMDL records in X-ray crystal structures which contain more than one asymmetric unit in their   biological assembly.&lt;br /&gt;&lt;br /&gt;As always, the point is best illustrated with an example – here I am using the four-stranded DNA Holliday junction X-ray crystal structure (&lt;a href="http://www.rcsb.org/pdb/explore/explore.do?structureId=1ZF4"&gt;1zf4&lt;/a&gt;/&lt;a href="http://ndbserver.rutgers.edu/atlas/xray/structures/U/ud0061/ud0061.html"&gt;ud0061&lt;/a&gt;) solved by Ho and colleagues [&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/15870206"&gt;&lt;span style="font-style: italic;"&gt;PNAS&lt;/span&gt; 2005 May 17;&lt;span style="font-weight: bold;"&gt;102(20)&lt;/span&gt;:7157-62]&lt;/a&gt;. As shown in the NDB website, the asymmetric unit of &lt;a href="http://ndbserver.rutgers.edu/atlas/xray/structures/U/ud0061/ud0061.html"&gt;1zf4/ud0061&lt;/a&gt; contains only two chains; it is the biological assembly that has the four-stranded DNA junction.&lt;br /&gt;&lt;br /&gt;I downloaded the &lt;span style="font-style: italic;"&gt;biological&lt;/span&gt; assembly coordinates (in PDB format) from the NDB (named it '&lt;span style="font-family:courier new;"&gt;1zf4.pdb&lt;/span&gt;'), and ran &lt;span style="font-family:courier new;"&gt;blocview&lt;/span&gt; on it: &lt;span style="font-weight: bold;font-family:courier new;" &gt;blocview -i=1zf4.png 1zf4.pdb&lt;/span&gt;. However, the generated image [see Figure (A) below] has only half of what expected – as if the downloaded file contains only coordinates of an asymmetric unit. The mystery was gone when I checked the PDB file and realized that the MODEL/ENDMDL records now also apply to X-ray  crystal structures to delineate symmetric-related units. Since 3DNA is designed to handle only one structure – it stops processing whenever an END or ENDMDL record is encountered. Simply change each 'ENDMDL' record to ' ENDMDL' (i.e., adding a space, or any character, for that matter) and run &lt;span style="font-family:courier new;"&gt;blocview&lt;/span&gt; again will get the expected image [Figure (B)].&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_CEuD25cqpCE/S_1FQ-9UK5I/AAAAAAAAADk/7wBykHuxm7w/s1600/3dna_1zf4.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 212px;" src="http://3.bp.blogspot.com/_CEuD25cqpCE/S_1FQ-9UK5I/AAAAAAAAADk/7wBykHuxm7w/s400/3dna_1zf4.png" alt="" id="BLOGGER_PHOTO_ID_5475608880032787346" border="0" /&gt;&lt;/a&gt;Note that by default, RasMol also only displays the first model in a PDB file. To see multiple structures, the  option '&lt;span style="font-family:courier new;"&gt;-nmrpdb&lt;/span&gt;' must be specified in the  command line.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-9174306629546231522?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/9174306629546231522/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/05/modelendmdl-records-in-pdb-x-ray.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/9174306629546231522'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/9174306629546231522'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/05/modelendmdl-records-in-pdb-x-ray.html' title='MODEL/ENDMDL records in PDB X-ray crystal structures'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_CEuD25cqpCE/S_1FQ-9UK5I/AAAAAAAAADk/7wBykHuxm7w/s72-c/3dna_1zf4.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5405987051386223837</id><published>2010-05-15T17:59:00.003-04:00</published><updated>2010-05-18T08:50:11.970-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>3DNA forum registration -- your id</title><content type='html'>Over the past couple of months, I have received a few emails with a subject line, as the title of this post, "3DNA forum registration -- your id". Presumably, such an email is from a (potential) 3DNA user, requesting for activation after 3DNA forum registration.&lt;br /&gt;&lt;br /&gt;Clearly, "your id" should have been a specified user id. In a couple of cases, I tried to figure out the corresponding 3DNA forum ids based on email addresses and activated them. More recently, I've switched to ask the sender a very simple question: "&lt;span style="font-weight: bold;"&gt;So what is '&lt;/span&gt;&lt;span style="font-weight: bold;" class="il"&gt;your&lt;/span&gt;&lt;span style="font-weight: bold;"&gt; &lt;/span&gt;&lt;span style="font-weight: bold;" class="il"&gt;id&lt;/span&gt;&lt;span style="font-weight: bold;"&gt;' specifically?&lt;/span&gt;" In at least two cases, I have never heard back from the original senders with 'your id' specified. To a certain extent, this is a surprising result: providing such information won't take more than a couple of minutes, and this will help me (and others) to help them better with 3DNA-related questions. I can certainly understand occasional negligence, but users must follow some common-sense rules. Thus those registrations are deleted, as junk ones, after some grace period.&lt;br /&gt;&lt;br /&gt;The 3DNA forum is open to public for browsing, without registration. At its current settings, no emails are sent from the forum; do not confuse an online forum with a mailing list. You only need to register if you want (and are certainly more than welcome)  to get more actively involved in, e.g. asking/answering questions, sharing tips/tricks with other users. These policies are enforced to make the 3DNA forum free from spams, as much as possible.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5405987051386223837?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5405987051386223837/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/05/3dna-forum-registration-your-id.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5405987051386223837'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5405987051386223837'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/05/3dna-forum-registration-your-id.html' title='3DNA forum registration -- your id'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-7987583672563102025</id><published>2010-05-08T08:40:00.005-04:00</published><updated>2010-05-08T10:24:17.400-04:00</updated><title type='text'>Some key combinations in MacBook Pro (Snow Leopard)</title><content type='html'>One of the inconveniences I experienced when first switching to a MacBook Pro running Mac OS X (Snow Leopard) was its missing of the PgUp, PgDn, Home and End keys. Over the past few months, I have learned that the functionality of such 'convenience' keys can be achieved via a combination of the arrow-keys (bottom right), with the 'fn', 'option', or 'command' keys (bottom left), as follows:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;fn + right-arrow – end of a document&lt;/li&gt;&lt;li&gt;fn + left-arrow – beginning of a document&lt;/li&gt;&lt;li&gt;fn + up-arrow – Page up&lt;/li&gt;&lt;li&gt;fn + down-arrow – Page down&lt;/li&gt;&lt;li&gt;command + right-arrow – beginning of a line (Home)&lt;/li&gt;&lt;li&gt;command + left-arrow – end of a line (End)&lt;/li&gt;&lt;li&gt;option + right-arrow – one word to the right&lt;/li&gt;&lt;li&gt;option + left-arrow – one word to the left&lt;/li&gt;&lt;/ul&gt;According to Wikipedia, the &lt;a href="http://en.wikipedia.org/wiki/Fn_key"&gt;'fn' key&lt;/a&gt; "is a &lt;i&gt;modifier&lt;/i&gt; key on many keyboards, especially on laptops, used in a compact layout to combine keys which are usually kept separate."&lt;br /&gt;&lt;br /&gt;In addition to modifying cursor movements as noted above, the 'fn' key in Mac OS X can also be used to switch the default functionalities of F1 to F12 (e.g., F1 and F2 for screen brightness control) to the &lt;i&gt;standard&lt;/i&gt; function keys. As an example, in MS Word, the keyboard shortcut for toggling case is "shift + f3", a trick I recently learned. In the default setting, one must press "fn + shift + f3" to achieve the desired effect.&lt;br /&gt;&lt;br /&gt;Any tricks to share? I'd like to hear them!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-7987583672563102025?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/7987583672563102025/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/05/some-key-combinations-in-macbook-pro.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7987583672563102025'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7987583672563102025'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/05/some-key-combinations-in-macbook-pro.html' title='Some key combinations in MacBook Pro (Snow Leopard)'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2856711861221063018</id><published>2010-05-01T14:01:00.005-04:00</published><updated>2010-05-01T23:07:19.684-04:00</updated><title type='text'>One year of blogging</title><content type='html'>When I checked the date of &lt;a href="http://xiang-jun.blogspot.com/2009/05/about-my-name.html"&gt;my first blog post&lt;/a&gt; today, I was a bit surprised to find that it is exactly one year since I begin to blog on May 2, 2009. Altogether, I have written over 60 posts, slightly more than one per week. At this time, I feel it appropriate to summarize my thought on blogging in general, to provide a perspective to those who care to visit here and make comments. &lt;div&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Why&lt;/b&gt;? The initial motivation was to use blog as a platform to express my &lt;i&gt;personal&lt;/i&gt; views on issues I am interested in. As made clear in &lt;a href="http://xiang-jun.blogspot.com/2009/05/about-my-name.html"&gt;my first post&lt;/a&gt;, "this is Xiang-Jun's Corner on the Internet: all views are mine, and I am &lt;i&gt;&lt;b&gt;opinionated&lt;/b&gt;&lt;/i&gt;." Over the time, the blog posts have served as a convenient notebook (searchable and archived) either for my personal reference, or to refer others to a particular post (e.g., "&lt;a href="http://xiang-jun.blogspot.com/2009/07/on-maintaining-3dna-forum.html"&gt;On maintaining the 3DNA forum&lt;/a&gt;" when being asked a 3DNA-related question via email).&lt;/li&gt;&lt;li&gt;&lt;b&gt;What&lt;/b&gt;? "Random thoughts, mostly on scientific issues". I write only on issues I am familiar with and feel comfortable to say something, to the limit that I can respond quickly and concretely to users comments. I am always open to suggestions and will be prompt in acknowledging errors and making corrections. So far, the largest portion of the posts has been devoted to nucleic acid structures in general, and 3DNA-related topics in particular.&lt;/li&gt;&lt;li&gt;&lt;b&gt;How often&lt;/b&gt;? Due to time constraints, I will try to write one post per week at the minimum, maybe two, or in rare occasion three. Exceptions are possible, but by and large, I will aim for ~100 posts per year.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Comment&lt;/b&gt;? Currently, the policy is set such that "Anyone - includes Anonymous Users" can make a comment, and the comments are moderated. So far, I have always approved all the comments as soon as I see them in my gmail alert, and follow up where appropriate. Note that due to global time difference, commenters may experience some lag in time.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Does it work&lt;/b&gt;? Not unexpectedly, my blog has gradually attracted attentions from quite a broad audience, especially those interested in nucleic acid structures (3DNA), including leading scientists in the field (I know from emails I have received).&lt;/li&gt;&lt;li&gt;&lt;b&gt;Hot posts&lt;/b&gt;? According to Google Analytics, the most frequently visited nine posts are as follows (with posting date in parentheses):&lt;ol&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2009/08/curves-vs-3dna.html"&gt;Curves+ vs 3DNA &lt;/a&gt;(Sunday, August 16, 2009)&lt;/li&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2009/07/does-3dna-work-for-rna.html"&gt;Does 3DNA work for RNA?&lt;/a&gt; (Friday, July 10, 2009)&lt;/li&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2009/07/two-web-interfaces-to-3dna.html"&gt;Two web-interfaces to 3DNA, and more&lt;/a&gt; (Sunday, July 5, 2009)&lt;/li&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2009/08/fit-least-squares-plane-to-set-of.html"&gt;Fit a least squares plane to a set of points &lt;/a&gt;(Saturday, August 22, 2009)&lt;/li&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2009/05/two-3dna-figures-made-into-textbook-on.html"&gt;Two 3DNA figures made into a textbook on structural biology&lt;/a&gt; (Sunday, May 3, 2009)&lt;/li&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2010/01/chemical-diagram-of-watson-crick-base.html"&gt;Chemical diagram of Watson-Crick base-pairs&lt;/a&gt; (Saturday, January 23, 2010)&lt;/li&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2010/04/whats-special-about-gpu-dinucleotide.html"&gt;What's special about the GpU dinucleotide platform?&lt;/a&gt; (Friday, April 2, 2010)&lt;/li&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2009/09/double-helix-groove-width-parameters.html"&gt;Double helix groove width parameters from 3DNA&lt;/a&gt; (Saturday, September 5, 2009)&lt;/li&gt;&lt;li&gt;&lt;a href="http://xiang-jun.blogspot.com/2009/10/how-to-calculate-torsion-angle.html"&gt;How to calculate torsion angle?&lt;/a&gt; (Saturday, October 31, 2009)&lt;/li&gt;&lt;br /&gt;&lt;/ol&gt;While some of the posts are well-expected to be in the list, a few of them (e.g., ls-plane fitting, calculation of torsion angle) could look a bit surprising. It does, however, verify an observation based on my personal experience and intuitive feeling about a technical niche that my expertise can make a difference.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;Again, as I wrote in &lt;a href="http://xiang-jun.blogspot.com/2009/05/about-my-name.html"&gt;my first blog post&lt;/a&gt;, "Now the ball is rolling, and only time can tell where the destination will be -- but surely it will no longer stand where it was!" One year later, I can confidently say that the ball in rolling in the right direction, as I'd have hoped for. Of course, I know for sure more time and efforts are need to move to the next level, and I value &lt;i&gt;your feedback&lt;/i&gt;!&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2856711861221063018?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2856711861221063018/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/05/one-year-of-blogging.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2856711861221063018'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2856711861221063018'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/05/one-year-of-blogging.html' title='One year of blogging'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-4562950333190487858</id><published>2010-04-30T23:17:00.012-04:00</published><updated>2010-05-01T19:33:48.871-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='bioinformatics'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>C2B2 annual retreat; DNA shape matters</title><content type='html'>As in previous years, today's &lt;a href="http://www.c2b2.columbia.edu/"&gt;C2B2&lt;/a&gt; annual retreat provided a good opportunity to know  the projects going on, and to meet the people performing the researches in the broad area of computational biology and bioinformatics at Columbia. The depth and breadth of the topics covered, and the close collaborations among different labs were amazing; revealing the multidisciplinary nature of bioinformatics and systems biology. After lunch, there was only slightly more than 30 minutes left for poster session. I quickly walked around the posters and I only wished to have more time to read more carefully some of them and discuss with the presenters. &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;From a DNA structural perspective, of special note is the plenary talk by &lt;a href="http://www.bu.edu/chemistry/faculty/profile/tullius.html"&gt;Dr. Tom Tullius&lt;/a&gt;, titled "Function follow form: the role of DNA shape in the working of human genome". I am aware of Dr. Tullius work on hydroxyl radical footprinting, and his &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/19286520"&gt;2009 Science paper&lt;/a&gt; on the correlation between local DNA topography (the shape of the backbone and grooves) with functional noncoding regions in the human genome. His talk connects naturally to the work by &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/19865164"&gt;Rohs &lt;span style="font-style:italic;"&gt;et al&lt;/span&gt;. (&lt;i&gt;Nature&lt;/i&gt;, 2009)&lt;/a&gt; on the "The role of DNA shape in protein-DNA recognition" which shows that narrow DNA minor groove (associated with A-tracts and negative electrostatic potential) interacts preferably with arginine residues. The last talk by Dr. Richard Mann, titled "New insights into transcript factors specificity" echoed nicely with Dr. Tullius's presentation, emphasizing the importance of DNA shape. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Along the line, it is very interesting to note that Dr. Tullius mentioned a recently published paper titled "The shape of the DNA minor groove directs binding by the DNA-bending protein Fis" by &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20395367"&gt;Stella &lt;span style="font-style:italic;"&gt;et al&lt;/span&gt;., Genes Dev. 2010 Apr 15; 24(8):814-26&lt;/a&gt;. I happened to read this paper just a coupe of days ago and was impressed by the work. Additionally, I am happy to see that 3DNA's analyze/rebuild/fiber components combined together helped to pinpoint the fact that only three parameters &lt;span class="Apple-style-span"  style=" line-height: 21px; font-family:sans-serif;"&gt;–&lt;/span&gt;  roll, twist, and slide &lt;span class="Apple-style-span"  style=" line-height: 21px; font-family:sans-serif;"&gt;–&lt;/span&gt;  are needed to explain observed DNA shapes in Fis-DNA complexes. See also a related blog post, titled "&lt;a href="http://xiang-jun.blogspot.com/2009/05/fiber-analyze-and-rebuild-in-3dna.html"&gt;Fiber, analyze and rebuild in 3DNA&lt;/a&gt;". &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Overall, it is nice to see that the community has gradually realized that DNA structural features, not just (or in addition to) its base sequence, are a major player in protein/ligand-DNA recognition. From my (certainly not that extensive) understanding of the field, the impressive list of publications over the past few years by Dr. Remo Rohs (Barry Honig group at Columbia University) has played a significant role in "shaping" DNA.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-4562950333190487858?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/4562950333190487858/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/04/c2b2-annual-retreat-dna-shape-matters.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4562950333190487858'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4562950333190487858'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/04/c2b2-annual-retreat-dna-shape-matters.html' title='C2B2 annual retreat; DNA shape matters'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5167167887119673310</id><published>2010-04-23T23:18:00.007-04:00</published><updated>2010-04-26T01:33:49.296-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Life is complicated -- is there a way to make it simpler?</title><content type='html'>To mark the 10th anniversary on completion of the draft sequence of human genome, the &lt;a href="http://www.nature.com/nature/journal/v464/n7289/index.html"&gt;April 01, 2010 issue&lt;/a&gt; of &lt;span style="font-style: italic;"&gt;Nature&lt;/span&gt; [&lt;span style="font-weight: bold;"&gt;464&lt;/span&gt; (7289)] published a series of interesting and revealing articles, including an Editorial and (historical) accounts from Francis Collins and Craig Venter. I  browsed through the whole list, and I especially liked the News article titled "&lt;a href="http://www.nature.com/news/2010/100331/full/464664a.html"&gt;Life is complicated&lt;/a&gt;" by &lt;a href="http://www.erikacheck.com/biography.html"&gt;Erika Check Hayden&lt;/a&gt;, a senior reporter for &lt;i&gt;Nature&lt;/i&gt;. I was attracted by its catchy title and brief summary: "The more biologists look, the more complexity there seems to be. Erika Check Hayden asks if there's a way to make life simpler." Obviously, the title of this blog post was inspired by the sources. &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Over the past decade, the Human Genome Project has helped to clarify the number of genes from previously assumed ~100,000 to the "true" number of only ~21,000. The dramatically reduced number of genes illustrates the crucial importance of non-coding (used to be called "junk") DNA to biology, yet what non-coding DNA does is still befuddling. Nowadays, we are faced with data deluge from sequencing, gene expression, protein (transcription factor) binding,  and other new technologies. The complexity of biology has grown significantly, instead of simplified, even for the most extensively studied protein p53. As put by Jennifer Doudna, “The more we know, the more we realize there is to know.”&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;The community has gradually realized that information gathering does not always bring corresponding increase of meaningful biological insights. We are facing an age of “drowning in information, starved for knowledge.” For example, systems biology, a new discipline “supposed to help scientists make sense of the complexity”, has turned out that “In many cases, the models themselves quickly become so complex that they are unlikely to reveal insights about the system, degenerating instead into mazes of interactions that are simply exercises in cataloguing.” I cannot agree more with the comment by Leonid Kruglyak: it is naive to think that “you can simply take very large amounts of data and run a data-mining program and understand what is going on in a generic way.”&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Are we lost in the sea of biological data? Not necessarily. Eric Davidson's work is an excellent example in “taking smarter systems approaches” to reveal overarching biological rules. Instead of a “machine learning” type top-down approach, the “insights have come when scientists systematically analyse the components of processes that are easily manipulated in the laboratory — largely in model organisms. They’re still using &lt;b&gt;a systems approach, but focusing it through a more traditional, bottom–up lens&lt;/b&gt;.” Through this systemic bottom-up approach, Davidson's group has deciphered the mechanism of how gene expressions are controlled through regulatory interactions and specify the construction of sea-urchin’s skeleton.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;Interestingly, Eric Davidson gave a seminar at C2B2 on Thursday, April 22, titled "Causal Systems Biology: the Sea Urchin Embryo Gene Regulatory Network". I was very impressed by his talk, and the points he made in the &lt;i&gt;Nature&lt;/i&gt; News article.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5167167887119673310?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5167167887119673310/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/04/life-is-complicated-is-there-way-to.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5167167887119673310'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5167167887119673310'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/04/life-is-complicated-is-there-way-to.html' title='Life is complicated -- is there a way to make it simpler?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-548253043729217058</id><published>2010-04-16T22:16:00.007-04:00</published><updated>2010-04-16T23:46:19.086-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>3DNA in the June 2010 issue of JBSD on "Current Perspectives on Nucleosome Positioning"</title><content type='html'>While updating 3DNA citations this week, I noticed five of them are from the same June 2010 issue of &lt;span style="font-style:italic;"&gt;JOURNAL OF BIOMOLECULAR STRUCTURE &amp; DYNAMICS&lt;/span&gt; (JBSD), which is focused on "Current Perspectives on Nucleosome Positioning". Most of the papers in this issue are from well-known laboratories in computational structural biology. It is my pleasure to see 3DNA being widely used in the important research area of nucleosome positioning.&lt;br /&gt;&lt;br /&gt;I browsed through the abstracts of all the papers in this JBSD issue to refresh my knowledge of this field. While DNA sequence surely plays some role in nucleosome positioning, I remain to be convinced of the existence of a nucleosome "code" (yet) in the sense of the generally applicable "genetic code". Overall, DNA is so flexible and the signal is so week,  thus allowing for tailored data fitting to specific analysis, which is not transferable to other situations. Clearly, the area is hot, yet still wide open.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-548253043729217058?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/548253043729217058/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/04/3dna-in-june-2010-issue-of-jbsd-on.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/548253043729217058'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/548253043729217058'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/04/3dna-in-june-2010-issue-of-jbsd-on.html' title='3DNA in the June 2010 issue of JBSD on &quot;Current Perspectives on Nucleosome Positioning&quot;'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-3363344418313885238</id><published>2010-04-08T20:36:00.008-04:00</published><updated>2010-04-09T21:01:59.601-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>NSMB editorial: "Making your point-by-point"</title><content type='html'>In the April 2010 issue of Nature Structural &amp;amp; Molecular Biology [NSMB, 17(4)], there is another interesting editorial, titled "&lt;a href="http://www.nature.com/nsmb/journal/v17/n4/full/nsmb0410-389.html"&gt;Making your point-by-point&lt;/a&gt;". This editorial addresses an important issue in the process of publishing papers in peer-reviewed journals, that is: how to make effective point-by-point response to "those ever-demanding editors and reviewers"?&lt;br /&gt;&lt;blockquote&gt;Overall, it can be helpful to put yourself in the reviewer’s shoes and compose a response s/he would find appropriate, where the concerns raised are considered and fully addressed. In its ideal state, the review process is a positive and constructive back and forth, an intellectual discussion in which the manuscript is the ultimate beneficiary.&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;Here is my re-cap of the main points, as I understand it. I am also taking this opportunity to read this one-page editorial one more time.&lt;br /&gt;&lt;p style="font-size: 130%; font-weight: bold;"&gt;What to do?&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Keep to the point&lt;/span&gt; – "makes a series of [succinct] points in response [directly] to each point raised by the reviewers."&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Keep it objective&lt;/span&gt; – be diplomatic in your point-by-point response to the reviewers, "even if the reviewer’s wording might have seemed overly strong." You could be forthright in your cover letter to the editors, though.&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Keep things under control&lt;/span&gt; – "Know when to go to the bench and when to argue."&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;The scope of things&lt;/span&gt; – "Say clearly and succinctly" when "some requests might genuinely be beyond the scope of the manuscript or might simply be unfeasible." "Try not to &lt;a href="http://en.wikipedia.org/wiki/Salami_slicing"&gt;salami-slice&lt;/a&gt;", one strong and solid paper is (much) better than two weak ones!&lt;/li&gt;&lt;/ul&gt;&lt;p style="font-size: 130%; font-weight: bold;"&gt;Some don'ts, especially:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Mentioning celebrity endorsements&lt;/span&gt;. "you never know—they could be moonlighting as your most critical anonymous reviewer."&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Trying to guess who the reviewers are&lt;/span&gt; when communicating to the editors – it does &lt;span style="font-style: italic;"&gt;not&lt;/span&gt; help. Additionally, you could be plain wrong in your guess (again, &lt;span style="font-weight: bold;"&gt;you never know&lt;/span&gt;) – they are anonymous, literally.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;Generally speaking, I think authors should be appreciative of the work of the reviewers and editors. Occasionally, I serve as a reviewer and I know the time and efforts it takes to make a fair and thorough assessment of a manuscript.&lt;br /&gt;&lt;br /&gt;It is certainly not just because of politeness that in our &lt;a href="http://www.natureprotocols.com/2008/07/03/3dna_a_versatile_integrated_so.php"&gt;2008 3DNA &lt;i&gt;Nature Protocols&lt;/i&gt; paper&lt;/a&gt;, we acknowledged:&lt;/div&gt;&lt;blockquote&gt;We also thank the editor and the anonymous reviewers whose comments helped to clarify the presentation of the protocols.&lt;/blockquote&gt;&lt;div&gt;More recently, in &lt;a href="http://nar.oxfordjournals.org/cgi/content/full/gkq155"&gt;our 2010 &lt;i&gt;NAR&lt;/i&gt; GpU paper&lt;/a&gt;, we acknowledged:&lt;/div&gt;&lt;blockquote&gt;They also thank the anonymous reviewers, whose comments helped clarify the presentation of the manuscript.&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-3363344418313885238?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/3363344418313885238/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/04/nsmb-editorial-making-your-point-by.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3363344418313885238'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3363344418313885238'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/04/nsmb-editorial-making-your-point-by.html' title='NSMB editorial: &quot;Making your point-by-point&quot;'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-3331646567262430747</id><published>2010-04-02T20:29:00.004-04:00</published><updated>2010-04-07T00:12:31.882-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='bioinformatics'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>What's special about the GpU dinucleotide platform?</title><content type='html'>Recently, I (together with Drs. Wilma Olson and Harmen Bussemaker – a team with a unique combination of complementary expertise) published a new article in &lt;i&gt;Nucleic Acids Research &lt;/i&gt;(NAR): "&lt;a href="http://nar.oxfordjournals.org/cgi/content/full/gkq155"&gt;The RNA backbone plays a crucial role in mediating the intrinsic stability of the GpU dinucleotide platform and the GpUpA/GpA miniduplex&lt;/a&gt;". The key findings of this work are summarized in the abstract:&lt;br /&gt;&lt;blockquote&gt;The side-by-side interactions of nucleobases contribute to the organization of RNA, forming the planar building blocks of helices and mediating chain folding. Dinucleotide platforms, formed by side-by-side pairing of adjacent bases, frequently anchor helices against loops. Surprisingly, GpU steps account for over half of the dinucleotide platforms observed in RNA-containing structures. Why GpU should stand out from other dinucleotides in this respect is not clear from the single well-characterized H-bond found between the guanine N2 and the uracil O4 groups. Here, we describe how an RNA-specific H-bond between O2'(G) and O2P(U) adds to the stability of the GpU platform. Moreover, we show how this pair of oxygen atoms forms an out-of-plane backbone ‘edge’ that is specifically recognized by a non-adjacent guanine in over 90% of the cases, leading to the formation of an asymmetric miniduplex consisting of ‘complementary’ GpUpA and GpA subunits. Together, these five nucleotides constitute the conserved core of the well-known loop-E motif. The backbone-mediated intrinsic stabilities of the GpU dinucleotide platform and the GpUpA/GpA miniduplex plausibly underlie observed evolutionary constraints on base identity. We propose that they may also provide a reason for the extreme conservation of GpU observed at most 5'-splice sites.&lt;/blockquote&gt;&lt;div&gt;&lt;br /&gt;As a nice surprise, this publication was selected by NAR as a featured article! According to the NAR website:&lt;/div&gt;&lt;blockquote&gt;Featured Articles highlight the &lt;span style="font-weight: bold;"&gt;best papers&lt;/span&gt; published in NAR. These articles are chosen by the Executive Editors on the recommendation of Editorial Board Members and Referees. They represent the &lt;span style="font-weight: bold;"&gt;top 5% of papers in terms of originality, significance and scientific excellence&lt;/span&gt;.&lt;/blockquote&gt;&lt;div&gt;&lt;br /&gt;I feel very gratified with the "extra" recognition. From my own perspective, I can easily rank this paper as &lt;span style="font-style: italic; font-weight: bold;"&gt;the top one in my publication list&lt;/span&gt;: from the very beginning, I has been struck by the simplicity and elegance of the GpU story. Hopefully, time will verify the validity of this scientific contribution. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Behind the hood, though, there is a long, complex (sometimes perplexing), yet interesting story associated with this work. Here is how it got started. While writing the &lt;a href="http://www.nature.com/nprot/journal/v3/n7/full/nprot.2008.104.html"&gt;3DNA 2008 Nature Protocols (NP) paper&lt;/a&gt;, I selected the (previously undocumented) &lt;a href="http://xiang-jun.blogspot.com/2010/03/what-findpair-in-3dna-can-do.html"&gt;"-p" option of "find_pair"&lt;/a&gt; to showcase its capability to identify higher-order base associations, using the large ribosomal subunit (1JJ2) as an example. I noticed the unexpected O2'(G)⋅⋅⋅O2P(U) H-bond within the GpU dinucleotide platform in the pentaplet shown left in Figure A below. I was well aware of Leontis-Westholf's pioneering work on "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/11345429"&gt;Geometric nomenclature and classification of RNA base pairs&lt;/a&gt;" which involves three distinct edges – the Watson-Crick edge, the Hoogsteen edge, and the Sugar edge, yet without taking into consideration of possible sugar-phosphate backbone interactions (Figure B below). So I decided to double-check, just to be sure that the H-bond was not spurious due to defects in the H-bond detecting scheme of "find_pair", and the results were very surprising.&lt;span style="text-decoration: underline;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_CEuD25cqpCE/S7aMUH7cNbI/AAAAAAAAADU/C3afhrotVoI/s1600/backbone_edge.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 400px; height: 240px;" src="http://1.bp.blogspot.com/_CEuD25cqpCE/S7aMUH7cNbI/AAAAAAAAADU/C3afhrotVoI/s400/backbone_edge.png" alt="" id="BLOGGER_PHOTO_ID_5455702275959567794" border="0" /&gt;&lt;/a&gt;The following section was re-added into the 3DNA NP paper in the very last revision:&lt;br /&gt;&lt;blockquote&gt;It is also worth noting that the G1971–U1972 platform is stabilized not only by the well-characterized G(N2)⋅⋅⋅U(O4) H-bond interaction, but also by a little-noticed G(O2’)⋅⋅⋅U(O2P) sugar-phosphate backbone interaction (Fig. 6a). Examination of the 50S large ribosomal unit (1JJ2) alone reveals ten such double H-bonded G–U platforms, far more occurrences than those registered by any other dinucleotide platform (including A–A) in this structure. Apparently, the G–U platform is more stable than other platforms with only a single base–base H-bond interaction. We are currently investigating this overrepresented G–U dinucleotide platform in other RNA structures. (p.1226) &lt;/blockquote&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-3331646567262430747?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/3331646567262430747/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/04/whats-special-about-gpu-dinucleotide.html#comment-form' title='8 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3331646567262430747'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3331646567262430747'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/04/whats-special-about-gpu-dinucleotide.html' title='What&apos;s special about the GpU dinucleotide platform?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_CEuD25cqpCE/S7aMUH7cNbI/AAAAAAAAADU/C3afhrotVoI/s72-c/backbone_edge.png' height='72' width='72'/><thr:total>8</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-4303542282386117432</id><published>2010-03-26T23:45:00.010-04:00</published><updated>2010-03-31T23:28:49.280-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='bioinformatics'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>What find_pair in 3DNA can do</title><content type='html'>Structural analysis of nucleic acids &lt;i&gt;used to&lt;/i&gt; be a rather tedious process, especially for irregular, complicated RNA structures and nucleic-acid/protein complexes [&lt;i&gt;e.g.&lt;/i&gt;, the large ribosomal subunit of &lt;i&gt;H. marismortui &lt;/i&gt;(1JJ2)]. Without valid base-pairing information arranged properly in a duplex fragment as input, analysis programs such as Curves+ and analyze/cehs in 3DNA would produce meaningless results. The program find_pair in 3DNA was originally created to solve this specific problem, &lt;i&gt;i.e.&lt;/i&gt;, to generate an input file to 3DNA analysis routines directly from a nucleic-acid containing structure in PDB format. It is what makes nucleic acids structural analysis a routine process — running through thousands of structures from NDB/PDB can be fully automated.&lt;br /&gt;&lt;br /&gt;Overall, find_pair has more than fulfilled the goal of its initial design (as stated above). Over the past few years, its functionality has been expanded and &lt;i&gt;continuously&lt;/i&gt; refined (&lt;a class="postlink" href="http://en.wikipedia.org/wiki/Kaizen"&gt;kaizen&lt;/a&gt;; 改善), making find_pair itself a full-featured application. Now, it is efficient, robust, and its simple command line interface allows for easy integration with other bioinformatics tools. Properly acknowledged &lt;i&gt;or otherwise&lt;/i&gt;, find_pair has served (at least) as one of the key components in many other applications (RNAView, BPS, SwS, ARTS, to name just a few). Indeed, find_pair is by far the single program in 3DNA that has received the most questions (as evident from the 3DNA forum).&lt;br /&gt;&lt;br /&gt;While I still have to write a method paper to describe the underlying algorithms of find_pair in detail — &lt;i&gt;i.e.&lt;/i&gt;, for identifying nucleotides, H-bonds, base pairs, high-order base associations, and double helical regions — the basic idea is very intuitive and easy to understand: as summarized in our recent &lt;a href="http://nar.oxfordjournals.org/cgi/content/full/gkq155"&gt;GpU paper&lt;/a&gt;, find_pair is purely geometric based (with user adjustable parameters) and allows for the identification of canonical Watson–Crick as well as non-canonical base pairs, made up of normal or modified bases, regardless of tautomeric or protonation state. For example, in the &lt;a href="http://nar.oxfordjournals.org/cgi/content/full/gkq155"&gt;GpU paper&lt;/a&gt;, we chose the following set of stringent parameters to ensure that the geometry of each identified base pair is nearly planar and supports at least one inter-base H-bond: (i) a vertical distance (stagger) between base planes ≤ 1.5 Å; (ii) an angle between base normal vectors ≤ 30°; and (iii) a pair of nitrogen and/or oxygen base atoms at a distance ≤ 3.3 Å. Other criteria (documented or otherwise), such as the distance between the origins of the two standard base reference frames, are just filters to speed up the calculations.&lt;br /&gt;&lt;br /&gt;In a nutshell, find_pair has the following two core functionalities:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;The default is to generate input to the analysis routines in 3DNA (analyze/cehs) for double helices. However, there are many more works under the hood than just identifying base pairs: the base pairs must be in proper sequential order, and each strand must be in 5' to 3' direction, for the calculated step parameters (twist, roll etc) to make sense. Moreover, with the "-c" option, one gets an input file to &lt;a href="http://xiang-jun.blogspot.com/2009/08/curves-vs-3dna.html"&gt;Curves (but not Curves+, yet)&lt;/a&gt;; with the "-s" or "-1" option, find_pair treats the whole structure as one single strand, and is useful for getting all backbone torsion angles.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;Detect all base pairs (regardless of in double helical regions or not) and higher-oder (3+) base associations with the "-p" option. This feature (in its preliminary form) was there starting from at least v1.5, which was released at the end of 2002 (just before I left Rutgers), but it was&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt; intentionally  &lt;/span&gt;not &lt;/b&gt;documented. The source code of find_pair (as part of 3DNA) was tested and shared within Rutgers (NDB and Dr. Olson's laboratory) before any 3DNA paper was published, and served as the basis for several other projects. We also offered 3DNA (with source code) to a few RNA experts for comments; but we received either no responses or politely-worded negative ones. Things did not work out as (what I thought) they should have been, but that's life and I have learned my lessons. The "-p" option was first explicitly mentioned in the &lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=3&amp;amp;t=97"&gt;3DNA 2008 Nature Protocols paper&lt;/a&gt;, to illustrate how to identify the two pentaplets in the large ribosomal subunit of &lt;i&gt;H. marismortui&lt;/i&gt; (1JJ2).&lt;/li&gt;&lt;/ol&gt;It is interesting to mention the two papers I've recently come across: the first is on DNA-protein interactions and the second on RNA base-pairing, where new algorithms were developed to detect base pairs and their performances were compared with find_pair. In each of the two cases, it was claimed that find_pair missed certain pairs where the new methods succeeded. As it turned out, however, in the first case, simply relaxing find_pair's default H-bond distance cut-off 4.0 Å to 4.5 Å, as used by the authors, virtually all the missing pairs were recovered. In the second case, the "-p" option, which should have been, was simply not specified.&lt;br /&gt;&lt;br /&gt;After nearly a decade of extensive real-world applications and refinements, it is safe to say that find_pair is now a versatile and practical tool for nucleic acids structure analysis. Of course, I will continue to support and further refine find_pair as I see fit. Once in a while, I just cannot stop but to think that find_pair is to nucleic acids what DSSP is to proteins: simple and elegant. As more people become aware of its existence, I would expect find_pair to gain even more widespread usage, especially in RNA-structure related research areas.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-4303542282386117432?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/4303542282386117432/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/03/what-findpair-in-3dna-can-do.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4303542282386117432'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4303542282386117432'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/03/what-findpair-in-3dna-can-do.html' title='What find_pair in 3DNA can do'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-257808701237200329</id><published>2010-03-20T17:27:00.005-04:00</published><updated>2010-03-22T17:47:55.155-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>One computer, three operating systems</title><content type='html'>While so far I have been quite happy with my new MacBook Pro, running Mac OS X 10.6 (Snow Leopard), I still feel more comfortable with the Ubuntu Linux programming environment I have been using for the past few years. Moreover, to make sure that my software (e.g., 3DNA) is strictly ANSI C compliant, and compiles without changes on the most commonly used operating systems (OSes), I need to have direct access to Linux and Windows. Luckily, the Intel-based hardware architecture of MacBook Pro and the free VirtualBox software make it possible to have the three OSes – Mac OS X, Ubuntu Linux, and Windows – in one computer.&lt;br /&gt;&lt;br /&gt;Installing VirtualBox on Mac OS X was a snap. Specifically, I added the following two guest OSes:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Windows XP,  with 1 GB RAM and 70 GB (virtual) hard disk&lt;/li&gt;&lt;li&gt;Ubuntu 9.10,  with 2 GB RAM and 90 GM disk space&lt;/li&gt;&lt;/ol&gt;For seamless integration between each of the two guest OSes and the host Mac OS X, and for improved performance, I also created shared folders and installed guest additions for Windows and Linux. For Windows XP, the process had been quite straight forward. For Linux guest addition, however, I had some problems and solved them by following the instructions on "&lt;a href="http://www.mydigitallife.info/2009/11/25/how-to-install-virtualbox-guest-additions-in-linux-ubuntu-debian-fedora-opensuse-red-hat-and-more/"&gt;How To Install VirtualBox Guest Additions in Linux&lt;/a&gt;".&lt;br /&gt;&lt;br /&gt;Now in Fullscreen Mode (command-F), I can run Ubuntu Linux or Windows XP as if it is native for each. Very cool!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-257808701237200329?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/257808701237200329/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/03/one-computer-three-operating-systems.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/257808701237200329'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/257808701237200329'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/03/one-computer-three-operating-systems.html' title='One computer, three operating systems'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-6745829946162419069</id><published>2010-03-13T11:27:00.012-05:00</published><updated>2010-03-25T14:39:42.881-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>Hoogsteen base-pair</title><content type='html'>The A·U (or A·T) Hoogsteen pair is a well-known base pair (bp), named after the scientist who discovered it. As shown in the Figure below (left), in the Hoogsteen bp scheme, adenine uses its N7 and N6 atoms (at the major groove edge)  to form two H-bonds with the N3 and O4 atoms from uracil, respectively. Interestingly, if the uracil base ring is flipped around the N7(A)&lt;span lang="ja"&gt;…&lt;/span&gt;N3(U) H-bond by 180 degrees, N6(A) can also form an H-bond with O2(U), i.e., N6(A)&lt;span lang="ja"&gt;…&lt;/span&gt;O2(U): this pairing scheme is called the reverse Hoogsteen bp (right).&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_CEuD25cqpCE/S52kIK85AjI/AAAAAAAAADE/18yDrixNofw/s1600-h/hoosteen.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px; height: 151px;" src="http://1.bp.blogspot.com/_CEuD25cqpCE/S52kIK85AjI/AAAAAAAAADE/18yDrixNofw/s320/hoosteen.png" alt="" id="BLOGGER_PHOTO_ID_5448691584473825842" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;I first came to know about the Hoogsteen bp from Saenger's book ("Principles of Nucleic Acid Structure"). Over the years, I have read many articles mentioning the Hoogsteen bp and touched this topic myself in the 2003 3DNA NAR publication. However, I have never read Hoogsteen's two original publications on this topic until recently:&lt;br /&gt;&lt;ol&gt;&lt;li&gt; The two-page long preliminary report, titled "The structure of crystals containing a hydrogen-bonded complex of 1-methylthymine and 9-methyladenine", was published in &lt;span style="font-style: italic;"&gt;Acta Cryst.&lt;/span&gt; (1959). &lt;span style="font-weight: bold;"&gt;12&lt;/span&gt;, 822-3. The paper contained only a single reference to the Watson-Crick DNA structure paper, published in &lt;span style="font-style: italic;"&gt;Nature&lt;/span&gt; in 1953. I found it very revealing to understand why Hoogsteen used the methyl-ed derivatives of thymine and adenine, and how the failed initial interpretation of the experimental "vector-density map" using the Watson-Crick A-T bp led to the discovery of the new base-pairing scheme:&lt;br /&gt;&lt;blockquote&gt;The fact that the first trial structure could not be refined led to a more critical scrutiny of the generalized projection and a greater emphasis on the significance of certain spurious peaks and on relatively large variations in the heights of peaks that were assumed to represent atoms. The correct structure was finally discovered by changing the positions of a few atoms in the 9-methyladenine portion of the asymmetric unit.&lt;br /&gt;&lt;/blockquote&gt;&lt;/li&gt;&lt;li&gt;The more extensive account of the Hoogsteen bp story, titled "The Crystal and Molecular Structure of a Hydrogen-Bonded Complex Between 1-Methylthymine and 9-Methyladenine", published in &lt;span style="font-style: italic;"&gt;Acta Cryst.&lt;/span&gt; (1963) &lt;span style="font-weight: bold;"&gt;16&lt;/span&gt;, 907-16.&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;I like these two papers, and more generally those focused-articles, where authors get directly to a point and addressed it thoroughly and clearly. Most publications nowadays are very ambitious, trying to solve "big problems": the papers are generally far more complicated and often have "reproducibility" problems.&lt;br /&gt;&lt;br /&gt;As a side note, the term &lt;span style="font-style: italic;"&gt;Hoogsteen "edge"&lt;/span&gt; appears quite frequently in today's publications of RNA structures: in the Leontis-Westhof bp classification scheme, the term simply means the major groove edge in what would be a Watson-Crick bp geometry.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-6745829946162419069?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/6745829946162419069/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/03/hoogsteen-base-pair.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6745829946162419069'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6745829946162419069'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/03/hoogsteen-base-pair.html' title='Hoogsteen base-pair'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_CEuD25cqpCE/S52kIK85AjI/AAAAAAAAADE/18yDrixNofw/s72-c/hoosteen.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2977103338852376979</id><published>2010-03-06T18:16:00.005-05:00</published><updated>2010-03-06T23:05:24.966-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>MacMost, a valuable resource to help you get the most of your Mac OS X</title><content type='html'>Upon receiving my new Mac OS X Snow Leopard, I googled around, trying to find some tutorials on the web. Somehow, I came across a video clip by Gary Rosenzweig. I then visited &lt;a href="http://macmost.com/"&gt;MacMost.com&lt;/a&gt; and watched more video clips over there during the past week.&lt;br /&gt;&lt;br /&gt;Overall, I like the videos quite a bit: ~5 minutes long each, these podcasts show various Mac-related tips and tricks in an easy to follow fashion. Specifically, I like the following:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;#347: "Quick Look" – a functionality of looking at the contents of a file without opening it. "Quick Look" seems to be unique to Mac OS X since I am not familiar with it in Linux and Windows. So far, I have found it especially handy in Mail for quickly checking contents of attachments.&lt;/li&gt;&lt;li&gt;#357: "Do Macs Need Anti-Virus Software?" – it is assuring to know that "There are currently no active Mac viruses", and helpful to be aware that "anti-virus software could cause unexpected problems."&lt;/li&gt;&lt;li&gt;#363: "Learning to Program with &lt;a href="http://info.scratch.mit.edu/About_Scratch"&gt;Scratch&lt;/a&gt;" – it is from this video clip that I came to know the Scratch programming language from the MIT Media Lab. Unlike professional computer languages such as C, C++, Java, Ruby, Perl etc, Scratch targets the general public, especially for kids to learn mathematical and computational ideas by programing using a simple drag-and-drop interface. Using Scratch, it is really easy and cool create interactive stories and animations, and to share them on the web.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;I will certainly keep visiting back MacMost.com and watch more videos as they become available. Little by little, I will learn new tricks to make my Mac life more enjoyable.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2977103338852376979?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2977103338852376979/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/03/macmost-valuable-resource-to-help-you.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2977103338852376979'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2977103338852376979'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/03/macmost-valuable-resource-to-help-you.html' title='MacMost, a valuable resource to help you get the most of your Mac OS X'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-3875305052559325294</id><published>2010-02-26T21:03:00.004-05:00</published><updated>2010-02-28T12:57:16.092-05:00</updated><title type='text'>Mac OS X Snow Leopard -- I'm loving it (mostly)!</title><content type='html'>Recently, when it was time for a new laptop, I decided to buy a MacBook Pro (Intel-based with Mac OS X 10.6.2 -- Snow Leopard).  Over the past few days, I have been playing around with it, migrating files from my Ubuntu Linux box. So far, things have gone through smoothly, thus by and large, I am enjoying my new Mac.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Over the years, I have been using Ubuntu Linux and I have been very happy with it, especially for software development. Lyx and OpenOffice are handy for writing technical documents. However, I have realized that when it comes to write a manuscript for publication, and to communicate effectively with non-Linux collaborators, MS Word (with EndNote) is the standard. So I set up a Windows XP virtual machine via VirtualBox on my Ubuntu Linux box, which avoids the problem of dual booting and allows for easy file sharing between Linux and Windows.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Mac OS X is Unix/Linux based but has native support for MS Office and Adobe Suite of programs, so it seems an ideal choice for a new laptop. Mac OS X 10.6 (Snow Leopard) is claimed to be "The world's most advanced operating system. Finely tuned." Other things aside, I do appreciate the fact that 10.6 (Snow Leopard) is a refinement of 10.5 (Leopard) from installation to shutdown -- "In ways big and small, Mac OS X Snow Leopard makes your Mac faster, more reliable, and easier to use."&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;So far, I have configured Mail to access my Columbia emails. I must say that Mail is way better than Columbia's CubMail web-interface, and I like Mail's native integration with iCal and Address Book. Safari still needs some getting used to, from my mostly Firefox experience. However, it is nice to find that some websites, which does not work in Firefox but IE, display properly with Safari. Preview appears to be powerful for PDF and image viewing and manipulations. I have installed Xcode, and may explore it more, if nothing but to see what an IDE has to offer. Of course, it is nice to have direct access to MS Office (mostly for Word and PowerPoint, so no need to play around with OpenOffice), EndNote, Adobe (Acrobat, Photoshop, Illustrator), etc. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Some nuisances up to this point:&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;Keyboard missing numeric keypad and Home/End/PgUp/PgDn&lt;/li&gt;&lt;li&gt;Ctrl-C/V etc keyboard shortcuts I am used to now become Command-C/V etc&lt;/li&gt;&lt;li&gt;File and directory names are &lt;b&gt;&lt;span class="Apple-style-span"  style="color:#FF0000;"&gt;not&lt;/span&gt;&lt;/b&gt; case-sensitive -- most surprising!&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;Overall, my new MacBook Pro is a very nice toy to play. As I become more familiar with it, I may like it more, hopefully.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-3875305052559325294?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/3875305052559325294/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/02/mac-os-x-snow-leopard-im-loving-it.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3875305052559325294'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3875305052559325294'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/02/mac-os-x-snow-leopard-im-loving-it.html' title='Mac OS X Snow Leopard -- I&apos;m loving it (mostly)!'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-6884235203630943834</id><published>2010-02-06T11:03:00.004-05:00</published><updated>2010-04-11T20:54:39.271-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>NSMB editorial: "Scientific writing 101"</title><content type='html'>In the February 2010 issue of Nature Structural &amp;amp; Molecular Biology [NSMB, &lt;span style="font-weight: bold;"&gt;17&lt;/span&gt;(2), p.139], there is a nice Editorial titled "Scientific writing 101". This short one-page essay is a good example of a (scientific) writing that is "a pleasures of reading".&lt;br /&gt;&lt;blockquote&gt;"Less is more when it comes to writing a good scientific paper. Tell a story in clear, simple language and keep in mind the importance of the ‘big picture’."&lt;/blockquote&gt;Specifically, the editorial makes the following points:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Tell a story&lt;/span&gt;. A scientific paper is not a chronology; the data should be presented and interpreted in context.&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Be clear&lt;/span&gt;. "Clear, simple language allows the data and their interpretation to come through."&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Provide an informative title and abstract&lt;/span&gt;. "Make the abstract clear and try to get the ‘&lt;span style="font-style: italic;"&gt;big picture&lt;/span&gt;’ across."&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Make the introduction short and concise.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Clearly distinguish Results from Discussion&lt;/span&gt;. "Discussion should put those results in a broader context." It "should be an &lt;span style="font-style: italic;"&gt;interpretation&lt;/span&gt; of those results..."&lt;/li&gt;&lt;li&gt;Cover letter is important. You should spell check your manuscript, and number the pages, etc.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;In this blog post, I am just recapping the key points of the editorial, and taking  the opportunity to re-read it. Following the simple principles outlined in the editorial would be beneficial to everyone in the scientific community.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-6884235203630943834?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/6884235203630943834/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/02/nsmb-editorial-scientific-writing-101.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6884235203630943834'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6884235203630943834'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/02/nsmb-editorial-scientific-writing-101.html' title='NSMB editorial: &quot;Scientific writing 101&quot;'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-968912893008813654</id><published>2010-01-30T19:27:00.004-05:00</published><updated>2010-01-31T00:32:54.182-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Chinese new year greeting card</title><content type='html'>Recently, I received the following nice new year greeting card from &lt;a href="http://tatooine.u-strasbg.fr/%7Esws/SwS.html"&gt;Dr. Pascal Auffinger (IBMC-CNRS, France)&lt;/a&gt;. While never meeting each other in person, we have communicated via numerous emails: over the years, we have discussed extensively on 3DNA-related topics, and on nucleic acid structures in general. Pascal is one of my respected scientists, and &lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=3&amp;amp;t=70"&gt;his greeting card  makes my support of 3DNA so gratifying&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Somehow, the picture reminds me Mountain Tai I climbed while at college, and the famous poem "会当凌绝顶，一览众山小".&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_CEuD25cqpCE/S2TUfQce-jI/AAAAAAAAACk/4gEc92FofWY/s1600-h/pascal2010.jpg"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px; height: 213px;" src="http://2.bp.blogspot.com/_CEuD25cqpCE/S2TUfQce-jI/AAAAAAAAACk/4gEc92FofWY/s320/pascal2010.jpg" alt="" id="BLOGGER_PHOTO_ID_5432700683971131954" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;According to the Chinese Zodiac, the &lt;a href="http://www.springsgreetingcards.com/catalogs/store.asp?pid=250913"&gt;Year of 2010 is the Year of Tiger&lt;/a&gt; (Feb. 14, 2010 to Feb. 2, 2011). Interestingly, this Chinese New Year's day coincides with  Valentine's Day, and it is on Sunday.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-968912893008813654?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/968912893008813654/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/01/chinese-new-year-greeting-card.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/968912893008813654'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/968912893008813654'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/01/chinese-new-year-greeting-card.html' title='Chinese new year greeting card'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_CEuD25cqpCE/S2TUfQce-jI/AAAAAAAAACk/4gEc92FofWY/s72-c/pascal2010.jpg' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-7923826569956312584</id><published>2010-01-23T21:55:00.006-05:00</published><updated>2010-01-23T22:42:53.314-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Chemical diagram of Watson-Crick base-pairs</title><content type='html'>Once in awhile, I need to refresh my memory about the chemical identities of the most common nucleobases: A, C, G, T/U. Sometimes, it is also necessary to explain to non-(bio)chemists about the concept of H-bond donor vs acceptor, and the major- vs minor-groove of the DNA double helix. In such cases, I use the following (customized) chemical diagram of Waton-Crick base-pairs (WC-bp):&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_CEuD25cqpCE/S1u6yYcCYcI/AAAAAAAAACc/Fueq6j0vI7w/s1600-h/wc_pair.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 294px; height: 320px;" src="http://1.bp.blogspot.com/_CEuD25cqpCE/S1u6yYcCYcI/AAAAAAAAACc/Fueq6j0vI7w/s320/wc_pair.png" alt="" id="BLOGGER_PHOTO_ID_5430139150441537986" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Before taking the effort to create my own version of the Waton-Crick base-pair diagram, I googled around and found many illustrations (like the one in &lt;a href="http://en.wikipedia.org/wiki/Base_pair"&gt;wikipedia&lt;/a&gt;). However, none of them suits my needs perfectly:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Trained as a chemist, I would like to see chemical bond types (double vs. single bond);&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Working extensively with PDB format, I want to have the atom numbering information as well. &lt;/li&gt;&lt;/ul&gt;So I ended up to (re)create my own version of the WC-bp diagram: I used &lt;a href="http://ruby.chemie.uni-freiburg.de/%7Emartin/chemtool/"&gt;Chemtool&lt;/a&gt; to sketch the framework, and &lt;a href="http://www.xfig.org/"&gt;Xfig&lt;/a&gt; to fine-tune it. Overall, the diagram serves my purpose quite well, and hopefully others would find it useful as well.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-7923826569956312584?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/7923826569956312584/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/01/chemical-diagram-of-watson-crick-base.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7923826569956312584'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7923826569956312584'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/01/chemical-diagram-of-watson-crick-base.html' title='Chemical diagram of Watson-Crick base-pairs'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_CEuD25cqpCE/S1u6yYcCYcI/AAAAAAAAACc/Fueq6j0vI7w/s72-c/wc_pair.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-4535285654757854213</id><published>2010-01-07T21:38:00.003-05:00</published><updated>2010-01-07T22:28:59.763-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Requests for SCHNAaP/SCHNArP source code</title><content type='html'>Recently, I received several requests for the &lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=9&amp;amp;t=120"&gt;source code of the SCHNAaP/SCHNArP&lt;/a&gt;, a software package for the analysis and rebuilding of double helical nucleic acid structures. This suite of programs was developed ten years ago during my &lt;a href="http://xiang-jun.blogspot.com/2009/05/phd-pages-on-dna-base-stacking.html"&gt;PhD work on DNA base-stacking interactions&lt;/a&gt; with Dr. Chris Hunter at the University of Sheffield, England.&lt;br /&gt;&lt;br /&gt;Users become interested in SCHNAaP/SCHNArP mostly because of 3DNA, which can be taken as its superseded, &lt;a href="http://xiang-jun.blogspot.com/2009/06/3dna-citations-reach-over-300.html"&gt;more popular version&lt;/a&gt;.  Due to Rutgers' policy of not releasing the source code of 3DNA, users who would like to understand details of the underlying algorithms thus turn back to SCHNAaP/SCHNArP. The interface is a bit aged, but the mathematics is still valid: it could serves well as a start point for those who really want to get into the world of nucleic acid structures.&lt;br /&gt;&lt;br /&gt;Overall, though, I have a mixed feeling in this situation. On one hand, I am happy to see people becoming interested in my (previous) work. On the other hand, however, it also becomes clear that Rutgers' current licensing policy has blocked 3DNA's further circulation and adoption by the scientific community. Given the current trend of open-source software development, I see no reason to continue keeping 3DNA closed source. Making 3DNA open source (under a proper license term, of course) would allow for interested users to get more directly involved in the project, and thus to move the software to the next level.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-4535285654757854213?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/4535285654757854213/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2010/01/requests-for-schnaapschnarp-source-code.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4535285654757854213'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/4535285654757854213'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2010/01/requests-for-schnaapschnarp-source-code.html' title='Requests for SCHNAaP/SCHNArP source code'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-8196749337734176718</id><published>2009-12-19T10:42:00.003-05:00</published><updated>2009-12-19T10:45:36.749-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>ORCID -- an international research identification system?</title><content type='html'>From the &lt;span style="font-style: italic;"&gt;Nature &lt;/span&gt;news article titled "&lt;a href="http://science.thomsonreuters.com/orcid/media/pdf/Credit-where-credit-is-due_Nature_Dec2009.pdf"&gt;Credit where credit is due&lt;/a&gt;" in (&lt;span style="font-weight: bold;"&gt;462:7275&lt;/span&gt;, p. 825 on December 17, 2009), I came cross &lt;a href="http://science.thomsonreuters.com/orcid/"&gt;the ORCID initiative&lt;/a&gt;:&lt;br /&gt;&lt;blockquote&gt;Name ambiguity and attribution are persistent, critical problems imbedded in the scholarly research ecosystem. The ORCID Initiative represents a community effort to establish an open, independent registry that is adopted and embraced as the industry’s &lt;span style="font-style: italic;"&gt;de facto&lt;/span&gt; standard. Our mission is to resolve the systemic name ambiguity, by means of assigning unique identifiers linkable to an individual's research output, to enhance the scientific discovery process and improve the efficiency of funding and collaboration.&lt;/blockquote&gt;Overall, I think it is a good idea. If properly implemented and widely adopted, ORCID could help solve lots of issues associated with various ways of spelling a person's name due to, e.g., cultural differences. For example, put Chinese way, one's family name comes before one's given name, just the opposite of the western convention. Additionally, when a given name has two characters (quite common), there are could be a space or a hyphen (as I normally put in Xiang-Jun) or nothing in between. Combined with possible first name initials, there are already many ways to spell out a Chinese name.&lt;br /&gt;&lt;br /&gt;The  above &lt;em&gt;Nature&lt;/em&gt; article, “Credit Where Credit is Due”, helps introduce the ORCID Initiative. As an specific example, it points to another article on page 843, where &lt;span style="font-style: italic;"&gt;Nature &lt;/span&gt;profiles a research group trying to "complete the reference human genome sequence, which is still &lt;span style="font-weight: bold;"&gt;full of errors&lt;/span&gt; nearly a decade after the first draft was announced in 2000." &lt;span style="font-style: italic;"&gt;Nature &lt;/span&gt;acknowledges that "It is essential work", "But it is also work that offers few academic rewards beyond the satisfaction of a job well done — it is unlikely to result in a high profile publication." Hopefully, by adopting the ORCID system, contributions of such types (e.g., &lt;a href="http://xiang-jun.blogspot.com/2009/07/on-maintaining-3dna-forum.html"&gt;software support and maintenance&lt;/a&gt;) would be more properly acknowledged by the scientific community.&lt;br /&gt;&lt;br /&gt;Given the &lt;a href="http://science.thomsonreuters.com/orcid/gallery.html"&gt;high profile of the founding parties&lt;/a&gt;, I am hopeful that the ORCID initiative would move forward as promised. I will keep an eye on it and see how it evolves.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-8196749337734176718?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/8196749337734176718/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/12/orcid-international-research.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8196749337734176718'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8196749337734176718'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/12/orcid-international-research.html' title='ORCID -- an international research identification system?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-3126243848562062729</id><published>2009-12-18T19:07:00.003-05:00</published><updated>2009-12-19T10:17:01.621-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Ribosomal structure: it helps to know some background information</title><content type='html'>This year's &lt;a href="http://nobelprize.org/nobel_prizes/chemistry/laureates/2009/index.html"&gt;Nobel Prize in Chemistry&lt;/a&gt; has been awarded to Venki Ramakrishnan, Tom Steitz, and Ada Yonath "for studies of the structure and function of the ribosome."&lt;br /&gt;&lt;br /&gt;My connection with ribosomal structure began with the 50S large subunit of &lt;span style="font-style: italic;"&gt;Haloarcula marismortui&lt;/span&gt; solved by Tom Steitz's group. Ever since the fully refined crystal structure at 2.4 Å resolution was published in 2001 (PDB entry: &lt;a href="http://www.rcsb.org/pdb/explore/explore.do?structureId=1JJ2"&gt;1jj2&lt;/a&gt;; NDB code: &lt;a href="http://ndbserver.rutgers.edu/atlas/xray/structures/R/rr0033/rr0033.html"&gt;rr0033&lt;/a&gt;), I have been using it to check 3DNA's applicability. In the two 3DNA papers (2003 NAR and 2008 NP), &lt;span style="font-weight: bold;"&gt;1jj2&lt;/span&gt; was used as an example to illustrate how &lt;span style="font-weight: bold;"&gt;find_pair&lt;/span&gt; can identify higher-order base-associations in complicated RNA containing structures. At the time, though, my understanding of the ribosomal RNA structure was purely geometrical:  for quite a while, I got overwhelmed by the various biological terminologies, including the various &lt;span style="font-weight: bold;"&gt;S&lt;/span&gt;-es: 50S large ribosomal subunit vs. the 23S and 5S rRNA; and of course, the 30S small subunit vs. 16S rRNA.&lt;br /&gt;&lt;br /&gt;Over the past year or so, I have become more interested in RNA structures. After reading a lot of related articles, gradually I feel things are becoming clearer than before. Nevertheless, there is something still missing, since my focus has (mostly) been on recent X-ray crystal structure-related work. My understanding of the ribosomal structure was finally put into context, thanks to following two recent publications:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;One in &lt;span style="font-style: italic;"&gt;Cell&lt;/span&gt; by James Williamson, titled "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20005796"&gt;The Ribosome at Atomic Resolution&lt;/a&gt;".&lt;/li&gt;&lt;li&gt;Another one in &lt;span style="font-style: italic;"&gt;Mol. Cell &lt;/span&gt;(in parallel and at the same time) by Joseph Puglisi, titled "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/20005832"&gt;Resolving the Elegant Architecture of the Ribosome&lt;/a&gt;".&lt;/li&gt;&lt;/ul&gt;These two papers not only summarized the significance of work of the three Nobel laureates — "the atomic resolution structures of the ribosomal subunits provide an extraordinary context for understanding one of the most fundamental aspects of cellular function: protein synthesis" — but also provided background information of  decades of work from other players, including Harry Noller, Peter Moore, and Joachim Frank. Solving the ribosomal structure serves as a good example of how the fact that scientific research is both cooperative and competitive in nature.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-3126243848562062729?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/3126243848562062729/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/12/ribosomal-structure-it-helps-to-know.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3126243848562062729'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3126243848562062729'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/12/ribosomal-structure-it-helps-to-know.html' title='Ribosomal structure: it helps to know some background information'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-1223875098745079571</id><published>2009-12-11T21:28:00.012-05:00</published><updated>2009-12-13T12:58:34.205-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Not all PDB entries are reliable; some could be plain fake</title><content type='html'>With interest, I have browsed the recent thread in the PDB mailing list (pdb-l), "&lt;a href="https://lists.sdsc.edu/pipermail/pdb-l/2009-December/004987.html"&gt;Retraction of 12 Structures&lt;/a&gt;" posted by Michael Sadowski and &lt;a href="https://lists.sdsc.edu/pipermail/pdb-l/2009-December/004991.html"&gt;followed-up&lt;/a&gt; by Kevin Karplus et al. The story is about Krishna Murthy, a former scientist at the University of Alabama at Birmingham (UAB), who has been alleged to &lt;a href="http://blog.al.com/birmingham-news-stories/2009/12/ex-uab_researchers_work_may_be.html"&gt;fabricate protein structures and published papers&lt;/a&gt; on them. Here is an informative comment by &lt;span style="font-weight: bold;"&gt;firebug36&lt;/span&gt; from the above link:&lt;br /&gt;&lt;blockquote&gt;I am a protein crystallographer myself, so just trust me - the results this gentleman [Murthy] published were falsified, and not in a smart way. The structures [for C3b] deposited in the Protein Data Bank made no physical sense.  &lt;p&gt;Allegations against UAB group were first brought to light by several prominent people in the field, and not UAB officials:&lt;/p&gt;  &lt;p&gt;&lt;a href="http://www.nature.com/nature/journal/v448/n7154/full/nature06102.html"&gt;http://www.nature.com/nature/journal/v448/n7154/full/nature06102.html&lt;/a&gt;&lt;/p&gt;&lt;/blockquote&gt;Accordingly to the &lt;a href="https://lists.sdsc.edu/pipermail/pdb-l/2009-December/004991.html"&gt;post of Kevin Karplus&lt;/a&gt;, "several of the PDB files by Krishna Murthy's group were identified as problematic in the RosettaHoles paper". Naturally, then, comes the question, "should we remove ALL the PDB files from Krishna Murthy's group as suspect?"&lt;br /&gt;&lt;br /&gt;The way Murthy's case coming to spotlight may represent an exception rather than norm. Imagine the scenario that he did not &lt;a href="http://www.nature.com/nature/journal/v444/n7116/full/nature05258.html"&gt;publish his C3b structure in Nature&lt;/a&gt; which caught the attention from leading crystallographers (Bert Janssen1, Randy Read2, Axel Brünger and Piet Gros), maybe Murthy is still publishing on protein structures today. In a sense, it is a hard to believe how Murthy could falsify &lt;a href="http://www.p212121.com/2009/12/10/449-citations-maybe-effected-by-retracted-structures/"&gt;12 protein structures and published 9 papers&lt;/a&gt; in prestigious journals (including Nature, Cell, PNAS, JMB, Biochemistry, JBC etc) which have been cited 449 times.&lt;br /&gt;&lt;br /&gt;PDB contains the state-of-the-art experimental data of bio-macromolecular structures. Yet, &lt;a href="http://xiang-jun.blogspot.com/2009/07/errors-in-pdb-entries.html"&gt;the archive is certainly full of inconsistencies/errors of various types&lt;/a&gt;. It would be helpful to know how many PDB entries are largely or partially wrong, and which can be taken as "gold standard" as far data quality is concerned.&lt;br /&gt;&lt;br /&gt;This case gives an excellent lesson for those performing data-mining on macromolecular structures. Nowadays, PDB structures are many and keep increasing rapidly, but they are clearly of varying quality. Structural bioinformatics is about solving biology problems using informatics tools. Thus knowing the caveats of your data (how reliable are they?) and tools (what are their limitations?) is a prerequisite to draw sound scientific conclusions.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-1223875098745079571?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/1223875098745079571/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/12/not-all-pdb-entries-are-reliable.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1223875098745079571'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1223875098745079571'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/12/not-all-pdb-entries-are-reliable.html' title='Not all PDB entries are reliable; some could be plain fake'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-9105291230111797732</id><published>2009-12-06T17:21:00.005-05:00</published><updated>2009-12-06T18:17:59.346-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>3DNA in the PCCP nucleic acid simulations themed issue</title><content type='html'>While checking 3DNA-related citations through &lt;span style="font-style: italic;"&gt;Web of Science&lt;/span&gt; for this past week, I found a total of nine times, as follows:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Five times to the &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/12930962"&gt;3DNA 2003 NAR paper&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Once to the &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/18600227"&gt;3DNA 2008 NP paper&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Three times to the 2001 &lt;a href="http://www.blogger.com/standard%20base%20reference%20frame%20paper"&gt;standard base reference frame paper&lt;/a&gt;&lt;/li&gt;&lt;/ol&gt; Most interestingly, all the citations are from the same &lt;span style="font-weight: bold;"&gt;nucleic acid simulations&lt;/span&gt; themed issue of          &lt;a href="http://www.rsc.org/Publishing/Journals/CP/article.asp?Journal=CP5&amp;amp;VolumeYear=200911&amp;amp;Volume=11&amp;amp;JournalCode=CP&amp;amp;MasterJournalCode=CP&amp;amp;SubYear=2009&amp;amp;type=Issue&amp;amp;Issue=45"&gt;&lt;em&gt;Physical Chemistry Chemical Physics&lt;/em&gt; &lt;strong&gt;11 (45)&lt;/strong&gt;&lt;/a&gt;. Honestly, I was quite a bit (nicely) surprised by the fact, so I browsed the articles online. Edited by Charles Laughton and Modesto Orozco, the 2009 PCCP "themed issue exemplifies the rich diversity of cutting-edge research in the field of nucleic acids simulation." Indeed, quite a few well-known experts are among the authors of the two perspectives and 16 papers.&lt;br /&gt;&lt;br /&gt;While &lt;a href="http://xiang-jun.blogspot.com/2009/05/phd-pages-on-dna-base-stacking.html"&gt;not an "energetic" person myself&lt;/a&gt;, over the years I have been keeping an eye on MD simulations and MM calculations of nucleic acid structures. It is my pleasure to see that the 3DNA is being widely used (certainly more than I originally expected) by the nucleic acid simulations community. Given time, and with a suitable collaborator, I am open to consider adapting 3DNA to currently available MD simulation packages to make life easier for practitioners in this "dynamic" field.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-9105291230111797732?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/9105291230111797732/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/12/3dna-in-pccp-nucleic-acid-simulations.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/9105291230111797732'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/9105291230111797732'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/12/3dna-in-pccp-nucleic-acid-simulations.html' title='3DNA in the PCCP nucleic acid simulations themed issue'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-8315331880437528689</id><published>2009-12-05T23:18:00.009-05:00</published><updated>2009-12-06T23:07:26.611-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='programming'/><title type='text'>See the effect of C preprocessing with the -E option of gcc</title><content type='html'>Recently, I was interested in understanding better of one part of a C program, which is very generic, covering a lot of grounds with preprocessing options (#if ... #endif). However, I would like to see the minimum that covers the section I cared about. I vaguely remembered there is an option in the &lt;span style="font-family:courier new;"&gt;gcc&lt;/span&gt; compiler, from reading the book "&lt;a href="http://www.amazon.com/Introduction-GCC-Brian-J-Gough/dp/0954161793"&gt;An Introduction to GCC&lt;/a&gt;" a while ago, that can stop the process right after the preprocessing step (just like &lt;span style="font-weight: bold;"&gt;-c&lt;/span&gt; option stops after the compilation step). A quick check of "man gcc" revealed that it is &lt;span style="font-weight: bold;"&gt;-E&lt;/span&gt;:&lt;br /&gt;&lt;blockquote&gt;&lt;span style="font-weight: bold;"&gt;-E&lt;/span&gt; Stop after the preprocessing stage; do not run the compiler proper.  The output is in the form of preprocessed source code, which is sent to the standard output.&lt;br /&gt;&lt;br /&gt;Input files which don't require preprocessing are ignored.&lt;/blockquote&gt;&lt;br /&gt;After setting the proper macros and running &lt;span style="font-family:courier new;"&gt;gcc&lt;/span&gt; with &lt;span style="font-weight: bold;"&gt;-E&lt;/span&gt;, I could immediately focus on the component to get what I wanted.&lt;br /&gt;&lt;p&gt;I have been using &lt;span style="font-family:courier new;"&gt;gcc&lt;/span&gt; for more than ten years, still there are more handy tricks to uncover.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;Here is a simplified example showing the effect of the &lt;span style="font-weight: bold;"&gt;-E&lt;/span&gt; option. The following C function is saved in file "&lt;span style="color: rgb(255, 0, 0);"&gt;check_gcc_E.c&lt;/span&gt;".&lt;/p&gt;&lt;pre&gt;--------------------------------------------------------------------------&lt;br /&gt;#define GREETING "Hello World"&lt;br /&gt;&lt;br /&gt;void check_gcc_E(void) {&lt;br /&gt;#ifdef VERBOSE&lt;br /&gt;   printf("Macro VERBOSE is defined.\n");&lt;br /&gt;   printf("So you see: '%s'\n", GREETING);&lt;br /&gt;#endif&lt;br /&gt;   printf("Hello everyone!\n");&lt;br /&gt;}&lt;br /&gt;--------------------------------------------------------------------------&lt;br /&gt;&lt;/pre&gt;&lt;p style="color: rgb(255, 0, 0);"&gt;gcc -E -DVERBOSE check_gcc_E.c&lt;br /&gt;&lt;/p&gt;&lt;pre&gt;--------------------------------------------------------------------------&lt;br /&gt;# 1 "check_gcc_E.c"&lt;br /&gt;# 1 "&amp;lt;built-in&amp;gt;"&lt;br /&gt;# 1 "&amp;lt;command-line&amp;gt;"&lt;br /&gt;# 1 "check_gcc_E.c"&lt;br /&gt;&lt;br /&gt;void check_gcc_E(void) {&lt;br /&gt;&lt;br /&gt;   printf("Macro VERBOSE defined\n");&lt;br /&gt;   printf("So you see: '%s'\n", "Hello World");&lt;br /&gt;&lt;br /&gt;   printf("Hello everyone!\n");&lt;br /&gt;}&lt;br /&gt;--------------------------------------------------------------------------&lt;br /&gt;&lt;/pre&gt;&lt;p style="color: rgb(255, 0, 0);"&gt;gcc -E check_gcc_E.c&lt;/p&gt;&lt;pre&gt;--------------------------------------------------------------------------&lt;br /&gt;# 1 "check_gcc_E.c"&lt;br /&gt;# 1 "&amp;lt;built-in&amp;gt;"&lt;br /&gt;# 1 "&amp;lt;command-line&amp;gt;"&lt;br /&gt;# 1 "check_gcc_E.c"&lt;br /&gt;&lt;br /&gt;void check_gcc_E(void) {&lt;br /&gt;   printf("Hello everyone!\n");&lt;br /&gt;}&lt;br /&gt;--------------------------------------------------------------------------&lt;br /&gt;&lt;/pre&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-8315331880437528689?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/8315331880437528689/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/12/see-effect-of-c-preprocessing-with-e.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8315331880437528689'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8315331880437528689'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/12/see-effect-of-c-preprocessing-with-e.html' title='See the effect of C preprocessing with the -E option of gcc'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2766947325319221578</id><published>2009-11-20T22:39:00.004-05:00</published><updated>2009-11-20T23:42:29.451-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Registered COPPA Users in phpBB3</title><content type='html'>Recently, I received an email from a 3DNA user who registered at the 3DNA forum, but could &lt;span style="font-weight: bold;"&gt;not&lt;/span&gt; see anything at all once logged in. 3DNA forum is based on phpBB3, and has been running for over three years now. So at the very beginning, I thought how could it be? I had never heard of any such problem/complain from 3DNA forum registers before. I even created a temporary test login account and found no problem. So I communicated with the user and asked her/him to log in using my test account, and again everything was fine!&lt;br /&gt;&lt;br /&gt;To reproduce the problem, I logged in as the user, and found one thing spurious: the user was in the group of "Registered COPPA Users", not the normal "Registered Users". I did not know what that COPPA stands for. So I googled the phase "Registered COPPA users" and the top hit led me into the &lt;a href="http://www.phpbb.com/support/documentation/3.0/adminguide/acp_groups.php"&gt;phpBB3 document on Group Management&lt;/a&gt;, and the section I am interested in reads as follows:&lt;br /&gt;&lt;blockquote&gt;Registered COPPA users are basically the same as registered users, except that they fall under the &lt;span style="font-style: italic; font-weight: bold;" class="acronym"&gt;COPPA&lt;/span&gt;&lt;span style="font-style: italic; font-weight: bold;"&gt;, or Child Online Privacy Protection Act, law, meaning that they are under the age of 13 in the U.S.A.&lt;/span&gt; Managing the permissions this usergroup has is important in protecting these users. &lt;span class="acronym"&gt;COPPA&lt;/span&gt; doesn't apply to users living outside of the U.S.A. and can be disabled altogether.&lt;/blockquote&gt;&lt;br /&gt;So a registered COPPA user is, by definition, under the age of 13. By default phpBB3 does not even allow such a child to read any content! In the context of 3DNA forum, this policy simply does not make any sense — the contents (in the public section) are viewable by any one without registration.&lt;br /&gt;&lt;br /&gt;It turned out that at registration stage, the first question is: &lt;span style="color: rgb(255, 0, 0);"&gt;"To continue with the registration procedure please tell us when you were born."&lt;/span&gt; Two dynamically generated dates are given, one is &lt;span style="font-weight: bold; font-style: italic;"&gt;"Before"&lt;/span&gt; a date defining an age over 13, and the other &lt;span style="font-weight: bold;"&gt;"On or after"&lt;/span&gt; it for below 13. Obviously the 3DNA user mentioned above clicked the wrong button.&lt;br /&gt;&lt;br /&gt;After knowing where the problem was and how it was created, fixing it was straightforward. Interestingly, when I then checked the 3DNA forum registered users, I found five of them were in the "Registered COPPA Users" group. Obviously, the previous (wrong) registers did not complain — possibly lost interest in pursuing further, so this issue did not surfaced until recently.&lt;br /&gt;&lt;br /&gt;In a real world as we live, what seems simple may not be. Nothing should be taken for granted.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2766947325319221578?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2766947325319221578/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/11/registered-coppa-users-in-phpbb3.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2766947325319221578'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2766947325319221578'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/11/registered-coppa-users-in-phpbb3.html' title='Registered COPPA Users in phpBB3'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-7821665813492170382</id><published>2009-11-20T20:36:00.005-05:00</published><updated>2009-12-12T18:36:48.468-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>3DNA in PDB</title><content type='html'>As mentioned previously,  PDB makes use of &lt;a href="http://xiang-jun.blogspot.com/2009/06/blocview-simple-effective-visualization.html"&gt;blocview&lt;/a&gt; (part of 3DNA) to generate the simple yet effective images for nucleic-acid-containing structures. That's the connection I knew of between 3DNA and PDB. By pure chance, however, I recently noticed the &lt;a href="http://www.rcsb.org/pdb/explore/explore.do?structureId=3DNA"&gt;3DNA entry in PDB&lt;/a&gt; — it is actually a protein structure, completely unrelated to the 3DNA software package!&lt;br /&gt;&lt;br /&gt;Just out of curiosity, I browsed the abstract of the Liu &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt; article, titled "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/19014950"&gt;Halogenated benzenes bound within a non-polar cavity in T4 lysozyme provide examples of I...S and I...Se halogen-bonding&lt;/a&gt;" [J Mol Biol. 2009 Jan 16;385(2):595-605]. I then downloaded the full PDF version of the paper and read it carefully through. This work studied binding interactions of benzenes with the internal cavity of L99A mutated T4 lysozyme. The authors demonstrated that the center of the phenyl ring can be shifted by more than one angstrom due to different halogen-substitutions (where the 3DNA entry corresponds to C6H5I), and (further) proved the concept that "the protein is flexible and adapts to the size and shape of the ligand". At better than 2.0 Å resolution, they also observed the I...S and I...Se halogen-bonds.&lt;br /&gt;&lt;br /&gt;I became interested in this paper not just because of the name of &lt;span style="font-weight: bold;"&gt;3DNA&lt;/span&gt;, for which a quick browsing over the abstract would be sufficient. This paper also reminded me of an early article I published with the title "&lt;a href="http://www.rsc.org/Publishing/Journals/FT/article.asp?doi=FT9959102009"&gt;Influence of fluorine on aromatic interactions"&lt;/a&gt;:&lt;br /&gt;&lt;blockquote&gt;Non-covalent interactions between aromatic ligands influence the conformations of metal complexes, and the system [M(OAr)&lt;small&gt;&lt;sub&gt;2&lt;/sub&gt;&lt;/small&gt;L&lt;small&gt;&lt;sub&gt;2&lt;/sub&gt;&lt;/small&gt;] has been used to investigate the difference between phenyl–phenyl, phenyl–pentafluorophenyl and pentafluorophenyl–pentafluorophenyl interactions. X-Ray crystal structures show that pentafluorophenyl groups adopt partially stacked orientations with the two aromatic rings close to parallel and with significant π overlap. In contrast, phenyl groups are skewed away from each other with only edge-to-face contacts. Phenyl–pentafluorophenyl interactions adopt a coplanar fully stacked geometry. These results have been rationalised on the basis of energy calculations (carried out blind) using a variety of empirical models for treating weak non-covalent interactions. The major cause of the different behaviour of the three systems lies in the electrostatic interactions between the π systems.&lt;/blockquote&gt;&lt;br /&gt;Knowing of the &lt;a href="http://xiang-jun.blogspot.com/2009/07/pdb-id-vs-ndb-id.html"&gt;pattern of a PDB id&lt;/a&gt;, — 4 characters long: the first character is a numeral in the range 0-9, while the rest can be either numerals or letters — I played around with some other possible ids with my name initials in it. Indeed I found one, &lt;a href="http://www.rcsb.org/pdb/explore/explore.do?structureId=1XJL"&gt;1XJL&lt;/a&gt;, a protein structure of human annexin A2 in the presence of calcium ions. If you are bimolecular structure-oriented, why not have a try with some ids of special meaning to you — you might be related to PDB in some unexpected way!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-7821665813492170382?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/7821665813492170382/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/11/3dna-in-pdb.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7821665813492170382'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7821665813492170382'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/11/3dna-in-pdb.html' title='3DNA in PDB'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-9139076370349625297</id><published>2009-11-14T22:20:00.009-05:00</published><updated>2009-12-12T18:37:18.177-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>How shear affects twist angle of a dinucleotide step?</title><content type='html'>A recent post in the 3DNA forum, titled "&lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=1&amp;amp;t=148"&gt;NUPARM vs X3DNA twist values&lt;/a&gt;", made me to rethink the issue of how or why shear affects twist angle of a dinucleotide step.&lt;br /&gt;&lt;br /&gt;To me, this problem has long been solved as demonstrated by the following two well-cited publications:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;&lt;a href="http://ndbserver.rutgers.edu/standards/standard_reference.html"&gt;The Tsukuba report&lt;/a&gt;, a.k.a., "A Standard Reference Frame for the Description of Nucleic Acid Base-pair Geometry". When Dr. Olson and I were drafting this report, I felt clearly the need to caution the community of the &lt;span style="font-style: italic;"&gt;intrinsic correlations&lt;/span&gt; between base-pair parameters and the associated step parameters (&lt;span style="font-weight: bold;"&gt;Figure 3&lt;/span&gt; there) to avoid possible mis-interpretations in structural analysis. This is specially the case for the effect of shear on twist, since the G–U wobble base-pair is common in RNA and it has a ~2.0 A shear.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;&lt;a href="http://nar.oxfordjournals.org/cgi/content/full/31/17/5108"&gt;The 3DNA 2003 NAR paper&lt;/a&gt;. There is a subsection on the "Treatment of non-Watson–Crick base pairing motifs", and Figure 3 addressed specially on the issue:&lt;blockquote&gt;"Large Shear of the G–U wobble base pair influences the calculated but not the ‘observed’ Twist. The 3DNA&lt;!-- HIGHWIRE EXLINK_ID="31:17:5108:22" VALUE="3DNA" TYPEGUESS="PDB" --&gt;&lt;!-- /HIGHWIRE --&gt; numerical values of Twist, 20° (top) and 43° (bottom), differ from the visualization of nearly equivalent Twist suggested by the angle between successive C1'···C1' vectors (finely dotted lines)."&lt;/blockquote&gt;&lt;/li&gt;&lt;/ol&gt;It was thus a bit surprising that such question still popping up. On second thought, however, it is quite understandable: one cannot expect everyone to read that two papers; not to mention remembering such details. So I am glad that this question was brought up to my attention, and it made me thinking possible ways to document more thoroughly the many 3DNA-related "technical details" that are crucial for better understanding of nucleic acid structures.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_CEuD25cqpCE/Sv-GRTw_XMI/AAAAAAAAACQ/d51uctcanEk/s1600-h/shear_on_twist.png"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer; width: 320px; height: 279px;" src="http://2.bp.blogspot.com/_CEuD25cqpCE/Sv-GRTw_XMI/AAAAAAAAACQ/d51uctcanEk/s320/shear_on_twist.png" alt="" id="BLOGGER_PHOTO_ID_5404185709789535426" border="0" /&gt;&lt;/a&gt;Coming back to the shear on twist angle issue, the figure at the left shows a G–U wobble pair example (top), and a simple rationale: the base-pair is approximately of 10Å-by-5Å (as defined in SCHNArP/3DNA), so a 2Å shift will lead to an angle:&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(255, 0, 0);"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-weight: bold; color: rgb(255, 0, 0);"&gt;atan2(2, 10) * 180 / pi = 11.3 degrees&lt;/span&gt;&lt;/blockquote&gt; (i.e., the red dotted line relative to the bottom horizontal line).&lt;br /&gt;&lt;br /&gt;To a first order approximation, that is the difference between RC8–YC6 (or C1'–C1') vs. the base-centered mean &lt;span style="font-style: italic;"&gt;y&lt;/span&gt;-axis of the pair for calculating twist angle. So whenever one has a G–U wobble pair next to a normal Watson-Crick pair, there would be ~11 degrees difference in "calculated" twist angle between the two approaches (NewHelix/CEHS/SCHNAaP/NUPARM vs 3DNA/Curve+). Moreover, when a G–U wobble is next to a U–G wobble pair, the difference would be doubled to ~23 degrees!&lt;br /&gt;&lt;br /&gt;It is worth mentioning that the issue here (as in other similar cases) is &lt;span style="font-style: italic;"&gt;not&lt;/span&gt; which number is "correct" or which is "wrong": a number is a number. It is its interpretation that matters, and it is here that "details" do count.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-9139076370349625297?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/9139076370349625297/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/11/how-shear-affects-twist-angle-of.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/9139076370349625297'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/9139076370349625297'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/11/how-shear-affects-twist-angle-of.html' title='How shear affects twist angle of a dinucleotide step?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_CEuD25cqpCE/Sv-GRTw_XMI/AAAAAAAAACQ/d51uctcanEk/s72-c/shear_on_twist.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-6227717752414688209</id><published>2009-11-08T16:35:00.003-05:00</published><updated>2009-11-08T17:33:47.671-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>It's sad to hear that Warren DeLano, author of PyMOL, passed away</title><content type='html'>From a couple of mailing lists, I heard the sad news that &lt;a href="http://web.me.com/brendandelano/Warren_Lyford_DeLano/In_Memorium.html"&gt;Warren DeLano, author of PyMOL, passed away&lt;/a&gt; on Tuesday morning, November 3rd. He was only 37!&lt;br /&gt;&lt;br /&gt;I have never met Dr. DeLano personally, nor even I communicated with him by email, but I am very aware of PyMol, the &lt;span style="font-style: italic;"&gt;de facto&lt;/span&gt; standard nowadays for molecular graphics. In writing 3DNA Nature Protocols paper, I dug more deeply into PyMol. I was impressed by its interactive interface to .r3d files (Raster3D) and the high quality ray-traced images it produced. So I came up with a Perl script (&lt;span style="font-weight: bold;font-family:courier new;" &gt;x3dna_r3d2png&lt;/span&gt;) to convert automatically from a 3DNA generated .r3d file to a PNG image through the PyMol engine.&lt;br /&gt;&lt;br /&gt;Through his seminal contributions to PyMol, Dr. DeLano achieved something very few others in computational chemistry/biology can match: he successfully mobilized literately thousands of software programmers and ordinary users from multi-disciplines to join him to produce phenomenal pictures, each of which is worth a thousand words!&lt;br /&gt;&lt;br /&gt;It was due to Dr. DeLano's vision that he made PyMol open source so the community now has the possibility/opportunity to continue support and further improve the software. At this stage, however, no one is likely to knows PyMol code to the depth Dr. DeLano did, not to mention the leadership and enthusiasm that he brought to the project. Whatever the case, the community undoubtedly would appreciate Dr. DeLano's valuable contributions.&lt;br /&gt;&lt;br /&gt;Thanks, Dr. DeLano, for bring PyMol to the world!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-6227717752414688209?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/6227717752414688209/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/11/its-sad-to-hear-that-warren-delano.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6227717752414688209'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/6227717752414688209'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/11/its-sad-to-hear-that-warren-delano.html' title='It&apos;s sad to hear that Warren DeLano, author of PyMOL, passed away'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-234620014540979425</id><published>2009-10-31T19:49:00.009-04:00</published><updated>2009-10-31T23:39:06.856-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><category scheme='http://www.blogger.com/atom/ns#' term='algorithm'/><title type='text'>How to calculate torsion angle?</title><content type='html'>Given the x-, y-, and z-coordinates of four points in 3-dimensional (3D) space, how to calculate torsion angle? Overall, this is a well-solved problem in structural biology, and one can find detailed description of the algorithm in text books and on-line documents. The algorithm for calculating torsion angle is implementated in virtually every software package in structural chemistry or biology.&lt;br /&gt;&lt;br /&gt;Basic as it is, however, in my experience, it is very important to have a detailed appreciation of how the method works in order to really get into the 3D world. Here is a worked example using Octave/Matlab of my simplified, geometry-based implementation of how to calculate torsion angle, including how to determine its sign. No theory or (complicated) mathematical formula, just a step-by-step illustration of how I solve this problem.&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Coordinates of four points in variable &lt;span style="font-weight: bold;"&gt;abcd&lt;/span&gt;:&lt;br /&gt;&lt;pre face="monospace" style="margin-left: 0pt; background-color: rgb(204, 204, 204);"&gt;abcd = [ 21.350  31.325  22.681&lt;br /&gt;22.409  31.286  21.483&lt;br /&gt;22.840  29.751  21.498&lt;br /&gt;23.543  29.175  22.594 ];&lt;/pre&gt;&lt;/li&gt;&lt;li&gt;Two auxiliary functions: &lt;span style="font-weight: bold;"&gt;norm_vec()&lt;/span&gt; to normalize a vector; &lt;span style="font-weight: bold;"&gt;get_orth_norm_vec()&lt;/span&gt; to get the orthogonal component (normalized) of a vector with reference to another vector, which should have been normalized.&lt;br /&gt;&lt;pre style="margin-left: 0pt; background-color: rgb(204, 204, 204); font-family: monospace;"&gt;function ovec = norm_vec(vec)&lt;br /&gt;  ovec = vec / norm(vec);&lt;br /&gt;endfunction&lt;br /&gt;&lt;br /&gt;function ovec = get_orth_norm_vec(vec, vref)&lt;br /&gt;  temp = vec - vref * dot(vec, vref);&lt;br /&gt;  ovec = norm_vec(temp);&lt;br /&gt;endfunction&lt;/pre&gt;&lt;/li&gt;&lt;li&gt;Get three vectors: &lt;span style="font-weight: bold;"&gt;b_c&lt;/span&gt; is the normalized vector &lt;span style="font-weight: bold;"&gt;b&lt;/span&gt;→&lt;span style="font-weight: bold;"&gt;c&lt;/span&gt;; &lt;span style="font-weight: bold;"&gt;b_a_orth&lt;/span&gt; is the orthogonal component (normalized) of vector &lt;span style="font-weight: bold;"&gt;b&lt;/span&gt;→&lt;span style="font-weight: bold;"&gt;a&lt;/span&gt; with reference to &lt;span style="font-weight: bold;"&gt;b&lt;/span&gt;→&lt;span style="font-weight: bold;"&gt;c&lt;/span&gt;; &lt;span style="font-weight: bold;"&gt;c_d_orth&lt;/span&gt; is similarly defined, as the orthogonal component (normalized) of vector &lt;span style="font-weight: bold;"&gt;c&lt;/span&gt;→&lt;span style="font-weight: bold;"&gt;d&lt;/span&gt; with reference to &lt;span style="font-weight: bold;"&gt;b&lt;/span&gt;→&lt;span style="font-weight: bold;"&gt;c&lt;/span&gt;.&lt;br /&gt;&lt;pre style="margin-left: 0pt; background-color: rgb(204, 204, 204); font-family: monospace;"&gt;b_c = norm_vec(abcd(3, :) - abcd(2, :))&lt;br /&gt;  % &lt;span style="color: rgb(255, 0, 0); font-weight: bold;"&gt;[0.2703158  -0.9627257   0.0094077]&lt;/span&gt;&lt;br /&gt;b_a_orth = get_orth_norm_vec(abcd(1, :) - abcd(2, :), b_c)&lt;br /&gt;  % &lt;span style="color: rgb(255, 0, 0); font-weight: bold;"&gt;[-0.62126  -0.16696   0.76561]&lt;/span&gt;&lt;br /&gt;c_d_orth = get_orth_norm_vec(abcd(4, :) - abcd(3, :), b_c)&lt;br /&gt;  % &lt;span style="color: rgb(255, 0, 0); font-weight: bold;"&gt;[0.41330   0.12486   0.90199]&lt;/span&gt;&lt;/pre&gt;&lt;/li&gt;&lt;li&gt;Now the torsion angle is the angle between the two vectors, &lt;span style="font-weight: bold;"&gt;b_a_orth&lt;/span&gt; and &lt;span style="font-weight: bold;"&gt;c_d_orth&lt;/span&gt;, and can be easily calculated by their dot product. The sign of the torsion angle is determined by the relative orientation of the cross product of the same two vectors with reference to the middle vector &lt;span style="font-weight: bold;"&gt;b&lt;/span&gt;→&lt;span style="font-weight: bold;"&gt;c&lt;/span&gt;. Here they are in opposite direction, thus the torsion angle is negative.&lt;br /&gt;&lt;pre style="margin-left: 0pt; background-color: rgb(204, 204, 204); font-family: monospace;"&gt;angle_deg = acos(dot(b_a_orth, c_d_orth)) * 180 / pi&lt;br /&gt;  % &lt;span style="color: rgb(255, 0, 0); font-weight: bold;"&gt;65.609&lt;/span&gt;&lt;br /&gt;sign = dot(cross(b_a_orth, c_d_orth), b_c)&lt;br /&gt;  % &lt;span style="color: rgb(255, 0, 0); font-weight: bold;"&gt;-0.91075&lt;/span&gt;&lt;br /&gt;if (sign &amp;lt; 0)&lt;br /&gt;  ang_deg = -angle_deg  % &lt;span style="color: rgb(255, 0, 0); font-weight: bold;"&gt;-65.609&lt;/span&gt;&lt;br /&gt;endif&lt;br /&gt;&lt;/pre&gt;&lt;/li&gt;&lt;/ol&gt;A related concept is the so-called dihedral angle, or more generally the angle between two planes. As long as the normal vectors to the two corresponding planes are defined, the angle between them is easy to work out.&lt;br /&gt;&lt;br /&gt;Moreover, the method to calculate &lt;span style="font-weight: bold;"&gt;twist&lt;/span&gt; angles of helical nucleic acid structures in SCHNAaP and 3DNA is &lt;span style="font-weight: bold; font-style: italic;"&gt;essentially&lt;/span&gt; the same.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-234620014540979425?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/234620014540979425/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/how-to-calculate-torsion-angle.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/234620014540979425'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/234620014540979425'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/how-to-calculate-torsion-angle.html' title='How to calculate torsion angle?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-1646475648552427662</id><published>2009-10-30T22:04:00.002-04:00</published><updated>2009-10-31T18:25:12.331-04:00</updated><title type='text'>Upgrade to Ubuntu 9.10</title><content type='html'>I am now upgraded to &lt;a href="http://www.ubuntu.com/"&gt;Ubuntu&lt;/a&gt; 9.10 Karmic Koala, released On October 29 by Canonical Ltd. The system is now up and running, even though there are still some (minor) issues to be resolved. Overall, it was an exciting exploration, and Internet and Google search were essential for solving most of the problems.&lt;br /&gt;&lt;br /&gt;Normally, I am not that quick to catch up with a new software release, but wait a few more weeks when most initial bugs have been fixed. I was quick this time mainly because I had been trapped into an earlier 9.10 alpha release (in development branch), when I tried to fix a printing problem (without success). Worse yet, suddenly, my VirtualBox to run Windows XP did not start, and SCIM Chinese input stopped functioning, etc. It caused me quite some time and trouble to get my Linux box back to work. So over the past few months, I did not perform any update, but eagerly waiting for the stable release. It is a relief that the new official 9.10 release does solve my problems.&lt;br /&gt;&lt;br /&gt;With Ubuntu 9.10, now I have access to OpenOffice 3.1.1 and Lyx 1.6.4 for text processing, GNU Emacs 22.2.1, and GCC 4.4.1, among other things. It is nice to stay current, not only in science, but also in IT.&lt;br /&gt;&lt;br /&gt;It is worth mentioning that &lt;a href="http://en.wikipedia.org/wiki/Ubuntu_%28philosophy%29"&gt;Ubuntu is "an ethical concept of African origin emphasizing community, sharing and generosity."&lt;/a&gt; It is amazing how useful and robust the free, open-source software can be!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-1646475648552427662?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/1646475648552427662/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/upgrade-to-ubuntu-910.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1646475648552427662'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1646475648552427662'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/upgrade-to-ubuntu-910.html' title='Upgrade to Ubuntu 9.10'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-8707182930968631503</id><published>2009-10-23T23:45:00.007-04:00</published><updated>2009-12-12T18:38:27.574-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Sharing published data?</title><content type='html'>A letter in the recent issue of &lt;span style="font-style: italic;"&gt;Science&lt;/span&gt; [VOL 326 23 OCTOBER 2009] titled "The Antidote to Bias in Research" by Allison mentioned the &lt;a href="http://xiang-jun.blogspot.com/2009/09/on-reproducibility-of-scientific.html"&gt;&lt;span style="font-style: italic;"&gt;Nature Genetics &lt;/span&gt;article&lt;/a&gt; on the issue of wide-spread non-repeatability of published microarray gene expression studies. It also touched on another recent study titled "Empirical Study of Data Sharing by Authors Publishing in PLoS Journals" by Savage and Vickers [&lt;a href="http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0007078"&gt;PLoS ONE 4(9): e7078&lt;/a&gt;]:&lt;br /&gt;&lt;blockquote&gt;In another study, refusal to share data despite policies requiring sharing was nearly ubiquitous among authors publishing in Public Library of Science journals.&lt;/blockquote&gt;I then read the Savage and Vickers article, and was surprised to find that only one author out of 10 sent them an original data set. Therefore, the authors wrote:&lt;br /&gt;&lt;blockquote&gt;In conclusion, our findings suggest that explicit journal policies requiring data sharing do not lead to authors making their data sets available to independent investigators.&lt;/blockquote&gt;I can understand why the repeatability rate is so low from the &lt;a href="http://xiang-jun.blogspot.com/2009/09/on-reproducibility-of-scientific.html"&gt;&lt;span style="font-style: italic;"&gt;Nature Genetics&lt;/span&gt; paper&lt;/a&gt; given the so many &lt;span style="font-style: italic; font-weight: bold;"&gt;details&lt;/span&gt; associated with a published table or figure. However, it is a bit hard to explain why it is &lt;span style="font-style: italic; font-weight: bold;"&gt;so&lt;/span&gt; difficult to share just the original dataset associated with a published work. I would imagine that the authors would be honored that others care about their publications, and contact them directly.&lt;br /&gt;&lt;br /&gt;In my experience, I have also been simply ignored frequently for clarifications of some details in published work, or requests for some datasets. The highest successful rate is asking for PDF represents from corresponding authors.&lt;br /&gt;&lt;br /&gt;If the basic principles of scientific publications are strictly enforced, even just by the big journals (so many, nowadays), a lot of unfounded big claims would be gone or can be easily seen through. Well, I know that's only possible in an ideal world.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-8707182930968631503?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/8707182930968631503/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/sharing-published-data.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8707182930968631503'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/8707182930968631503'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/sharing-published-data.html' title='Sharing published data?'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-3827015665213895791</id><published>2009-10-23T21:07:00.007-04:00</published><updated>2009-10-28T13:16:15.670-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>How to quantify the relative geometry between two base-pairs? — Part 2</title><content type='html'>In &lt;a href="http://xiang-jun.blogspot.com/2009/10/two-sets-of-parameters-to-quantify.html"&gt;part I&lt;/a&gt; of this series, and indeed other occasions in my blog, 3DNA home page and forum, I have attributed to several related programs where 3DNA has benefited from. Before going into details on how the various parameters are calculated in 3DNA, however, I feel it is in order to make it clear my philosophy on creating scientific software in general, and 3DNA in particular.&lt;br /&gt;&lt;h2 style="border-bottom: 1px solid;"&gt;Some underlying considerations&lt;/h2&gt;&lt;ul&gt;&lt;li&gt;Whenever possible/applicable, I always try to get a &lt;span style="font-style: italic;"&gt;thorough&lt;/span&gt; understanding of related software available. By &lt;span style="font-style: italic;"&gt;thorough&lt;/span&gt; I mean at the source code level to see how an algorithm is implemented. The benefits of doing this are several folds: &lt;ol&gt;&lt;li&gt;Not to recreate the wheel, but to build upon previous work.&lt;/li&gt;&lt;li&gt;It is the most effective way to learn — many math formulae or text descriptions of algorithms, while useful for getting general ideas, are lack in details or vague in nature. In bioinformatics, most of the fundamental mathematics are not new. It is more about a novel combination of various known-parts, applying to a specific problem. Reading the source code is the only way to see unambiguously the implementation details.&lt;/li&gt;&lt;li&gt;With a clear understanding of how the parameters are actually calculated, one can make better use of a software tool, know its limitations and thus avoid misinterpretations.&lt;/li&gt;&lt;li&gt;To make objective, convincing (even to the original authors) comparisons/comments on existing tools, and importantly, to create something better if needed.&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;Specially, 3DNA has benefited the most from my SCHNAaP/SCHNArP programs: the underlying algorithms in 3DNA for calculating the various structural parameters (propeller, buckle, shear, roll, slide, x-displacement, inclination etc) are exactly the same as there — &lt;span style="color: rgb(255, 0, 0);font-family:courier new;" &gt;analyze&lt;/span&gt; and &lt;span style="color: rgb(255, 0, 0);"&gt;rebuild&lt;/span&gt; are direct derivatives from SCHNA&lt;span style="font-weight: bold;"&gt;a&lt;/span&gt;P (&lt;span style="font-weight: bold;"&gt;a&lt;/span&gt; for analysis) and SCHNA&lt;span style="font-weight: bold;"&gt;r&lt;/span&gt;P (&lt;span style="font-weight: bold;"&gt;r&lt;/span&gt; for rebuilding/reconstruction), respectively; the ideas of standard stacking diagrams, the Zp parameter, building atomic models with sugar-phosphate backbone, and the base/bp rectangular block representations are also from SCHNAaP/SCHNArP. I also borrowed ideas from Babcock's &lt;span style="font-family:courier new;"&gt;RNA&lt;/span&gt;, Bansal's NUPARM, Dickerson's NewHelix/FreeHelix etc., which will be mentioned explicitly in following sessions.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Code wise, 3DNA was created from scratch in strict ANSI C (over 25K lines). In addition to implementing a unique combination of existing algorithms, I have added many significant novel methods including the &lt;span style="color: rgb(255, 0, 0);font-family:courier new;" &gt;find_pair&lt;/span&gt; program. However, feature-rich has never been my goal. Instead, I only consider to add new functions that I understand clearly and find useful. On the other hand, I am always quick to fix bugs.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;Overall, it seems that I am an &lt;em&gt;outlier&lt;/em&gt; rather than norm in scientific programming. The reasons could be as follows:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;It is hard to understand other people's implementations of algorithms since it needs to read between the lines. This is especially the case with a unfamiliar computer language; undocumented, or bad code. It is &lt;span style="font-style: italic;"&gt;easier&lt;/span&gt; to create one's own program than to understand and modify third-party software.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;For publication purpose, it is more attractive to work on something "new" instead of building on others.&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;Thus, in bioinformatics field, there are far more "new" methods papers published than refinements of previous ones. However, what claimed/appeared to be novel may not be the case under the hook, due to different terminologies used in different fields. Moreover, while it is easy to create a self-claimed "new" program, others could simply do the same. In my experience, it is well worth the effort to really understand leading programs if one is &lt;span style="font-style: italic; font-weight: bold;"&gt;serious&lt;/span&gt; about getting into an informatics field: it was really revealing and exciting when I went thorough CEHS, NewHelix/FreeHelix, RNA, Curves etc. Then I know clearly what was "the start-of-the-art" of the field and how I can do better: that's exactly how 3DNA came about!&lt;br /&gt;&lt;br /&gt;In the following sessions, I will provide details on how 3DNA calculates the two sets of parameters to quantify the relative geometry of a dinucleotide step.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-3827015665213895791?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/3827015665213895791/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/how-to-quantify-relative-geometry.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3827015665213895791'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3827015665213895791'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/how-to-quantify-relative-geometry.html' title='How to quantify the relative geometry between two base-pairs? — Part 2'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-1787781813688320419</id><published>2009-10-18T19:21:00.003-04:00</published><updated>2009-10-18T19:53:38.062-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Duplicate tab function in Adobe READER 9.1 is very handy</title><content type='html'>In reading a scientific paper, one often needs to jump back and forth for referred to tables, figures and references, etc. Those days, PDF is the standard way to share an e-document, and Acroread is the norm to view its content. I used to open two copies of the same document in two instances of Acroread (or one Acroread, one xpdf) so I can read continuously in one while moving around in the other, mostly at the end for references. This works, but obviously less than ideal.&lt;br /&gt;&lt;br /&gt;One day, purely by chance, I (mouse) right-clicked the tab for the document I was viewing, and noticed that it popped up with two options: &lt;span style="font-weight: bold;"&gt;Detach Tab&lt;/span&gt; and &lt;span style="font-weight: bold;"&gt;Duplicate Tab&lt;/span&gt;. Clicking &lt;span style="font-weight: bold;"&gt;Duplicate Tab&lt;/span&gt; led to duplication of the same document in another tab. Actually, this process can be repeated more than once, thus allowing for multiple views of the same document simultaneously. Very neat!&lt;br /&gt;&lt;br /&gt;Nowadays, whenever I read a scientific publication in Acroread, I often duplicate tab to have two views. It is only a click away to switch between the two. Thus when a citation is referred to in the main text, I can immediately see at the reference section what it is about.&lt;br /&gt;&lt;br /&gt;A word of caution: I am using Ubuntu 9.04, with Acroread v9.1.2 05/25/2009. I have no idea from which version and on what platform such function was added.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-1787781813688320419?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/1787781813688320419/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/duplicate-tab-function-in-adobe-reader.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1787781813688320419'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1787781813688320419'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/duplicate-tab-function-in-adobe-reader.html' title='Duplicate tab function in Adobe READER 9.1 is very handy'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5961419324566955338</id><published>2009-10-18T12:10:00.008-04:00</published><updated>2009-10-18T18:58:02.194-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>How to quantify the relative geometry between two base-pairs? — Part 1</title><content type='html'>In the field of double helical DNA (and RNA) structures, the following two sets of parameters to normally used to quantify the relative geometry of the two base pairs in a dinucleotide step:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;shift, slide, rise, tilt, roll, and twist — which I call them &lt;span style="font-weight: bold; font-style: italic;"&gt;stacking&lt;/span&gt; or simple &lt;span style="font-weight: bold;"&gt;step&lt;/span&gt; parameters&lt;/li&gt;&lt;li&gt;x-displacement, y-displacement, helical rise, inclination, tip, and helical twist — which I call them &lt;span style="font-weight: bold; font-style: italic;"&gt;helical&lt;/span&gt; parameters&lt;/li&gt;&lt;/ul&gt;3DNA calculates all of these parameters. Over the years, I have been approached  quite a few times on the &lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=1&amp;amp;t=95"&gt;local helical parameters interpretation question&lt;/a&gt;. In literature, I've noticed numerous times of confusions the community still has over the two set of parameters, especially with regard to the issue of "twist vs helical twist" and "rise vs. helical rise". This series of blog posts is aimed to clarify the problem by providing some background information and step-by-step worked examples illustrating how the parameters are actually calculated in 3DNA (which follows &lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=9&amp;amp;t=120"&gt;SCHNAaP/SCHNArP&lt;/a&gt;). In my experience, knowing how the parameters are derived is the key to understand what they mean, and thus to avoid misinterpretations.&lt;br /&gt;&lt;h2 style="border-bottom: 1px solid;"&gt;Part 1 — background information&lt;/h2&gt;&lt;ol&gt;&lt;li&gt;&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/7666417"&gt;The Calladine and El-Hassan Scheme (CEHS)&lt;/a&gt; calculates only the six step parameters, i.e., shift, slide, rise, tilt, roll, and twist. The original CEHS implementation was a few FORTRAN subroutines taking advantage the code base of Dickerson's well-known NewHelix program. In developing  &lt;a href="http://3dna.rutgers.edu:8080/forum/viewtopic.php?f=9&amp;amp;t=120"&gt;SCHNAaP/SCHNArP&lt;/a&gt;, a collaborative project between the then Sheffield and Cambridge groups, I also went through the source code of NewHelix to get a thorough knowledge of its internals in order to integrate its nice features into SCHNAaP.&lt;br /&gt;&lt;p&gt;One thing I noticed was x-displacement, a parameter characterizing the "hole" of A-DNA in top view. The global helical axis from NewHelix was (still is) appealing to me, especially for a short, relatively straight fragment. Even for a clearly non-straight duplex, e.g., in super helical nucleosome core particle DNA, or a severely kinked DNA, the deviation from regular linear helix serves as a good parameter to quantify its overall non-linearity. As a side note, a single so-called bending angle is often misleading — for example, the bending angle as commonly cited in literature (mostly calculated from Curves), is strongly influenced by the two terminal base-pairs.&lt;/p&gt;So I devised a parallel set of &lt;span style="font-style: italic; font-weight: bold;"&gt;global&lt;/span&gt; helical parameters following the CEHS scheme of angle combination: here instead of &lt;span style="color: rgb(255, 0, 0); font-weight: bold;"&gt;roll-tilt&lt;/span&gt;, I used &lt;span style="color: rgb(255, 0, 0); font-weight: bold;"&gt;tip-inclination&lt;/span&gt;. The definition of global helical axis and the point the helix passes through are based on my simplified implementation in ANSI C of the algorithms used by NewHelix.&lt;br /&gt;&lt;p&gt;Thus SCHNAaP calculates both a set of &lt;span style="font-style: italic;"&gt;local&lt;/span&gt; CEHS step parameters, and a set of &lt;span style="font-style: italic;"&gt;global&lt;/span&gt; helical parameters — a unique combination of CEHS and NewHelix, made possible only after a &lt;span style="font-weight: bold; font-style: italic;"&gt;thorough&lt;/span&gt; understanding of both methods. SCHNArP was developed to complete the circle, i.e., to rebuild a structure given a set of parameters, either the local step parameters: shift, slide, rise, tilt, roll, and twist, or the global helical parameters: x-displacement, y-displacement, helical rise, inclination, tip, and helical twist.&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/8133513"&gt;The &lt;span style="font-weight: bold;font-family:courier new;" &gt;RNA&lt;/span&gt; (Running Nucleic Acids) program by Babcock et al.&lt;/a&gt; calculates a set of &lt;span style="font-weight: bold; font-style: italic;"&gt;local&lt;/span&gt; helical parameters. While working at Rutgers in Dr. Olson's group, I was interested in knowing how &lt;span style="font-weight: bold;"&gt;the local helical axis and the point&lt;/span&gt; its passes through were defined in the RNA program, since it would naturally substitute for the global one in SCHNAaP/SCHNArP to make two sets of purely local parameters. As it turned out, however, the algorithm was finally implemented in the 3DNA software package, as summarized in the email I sent out &lt;a href="http://rutchem.rutgers.edu/%7Exiangjun/3DNA/albany/luJun092003.txt"&gt;before the 13th Conversation at Albany&lt;/a&gt;: &lt;blockquote&gt;The way to calculate the local helical axis and helical parameters in 3DNA is *essentially the same* as in RNA, but expressed in a much simpler way. First, the local helical axis is calculated as dx-times-dy, following Bansal, which gives the *same* result as the "single rotation axis" detailed in the RNA paper. I still could not figure out WHY, but verified this numerically. The location where the helix passes through is based directly on the RNA paper (i.e., following Chasles's theorem). The procedure to calculate helical parameters, i.e., tip/inclination/x-disp/y-disp/etc, following the SCHNAaP paper (i.e, with tip-inclination combination, which is consistent with the roll-tilt, propeller-buckle combinations in 3DNA). What amazed us is that this much simpler tip-inclination implementation in 3DNA gives *exactly* the same numerical values as the original RNA algorithm.&lt;/blockquote&gt; &lt;/li&gt;&lt;li&gt;In summary, the two sets of local parameters as implemented in 3DNA are based on a unique combination of nice features from several programs well-known in the community, including SCHNAaP/SCHNArP (CEHS), NewHelix, RNA, and NUPARM. However, the 3DNA calculated parameters are &lt;span style="font-style: italic;"&gt;numerically&lt;/span&gt; different from any of them, partially because 3DNA adopts the standard base reference frame. To the best of my knowledge, 3DNA is the only software that allows for rigorous conversion between the two sets of local parameters — a utility program in 3DNA called &lt;span style="color: rgb(255, 0, 0);"&gt;step_hel&lt;/span&gt; illustrates this simple fact. Again, as I wrote &lt;a href="http://rutchem.rutgers.edu/%7Exiangjun/3DNA/albany/luJun092003.txt"&gt;before the 13th Conversation at Albany&lt;/a&gt;:&lt;blockquote&gt;Take each base-pair as a rigid block, only 6 parameters are required to relate one bp to the other. Two sets are commonly used: one set is "shift, slide, rise, tilt, roll and twist", and the other set is "x-displacement, y-displacement, helical rise, inclination, tip, and helical twist". Obviously, these two sets of parameters should be directly convertible, as demonstrated by Calladine and Drew in their 1984 JMB B-to-A transition paper, and illustrated in 3DNA manuscript.&lt;/blockquote&gt;&lt;/li&gt;&lt;li&gt;Links to other programs referred to in this section:&lt;ul&gt;&lt;li&gt;&lt;a href="http://3dna.rutgers.edu:8080/forum/viewforum.php?f=9"&gt;SCHNAaP/SCHNArP&lt;/a&gt; by Lu et al.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://ndbserver.rutgers.edu/services/index.html"&gt;The RNA program&lt;/a&gt; by Babcock et al.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://ndbserver.rutgers.edu/services/index.html"&gt;FreeHelix98&lt;/a&gt; by Dickerson&lt;/li&gt;&lt;li&gt;&lt;a href="http://nucleix.mbu.iisc.ernet.in/nuparm/nuparm.shtml"&gt;NUPARM&lt;/a&gt; from Banal et al.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://3dna.rutgers.edu/x3dna/standards"&gt;Standards&lt;/a&gt;, especially the email exchanges before the 13th Conversation at Albany&lt;/li&gt;&lt;br /&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ol&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5961419324566955338?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5961419324566955338/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/two-sets-of-parameters-to-quantify.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5961419324566955338'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5961419324566955338'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/two-sets-of-parameters-to-quantify.html' title='How to quantify the relative geometry between two base-pairs? — Part 1'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-49010245119382471</id><published>2009-10-10T21:06:00.007-04:00</published><updated>2009-10-11T09:40:05.079-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Blogger's duo-editing modes allow for flexibility and convenience</title><content type='html'>In my experience using Blogger over the past several months, I have begun to appreciate more its double editing mode: &lt;span style="color: rgb(255, 0, 0);"&gt;Compose&lt;/span&gt; and &lt;span style="color: rgb(255, 0, 0);"&gt;Edit HTML&lt;/span&gt;. The &lt;span style="color: rgb(255, 0, 0);"&gt;Compose&lt;/span&gt; mode is a simple WYSIWYG editor, convenient for most common tasks. Once in a while, however, I get stuck with some nasty formatting issues that could drive one crazy to fix. This is where the &lt;span style="color: rgb(255, 0, 0);"&gt;Edit HTML&lt;/span&gt; mode comes in handy, which allows for full flexibility in editing raw HTML.&lt;br /&gt;&lt;br /&gt;In principle, I am pretty competent with HTML and could use the &lt;span style="color: rgb(255, 0, 0);"&gt;Edit HTML&lt;/span&gt; mode directly. However, raw HTML is verbose and thus not that convenient. So I normally start a blog post with the &lt;span style="color: rgb(255, 0, 0);"&gt;Compose &lt;/span&gt;mode for most of the content, and switch to the &lt;span style="color: rgb(255, 0, 0);"&gt;Edit HTML &lt;/span&gt;mode only when necessary. Blogger makes the switch between the two modes a simple button click. This is in contrast to the single editing mode in &lt;a href="http://3dna.rutgers.edu:8080/forum/"&gt;phpBB3 (BBCode) used by the 3DNA forum&lt;/a&gt;, which does not allow for direct access to HTML.&lt;br /&gt;&lt;br /&gt;Ideally, a software tool should be both flexible and convenient. In reality, however, not that many software could strike a balance between the two factors. Blogger is a nice example. Interesting, in composing this post, I switched back and forth between the two editing modes a couple of occasions: one after copy-and-pasting &lt;span style="color: rgb(255, 0, 0);"&gt;Edit HTML&lt;/span&gt; to stop red coloring of the following text, and the other to qualify the 3DNA forum link text.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-49010245119382471?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/49010245119382471/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/flexibility-vs-convenience-in-software.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/49010245119382471'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/49010245119382471'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/flexibility-vs-convenience-in-software.html' title='Blogger&apos;s duo-editing modes allow for flexibility and convenience'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2176493709500983247</id><published>2009-10-09T20:47:00.008-04:00</published><updated>2009-10-09T22:31:43.191-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>Fiber models in 3DNA make it easy to build regular DNA helices</title><content type='html'>3DNA contains 55 fiber models compiled from literature. To the best of my knowledge, this is the most comprehensive collection of its kind (detailed list given below), including:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Chandrasekaran &amp;amp; Arnott (from #1 to #43) — the most well-known set of fiber models&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Alexeev et al. (#44-#45)&lt;/li&gt;&lt;li&gt;van Dam &amp;amp; Levitt (#46-#47)&lt;/li&gt;&lt;li&gt;Premilat &amp;amp; Albiser (#48-#55)&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;The utility program &lt;span style="font-weight: bold; color: rgb(255, 0, 0);"&gt;fiber &lt;/span&gt;puts the generation of all these fiber models in a simple, consistent interface, and can generate structures in either standard PDB or PDBML format. Of those models, some can be built with an arbitrary sequence of A, C, G and T (e.g., A-/B-/C-DNA from calf thymus), while others are of fixed sequences (e.g., Z-DNA with GC repeats). The sequence can be specified either from command-line or a plain text file.&lt;br /&gt;&lt;br /&gt;Once 3DNA in properly installed, the command-line interface is the most versatile and convenient way to generate, e.g., a regular double strand DNA (mostly, B-DNA) of arbitrary sequence. Moreover, &lt;a href="http://xiang-jun.blogspot.com/2009/07/two-web-interfaces-to-3dna.html"&gt;the w3DNA and 3DART web-interface&lt;/a&gt; to 3DNA makes it easy to generate a regular DNA model, especially for occasional use or educational purposes.&lt;br /&gt;&lt;br /&gt;Theoretically, there is nothing to worth showing off in 3DNA's fiber model generation functionality. However, it serves as a clear example of the differences between a "proof of concept" and a practical software application. I initially decided to work on this issue simply for my own convenience. At that time, I had access to A-DNA and B-DNA fiber model generators, each as a separate program. Moreover, the constructed models did not comply to the PDB format in atom naming and other subtitles.&lt;br /&gt;&lt;br /&gt;I started with the Chandrasekaran &amp;amp; Arnott fiber models which I had a copy of data files. Nevertheless, there are many details to work out, typos to correct, etc to put them in a consistent framework. For other models, I read each original publication, and typed into computer (with help from Dr. Olson's secretary) raw atomic cylindrical coordinates for each model. Again, quite a few inconsistencies popped up between the different publications with a time span over decades.&lt;br /&gt;&lt;br /&gt;Overall, it was a quite tedious undertaking, requiring great attention to details. I am glad that I did that: I learned so much from the process, and more importantly, others can benefit from my effort. As I put in the &lt;a href="http://www.natureprotocols.com/2008/07/03/3dna_a_versatile_integrated_so.php"&gt;3DNA Nature Protocol paper (BOX 6 | FIBER-DIFFRACTION MODELS)&lt;/a&gt;,&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;In preparing this set of fiber models, we have taken great care to ensure the accuracy and consistency of the models. For completeness and user verification, 3DNA includes, in addition to 3DNA-processed files, the &lt;span style="font-weight: bold; font-style: italic;"&gt;original coordinates collected from the literature&lt;/span&gt;.&lt;/blockquote&gt;&lt;br /&gt;For those who want to understand what's going on under the hood, there is no better way than to try to reproduce the process using, e.g., fiber B-DNA as an example.&lt;br /&gt;&lt;br /&gt;From the very beginning, I had expected the &lt;span style="font-weight: bold;"&gt;fiber&lt;/span&gt; functionality to be easily reachable and thus beneficial to all those who are interested in nucleic acid structures, especially to build a regular DNA duplex of chosen sequence. In my sense, the &lt;span style="font-weight: bold; color: rgb(255, 0, 0);"&gt;fiber&lt;/span&gt; program has not been widely used as it should have been, probably due to the fact that many people are not aware of its existence or capability. Hopefully, this blog post would help to get my message across.&lt;br /&gt;&lt;br /&gt;&lt;hr /&gt;PS. Given below is the content of the README file for fiber models in 3DNA&lt;br /&gt;&lt;pre style="font-size: 10px;"&gt;&lt;br /&gt;1. The repeating units of each fiber structure are mostly based on the&lt;br /&gt;   work of Chandrasekaran &amp; Arnott (from #1 to #43). More recent fiber&lt;br /&gt;   models are based on Alexeev et al. (#44-#45), van Dam &amp; Levitt (#46&lt;br /&gt;   -#47) and Premilat &amp; Albiser (#48-#55).&lt;br /&gt;&lt;br /&gt;2. Clean up of each residue&lt;br /&gt;   a. currently ignore hydrogen atoms [can be easily added]&lt;br /&gt;   b. change ME/C7 group of thymine to C5M&lt;br /&gt;   c. re-assign O3' atom to be attached with C3'&lt;br /&gt;   d. change distance unit from nm to A [most of the entries]&lt;br /&gt;   e. re-ordering atoms according to the NDB convention&lt;br /&gt;&lt;br /&gt;3. Fix up of problem structures.&lt;br /&gt;   a. str#8 has no N9 atom for guanine&lt;br /&gt;   b. str#10 is not available from the disk, manually input&lt;br /&gt;   c. str#14 C5M atom was named C5 for Thymine, resulting two C5 atoms&lt;br /&gt;   d. str#17 has wrong assignment of O3' atom on Guanine&lt;br /&gt;   e. str#33 has wrong C6 position in U3&lt;br /&gt;   f. str#37 to #str41 were typed in manually following Arnott's&lt;br /&gt;        new list as given in "Oxford Handbook of Nucleic Acid Structure"&lt;br /&gt;        edited by S. Neidle (Oxford Press, 1999)&lt;br /&gt;   g. str#38 coordinates for N6(A) and N3(T) are WRONG as given in the&lt;br /&gt;        original literature&lt;br /&gt;   h. str#39 and #40 have the same O3' coordinates for the 2nd strand&lt;br /&gt;&lt;br /&gt;4. str#44 &amp; 45 have fixed strand II residues (T)&lt;br /&gt;&lt;br /&gt;5. str#46 &amp; 47 have +z-axis upwards (based on BI.pdb &amp; BII.pdb)&lt;br /&gt;&lt;br /&gt;6. str#48 to 55 have +z-axis upwards&lt;br /&gt;&lt;br /&gt;List of 55 fiber structures&lt;br /&gt;&lt;br /&gt;id#  Twist   Rise        Structure description&lt;br /&gt;    (dgrees)  (A)&lt;br /&gt;-------------------------------------------------------------------------------&lt;br /&gt; 1   32.7   2.548  A-DNA  (calf thymus; generic sequence: A, C, G and T)&lt;br /&gt; 2   65.5   5.095  A-DNA  poly d(ABr5U) : poly d(ABr5U)&lt;br /&gt; 3    0.0  28.030  A-DNA  (calf thymus) poly d(A1T2C3G4G5A6A7T8G9G10T11) :&lt;br /&gt;                                        poly d(A1C2C3A4T5T6C7C8G9A10T11)&lt;br /&gt; 4   36.0   3.375  B-DNA  (calf thymus; generic sequence: A, C, G and T)&lt;br /&gt; 5   72.0   6.720  B-DNA  poly d(CG) : poly d(CG)&lt;br /&gt; 6  180.0  16.864  B-DNA  (calf thymus) poly d(C1C2C3C4C5) : poly d(G6G7G8G9G10)&lt;br /&gt; 7   38.6   3.310  C-DNA  (calf thymus; generic sequence: A, C, G and T)&lt;br /&gt; 8   40.0   3.312  C-DNA  poly d(GGT) : poly d(ACC)&lt;br /&gt; 9  120.0   9.937  C-DNA  poly d(G1G2T3) : poly d(A4C5C6)&lt;br /&gt;10   80.0   6.467  C-DNA  poly d(AG) : poly d(CT)&lt;br /&gt;11   80.0   6.467  C-DNA  poly d(A1G2) : poly d(C3T4)&lt;br /&gt;12   45.0   3.013  D-DNA  poly d(AAT) : poly d(ATT)&lt;br /&gt;13   90.0   6.125  D-DNA  poly d(CI) : poly d(CI)&lt;br /&gt;14  -90.0  18.500  D-DNA  poly d(A1T2A3T4A5T6) : poly d(A1T2A3T4A5T6)&lt;br /&gt;15  -60.0   7.250  Z-DNA  poly d(GC) : poly d(GC)&lt;br /&gt;16  -51.4   7.571  Z-DNA  poly d(As4T) : poly d(As4T)&lt;br /&gt;17    0.0  10.200  L-DNA  (calf thymus) poly d(GC) : poly d(GC)&lt;br /&gt;18   36.0   3.230  B'-DNA alpha poly d(A) : poly d(T) (H-DNA)&lt;br /&gt;19   36.0   3.233  B'-DNA beta2 poly d(A) : poly d(T) (H-DNA  beta)&lt;br /&gt;20   32.7   2.812  A-RNA  poly (A) : poly (U)&lt;br /&gt;21   30.0   3.000  A'-RNA poly (I) : poly (C)&lt;br /&gt;22   32.7   2.560  Hybrid poly (A) : poly d(T)&lt;br /&gt;23   32.0   2.780  Hybrid poly d(G) : poly (C)&lt;br /&gt;24   36.0   3.130  Hybrid poly d(I) : poly (C)&lt;br /&gt;25   32.7   3.060  Hybrid poly d(A) : poly (U)&lt;br /&gt;26   36.0   3.010  10-fold poly (X) : poly (X)&lt;br /&gt;27   32.7   2.518  11-fold poly (X) : poly (X)&lt;br /&gt;28   32.7   2.596  Poly (s2U) : poly (s2U) (symmetric base-pair)&lt;br /&gt;29   32.7   2.596  Poly (s2U) : poly (s2U) (asymmetric base-pair)&lt;br /&gt;30   32.7   3.160  Poly d(C) : poly d(I) : poly d(C)&lt;br /&gt;31   30.0   3.260  Poly d(T) : poly d(A) : poly d(T)&lt;br /&gt;32   32.7   3.040  Poly (U) : poly (A) : poly(U) (11-fold)&lt;br /&gt;33   30.0   3.040  Poly (U) : poly (A) : poly(U) (12-fold)&lt;br /&gt;34   30.0   3.290  Poly (I) : poly (A) : poly(I)&lt;br /&gt;35   31.3   3.410  Poly (I) : poly (I) : poly(I) : poly(I)&lt;br /&gt;36   60.0   3.155  Poly (C) or poly (mC) or poly (eC)&lt;br /&gt;37   36.0   3.200  B'-DNA beta2  Poly d(A) : poly d(U)&lt;br /&gt;38   36.0   3.240  B'-DNA beta1  Poly d(A) : poly d(T)&lt;br /&gt;39   72.0   6.480  B'-DNA beta2  Poly d(AI) : poly d(CT)&lt;br /&gt;40   72.0   6.460  B'-DNA beta1  Poly d(AI) : poly d(CT)&lt;br /&gt;41  144.0  13.540  B'-DNA  Poly d(AATT) : poly d(AATT)&lt;br /&gt;42   32.7   3.040  Poly(U) : poly d(A) : poly(U) [cf. #32]&lt;br /&gt;43   36.0   3.200  Beta Poly d(A) : Poly d(U) [cf. #37]&lt;br /&gt;44   36.0   3.233  Poly d(A) : poly d(T) (Ca salt)&lt;br /&gt;45   36.0   3.233  Poly d(A) : poly d(T) (Na salt)&lt;br /&gt;46   36.0   3.38   B-DNA (BI-type nucleotides; generic sequence: A, C, G and T)&lt;br /&gt;47   40.0   3.32   C-DNA (BII-type nucleotides; generic sequence: A, C, G and T)&lt;br /&gt;48   87.8   6.02   D(A)-DNA  ploy d(AT) : ploy d(AT) (right-handed)&lt;br /&gt;49   60.0   7.20   S-DNA  ploy d(CG) : poly d(CG) (C_BG_A, right-handed)&lt;br /&gt;50   60.0   7.20   S-DNA  ploy d(GC) : poly d(GC) (C_AG_B, right-handed)&lt;br /&gt;51   31.6   3.22   B*-DNA  poly d(A) : poly d(T)&lt;br /&gt;52   90.0   6.06   D(B)-DNA  poly d(AT) : poly d(AT) [cf. #48]&lt;br /&gt;53  -38.7   3.29   C-DNA (generic sequence: A, C, G and T) (depreciated)&lt;br /&gt;54   32.73  2.56   A-DNA (generic sequence: A, C, G and T) [cf. #1]&lt;br /&gt;55   36.0   3.39   B-DNA (generic sequence: A, C, G and T) [cf. #4]&lt;br /&gt;-------------------------------------------------------------------------------&lt;br /&gt;List 1-41 based on Struther Arnott: ``Polynucleotide secondary structures:&lt;br /&gt;     an historical perspective'', pp. 1-38 in ``Oxford Handbook of Nucleic&lt;br /&gt;     Acid Structure'' edited by Stephen Neidle (Oxford Press, 1999).&lt;br /&gt;     &lt;br /&gt;     #42 and #43 are from Chandrasekaran &amp; Arnott: "The Structures of DNA&lt;br /&gt;     and RNA Helices in Oriented Fibers", pp 31-170 in "Landolt-Bornstein&lt;br /&gt;     Numerical Data and Functional Relationships in Science and Technology"&lt;br /&gt;     edited by W. Saenger (Springer-Verlag, 1990).&lt;br /&gt;&lt;br /&gt;#44-#45 based on Alexeev et al., ``The structure of poly(dA) . poly(dT)&lt;br /&gt;     as revealed by an X-ray fiber diffraction''. J. Biomol. Str. Dyn, 4,&lt;br /&gt;     pp. 989-1011, 1987.&lt;br /&gt;&lt;br /&gt;#46-#47 based on van Dam &amp; Levitt, ``BII nucleotides in the B and C forms&lt;br /&gt;     of natural-sequence polymeric DNA: a new model for the C form of DNA''.&lt;br /&gt;     J. Mol. Biol., 304, pp. 541-561, 2000.&lt;br /&gt;&lt;br /&gt;#48-#55 based on Premilat &amp; Albiser, ``A new D-DNA form of poly(dA-dT) .&lt;br /&gt;     poly(dA-dT): an A-DNA type structure with reversed Hoogsteen Pairing''.&lt;br /&gt;     Eur. Biophys. J., 30, pp. 404-410, 2001 (and several other publications).&lt;br /&gt;&lt;/pre&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2176493709500983247?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2176493709500983247/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/fiber-models-in-3dna.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2176493709500983247'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2176493709500983247'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/fiber-models-in-3dna.html' title='Fiber models in 3DNA make it easy to build regular DNA helices'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-7806011285266775138</id><published>2009-10-04T22:13:00.008-04:00</published><updated>2009-12-12T18:39:03.173-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>3DNA in molecular dynamics simulations</title><content type='html'>While updating &lt;a href="http://scholar.google.com/scholar?hl=en&amp;amp;lr=&amp;amp;q=allintitle%3A3dna+software+author%3A%22XJ+Lu%22&amp;amp;btnG=Search"&gt;3DNA citations&lt;/a&gt; last Friday, I came across the paper "Flexibility of Short-Strand RNA in Aqueous Solution as Revealed by Molecular Dynamics Simulation: Are A-RNA and A′-RNA Distinct Conformational Structures?" by Ouyang et al. [&lt;a href="http://www.publish.csiro.au/nid/51/paper/CH09090.htm"&gt;Aust. J. Chem. 2009, 62, 1054–1061&lt;/a&gt;]. Through molecular dynamics&lt;span style=""&gt; (MD) simulations over a 30 ns period&lt;/span&gt;, the authors found that "&lt;span style=""&gt;the identification of distinct A-RNA and A′-RNA structures ... may not be generally relevant in the context of RNA in the aqueous phase." Overall, the paper is nicely written.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style=""&gt;I have never performed MD simulations in my research experience, so normally I only read abstracts of such publications just to get a general idea of the main conclusions. What had attracted my attention to this work was its simultaneous citations to three earlier papers:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/9917397"&gt;Resolving the discrepancies among nucleic acid conformational analyses&lt;/a&gt;.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/10217453"&gt;Overview of nucleic acid analysis programs&lt;/a&gt;.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/11601858"&gt;A standard reference frame for the description of nucleic acid base-pair geometry&lt;/a&gt;.&lt;/li&gt;&lt;/ul&gt;This is quite unusual, since nowadays it is far more common to only cite 3DNA itself (mostly 2003 NAR and/or 2008 NP). So I decided to have a look of the whole paper. It turned out that the authors used the extra citations to justify their choice of using 3DNA instead of Curves to calculate the helical parameters, including &lt;/span&gt;&lt;span&gt;&lt;span style=""&gt;"the three main structural descriptors commonly used to differentiate between the two forms of RNA – namely major groove width, inclination and the number of base pairs in a helical twist [turn]"&lt;/span&gt;&lt;/span&gt;&lt;span style=""&gt;.&lt;br /&gt;&lt;br /&gt;I communicated to the authors about the availability of Curves+. Specifically, using one case (413D, one of the three structures in their Table 1, "Comparison of different results by CURVES and 3DNA programs"), I've tried to illustrate the point that &lt;a href="http://xiang-jun.blogspot.com/2009/08/curves-vs-3dna.html"&gt;&lt;span style="font-weight: bold; font-style: italic;"&gt;Curves+&lt;/span&gt; and 3DNA now give directly comparable parameters&lt;/a&gt;. &lt;/span&gt;&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt;Of course, I am glad to see 3DNA being applied to molecular dynamics simulations of nucleic acid structures. &lt;/span&gt;Hopefully, more such applications will show up in the future, and I am willing to offer my help in ways that &lt;span style="font-style: italic;"&gt;make sense&lt;/span&gt; to me.&lt;br /&gt;&lt;span style=""&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-7806011285266775138?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/7806011285266775138/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/3dna-in-molecular-dynamics-simulations.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7806011285266775138'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/7806011285266775138'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/3dna-in-molecular-dynamics-simulations.html' title='3DNA in molecular dynamics simulations'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-3947063237485410163</id><published>2009-10-03T22:15:00.004-04:00</published><updated>2009-10-03T23:58:57.117-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><title type='text'>Whenever in doubt, check with the author</title><content type='html'>Once in a while, I send emails to authors of papers I am interested in, sometimes simply to ask for PDF reprints, mostly to request for clarifications of points I cannot understand fully. Of course, the responses I have received vary significantly: some authors are responsive and are able to answer my questions concretely; while others respond less professionally; in no small percentage, I get no feedback at all. Whatever the case, though, sending querying emails is convenient, and the responses I get (even no response at all) are informative. Naturally, I would take more seriously the papers whose authors are responsive. On the other hand, in my memory, I have &lt;a href="http://xiang-jun.blogspot.com/2009/05/phd-pages-on-dna-base-stacking.html"&gt;never ignored a reader's question of my publications&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Seeking clarification on a scientific software from author(s) or maintainer(s) is even more important due to inherent subtleties of (undocumented) details, as is common in (bio)informatics. In supporting 3DNA over the years, I've experienced quite a few cases where authors of some articles are misinformed in making judgment about 3DNA's functionality. In one case, I read in a paper claiming 3DNA cannot handle Hoogsteen base-pairs while Curves can. A few email exchanges with the corresponding author (who was very responsive and professional) turned out that an internally modified version of Curves was used. More recently, I found a paper claiming that &lt;span style="font-weight: bold;"&gt;find_pair&lt;/span&gt; from 3DNA failed to identify some base pairs in DNA-protein complexes where a new method succeeded. I asked for the missing list, and immediately noticed that simply relaxing some criteria recovered virtually all of the pairs. Thus, to make a  &lt;span style="font-weight: bold; font-style: italic;"&gt;convincing &lt;/span&gt;comparison of scientific software, it is &lt;span style="font-weight: bold; font-style: italic;"&gt;crucial&lt;/span&gt; to check with the original authors to avoid misunderstandings. Serious scientific software developers and maintainers always welcome users' feedback. Why not ask for clarifications if one really wants to make a (strong) point in comparison? Of course, it is another story for unsupported software.&lt;br /&gt;&lt;br /&gt;The Internet age has provided unprecedented convenience for scientific communication. It would be a pity not to take full advantage of it. One simple and important thing to do is: whenever in doubt, ask for clarification from corresponding author of a publication or maintainer of a software.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-3947063237485410163?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/3947063237485410163/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/whenever-in-doubt-check-with-author.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3947063237485410163'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/3947063237485410163'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/10/whenever-in-doubt-check-with-author.html' title='Whenever in doubt, check with the author'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5907171579561552917</id><published>2009-09-27T22:50:00.010-04:00</published><updated>2009-12-12T18:39:46.779-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>On reproducibility of scientific publications</title><content type='html'>In the September 25, 2009 issue of Science (Vol. 325, pp.1622-3), I read with interest the letter from Osterweil &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt; "Forecast for Reproducible Data: Partly Cloudy" and the response from Nelson. This exchange of views highlights the difficulty/importance for one research team to precisely reproduce results from another when &lt;span style="font-style: italic;"&gt;elaborate&lt;/span&gt; computation is involved. As is well known, subtle differences in computer hardware and software, different versions of the same software, or even different options of the same version, could all play a role. Without specifying those details, it is virtually impossible to repeat a publication exactly.&lt;br /&gt;&lt;br /&gt;This reminds me of a recent paper "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/19174838"&gt;Repeatability of published microarray gene expression analyses&lt;/a&gt;" by Ioannidis et al. [&lt;span style="font-style: italic;"&gt;Nat Genet.&lt;/span&gt; 2009, 41(2):149-55]. In the abstract, the authors summarized their findings:&lt;br /&gt;&lt;blockquote&gt;Here we evaluated the replication of data analyses in 18 articles on microarray-based gene expression profiling published in &lt;span style="font-style: italic;"&gt;Nature Genetics&lt;/span&gt; in 2005-2006. One table or figure from each article was independently evaluated by two teams of analysts. We reproduced two analyses in principle and six partially or with some discrepancies; ten could not be reproduced. The main reason for failure to reproduce was data unavailability, and discrepancies were mostly due to incomplete data annotation or specification of data processing and analysis.&lt;/blockquote&gt;&lt;br /&gt;Specifically, please note that:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;The authors are experts on microarray analysis, not occasional application software users.&lt;/li&gt;&lt;li&gt;These 18 articles surveyed were published in &lt;span style="font-style: italic;"&gt;Nature Genetics&lt;/span&gt;, one of the top journals of its field.&lt;/li&gt;&lt;li&gt;Not a single analysis could be reproduced exactly: two were reproduced in principle, six only partially, and the other ten not at all.&lt;/li&gt;&lt;/ol&gt;Without being able to reproduce exactly the results from others, it is hard to build upon previous work and move forward. The various reasons for lack of reproducibility listed by Ioannidis et al. are certainly not limited to microarray analysis. As far as I can tell, they also exist in the fields of RNA structure analysis and predictions, energetics of protein-DNA interactions, quantum mechanics calculations, and molecular dynamics simulations etc.&lt;br /&gt;&lt;br /&gt;In my experience and understanding, the methods section in journal articles is not, and should not aim to be, detailed enough for exact duplication by a qualified reader. Instead, most such reproducibility issues would be gone if journals require that authors provide raw data, detailed procedures used to process the data, software version and options used to generate the figures and tables reported in the publication. Such information could be made available in (journal or authors) websites. This is an effective way to solve the problem, especially for computational, informatics-related articles. Over the years, for papers I am the first author or I have made major contributions, I've always kept a folder for each article to include &lt;span style="font-weight: bold; font-style: italic;"&gt;every detail&lt;/span&gt; (data files, scripts etc) so that the published tables and figures can be repeated precisely. This has turned out to be extremely helpful when I want to refer back to early publications, or when I was asked by readers for further details.&lt;br /&gt;&lt;br /&gt;As noted by Osterweil et al., "repeatability, reproducibility, and transparency are the hallmarks of the scientific enterprise." To really achieve the goal, every scientist needs to pay more attention to details and be responsive. Do not be fool around by the impressive introduction or extensive discussions (which are important, of course) in a paper: to get the bottom of something, it is usually the details that count.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5907171579561552917?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5907171579561552917/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/09/on-reproducibility-of-scientific.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5907171579561552917'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5907171579561552917'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/09/on-reproducibility-of-scientific.html' title='On reproducibility of scientific publications'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-2572382113375197505</id><published>2009-09-26T22:53:00.008-04:00</published><updated>2009-12-28T12:05:20.897-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>RiboClub 10th Annual Meeting -- it was a good one!</title><content type='html'>I attended the RiboClub 10th Annual Meeting held during September 21 to 23 at Hotel Cheribourg, Orford, Quebec. Everyday, the schedule was fulfilled from early morning to late night. Indeed, it was so tight and intensive that I could not even find a time to walk around in the beautiful season. Overall, though, the meeting was well-organized and had a fantastic scientific program.&lt;br /&gt;&lt;br /&gt;The &lt;a href="http://riboclub.org/cgi-bin/RiboWeb/index.pl"&gt;RiboClub&lt;/a&gt; was founded ten years ago by researchers at the University of Sherbrooke, Quebec. Initially, it was local, and then the club was extended to nearby areas, and the whole Canada. As time goes, its influence has also passed the border so at its 10th anniversary, several leading RNA experts (including Phillip Sharp, Jack Szostak, Tim Nilsen, Tom Steitz et al.) from the USA also participated.&lt;br /&gt;&lt;br /&gt;I noticed the RiboClub meeting early this year when I was writing up a manuscript on an RNA structural motif. As hinted in another post, "&lt;a href="http://xiang-jun.blogspot.com/2009/07/does-3dna-work-for-rna.html"&gt;Does 3DNA work for RNA?&lt;/a&gt;", I've recently been attracted to the field of RNA structures. Using 3DNA, we have uncovered a simple RNA-specific interaction that is biologically relevant, yet virtually ignored by the community. So attending a RNA meeting would allow me to learn more about RNA and to pass our message across to a wider audience.&lt;br /&gt;&lt;br /&gt;The meeting was mostly on various aspects of RNA biology. Of which, 3-dimensional structure is an integral part, yet no specific session was devoted to it. The same applied to bioinformatics tools and applications. Thus, for example, Tom Steitz's talk on the ribosome structures was under the session titled "Translation: targets and impact".&lt;br /&gt;&lt;br /&gt;The organizers obviously paid attention to arrange meeting participants at the dinning table. So for Tuesday (Sept. 22) night, I sit next to &lt;a href="http://www.biochem.ualberta.ca/MacMillanLab/"&gt;Dr. Andrew MacMillan&lt;/a&gt; from University of Alberta. It was a nice surprise to know that Dr. MacMillan works on "structural and functional characterization of splicesome assembly and activation." I took this opportunity to read the abstracts from his lab and to talk to him about my findings that are related to pre-mRNA splicing. He visited my post the next day (Wednesday, Sept 23) and we discussed it in more details. On the Gala dinner on Wednesday, I was arranged to sit between &lt;a href="http://paul.representinggenes.org/"&gt;Dr. Paul Griffiths&lt;/a&gt;, "a philosopher of science with a focus on biology and psychology", and &lt;a href="http://www.biochem.dal.ca/faculty/facultypages/doolittle/"&gt;Dr. W. Ford Doolittle&lt;/a&gt;, a leading scientist in Comparative Genomics. It was a valuable experience to hear them and others around the table talking on politics and science-related issues.&lt;br /&gt;&lt;br /&gt;It is worth noting that Wednesday's dinner speaker was Alexander Rich. Wearing his tie from the famous &lt;a href="http://www.ambion.com/main/tieclub/flash/print_form/story_2.html"&gt;RNA Tie Club&lt;/a&gt;, Dr. Rich talked about "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/19638248"&gt;The era of RNA awakening: structural biology of RNA in the early years&lt;/a&gt;." At the age of 85, he still spoke clearly and logically. His story telling style was very effective and his talk was well-received by the audience. For those who are interested in knowing more about Dr. Rich's work, I would strongly recommend his article "&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/15189135"&gt;The excitement of discovery&lt;/a&gt;."&lt;br /&gt;&lt;br /&gt;The "Neo-Traditional Quebec Music" show on Wednesday night was exciting and relaxing, following and in contrast to the three-day long intensive scientific program. Although I did not understand the music that well, I stayed until the very end.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-2572382113375197505?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/2572382113375197505/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/09/riboclub-10th-annual-meeting-it-was.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2572382113375197505'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/2572382113375197505'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/09/riboclub-10th-annual-meeting-it-was.html' title='RiboClub 10th Annual Meeting -- it was a good one!'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-1404361133944600884</id><published>2009-09-05T21:45:00.008-04:00</published><updated>2009-12-17T23:33:38.047-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='3DNA-related'/><category scheme='http://www.blogger.com/atom/ns#' term='structural biology'/><title type='text'>Double helix groove width parameters from 3DNA</title><content type='html'>In the 3DNA output (from the &lt;span style="font-weight: bold;"&gt;analyze&lt;/span&gt; program) for a DNA/RNA duplex structure, there is a section on "Minor and major groove widths: direct P-P distances and refined P-P distances which take into account the directions of the sugar-phosphate backbones". The underlying algorithm is that of El Hassan and Calladine (1998). ``&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/9735291"&gt;Two Distinct Modes of Protein-induced Bending in DNA&lt;/a&gt;.'' &lt;span style="font-style: italic;"&gt;J. Mol. Biol.&lt;/span&gt;, &lt;span style="font-weight: bold;"&gt;v282&lt;/span&gt;, pp331-343. Note that the P-P distances need to &lt;span style="font-weight: bold;"&gt;be subtracted by 5.8 Å&lt;/span&gt; to take account of the vdw radii of the phosphate groups (2.9 Å), and for comparisons with NewHelix/FreeHelix and Curves.&lt;br /&gt;&lt;br /&gt;Using 3DNA fiber models #1 for A-DNA (calf thymus) and #4 for B-DNA (calf thymus), the groove widths are as follows:&lt;br /&gt;&lt;pre  style="color:red;"&gt;                 Minor Groove        Major Groove&lt;br /&gt;              P-P     Refined     P-P     Refined&lt;br /&gt;-----------------------------------------------------&lt;br /&gt;A-DNA (#1)      18.5      &lt;span style="font-weight: bold;"&gt;16.7&lt;/span&gt;      15.2      &lt;span style="font-weight: bold;"&gt;11.1&lt;/span&gt;&lt;br /&gt;B-DNA (#4)      11.7      &lt;span style="font-weight: bold;"&gt;11.7&lt;/span&gt;      17.2      &lt;span style="font-weight: bold;"&gt;17.2&lt;/span&gt;&lt;br /&gt;-----------------------------------------------------&lt;/pre&gt;From the above table, it is clearly that for A-DNA, the minor and major groove widths for the refined set are smaller than their corresponding non-refined counterparts (i.e., those based on direct P-P distances). For B-DNA, there are no changes between the two sets. It should be noted that in real structures (i.e., non-perfectly regular, as in X-ray crystal structures in the NDB/PDB), there are nearly always some differences between the refined vs. direct P-P distances. As a general rule, the refined set should be used.&lt;br /&gt;&lt;br /&gt;One of the key structural differences between A- and B-DNA is their opposite groove dimensions: for B-DNA, the major groove width (~17 Å) is about 5 Å wider than the minor groove width (~12 Å); whereas for A-DNA, the major groove width (~11 Å) is narrower than the minor groove width (~17 Å) by a similar amount. Since the grooves provide binding sites, the difference between A- and B-DNA grooves has important implications in DNA (groove) recognitions by ligands or proteins.&lt;br /&gt;&lt;br /&gt;In retrospect, I implemented the El Hassan and Calladine algorithm for calculating the groove widths  mainly because of its simplicity: I can understand clearly how it works visually. The algorithm is described in a two-page appendix of the above cited paper. For those who are interest in DNA structures in general and how groove widths are defined in particular, I would strongly recommend them to read the appendix carefully and try to implement it: there is no substitute for first hand experience. For an idealized cases, as the above for fiber A- and B-DNA, the implementation should be straightforward. To be more realistic, an implementation should account for missing phosphate groups in some structures (for testing purpose, simply delete one P atom from a structure), for example.&lt;br /&gt;&lt;br /&gt;As is obvious, 3DNA does &lt;span style="font-weight: bold;"&gt;not&lt;/span&gt; calculate groove depths. Over the years, I have actually been approached with requests/suggestions to provide such parameters to complement groove widths. However, for various reasons, none of the algorithms fits with 3DNA. As a general principle, I do not add new functionality to 3DNA simply for the seek of it. I must understand a new piece clearly in order to integrate it with the rest and to be able to respond concretely to possible questions from users.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-1404361133944600884?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/1404361133944600884/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/09/double-helix-groove-width-parameters.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1404361133944600884'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/1404361133944600884'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/09/double-helix-groove-width-parameters.html' title='Double helix groove width parameters from 3DNA'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7853142166506564202.post-5860114787175279531</id><published>2009-09-05T19:46:00.009-04:00</published><updated>2009-09-05T20:46:15.132-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><title type='text'>Some emacs tricks</title><content type='html'>As a Linux/Unix fan, I am very familiar with &lt;span style="font-style: italic;"&gt;vi&lt;/span&gt; and use it for quick and simple text editing purposes. Over the years, however, I have been using &lt;span style="font-style: italic;"&gt;emacs&lt;/span&gt; the most: I like its color coding and programming language-specific editing mode.&lt;br /&gt;&lt;br /&gt;Emacs is well-known for its extensibility, so there are many ways to customize it to suit one's taste. In my experience, I have found the following settings convenient:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Highlight the current line with a background color (here "greenyellow")&lt;br /&gt;&lt;pre  style="color:red;"&gt;(require 'highlight-current-line)&lt;br /&gt;(highlight-current-line-on t)&lt;br /&gt;(highlight-current-line-set-bg-color "&lt;span style="font-style: italic;"&gt;greenyellow&lt;/span&gt;")&lt;br /&gt;&lt;/pre&gt;&lt;/li&gt;&lt;li&gt;Set transient mark mode on, so that selected text become more obvious&lt;br /&gt;&lt;pre style="color: red;"&gt;(setq transient-mark-mode t)&lt;/pre&gt;&lt;/li&gt;&lt;li&gt;Show line number and column number&lt;br /&gt;&lt;pre style="color: red;"&gt;(setq line-number-mode t)&lt;br /&gt;(setq column-number-mode t)&lt;/pre&gt;&lt;/li&gt;&lt;/ul&gt;There are many features in emacs that could be handy and I am always trying to learn more of it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7853142166506564202-5860114787175279531?l=xiang-jun.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://xiang-jun.blogspot.com/feeds/5860114787175279531/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://xiang-jun.blogspot.com/2009/09/some-emacs-tricks.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5860114787175279531'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7853142166506564202/posts/default/5860114787175279531'/><link rel='alternate' type='text/html' href='http://xiang-jun.blogspot.com/2009/09/some-emacs-tricks.html' title='Some emacs tricks'/><author><name>Xiang-Jun Lu (律祥俊)</name><uri>http://www.blogger.com/profile/16532205401155665754</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='24' src='http://3.bp.blogspot.com/_CEuD25cqpCE/Ssp2Z-yzJVI/AAAAAAAAABw/us1htwutiYg/S220/Xiangjun09.jpg'/></author><thr:total>0</thr:total></entry></feed>
