public class AtomData extends Object
JavaRDD and Dataset of atoms.| Constructor and Description |
|---|
AtomData(org.apache.spark.sql.Dataset<org.biojava.nbio.structure.Atom> atomDataset)
Construct from a
Dataset |
AtomData(org.apache.spark.api.java.JavaRDD<org.biojava.nbio.structure.Atom> atomRdd)
Construct from an
JavaRDD |
| Modifier and Type | Method and Description |
|---|---|
void |
cacheData()
Cache the data - for multi-processing.
|
Map<String,Long> |
countByAtomName()
Count the number of times each atom name appears.
|
Map<String,Long> |
countByElement()
Count the number of times each element appears.
|
Map<String,Long> |
countByGroupAtomName()
Get the unique group atom name combinations in this.
|
org.apache.spark.api.java.JavaRDD<org.biojava.nbio.structure.Atom> |
getRdd()
Get the underlying
JavaRDD for this AtomDataRDD. |
public AtomData(org.apache.spark.api.java.JavaRDD<org.biojava.nbio.structure.Atom> atomRdd)
JavaRDDatomRdd - the input JavaRDDpublic AtomData(org.apache.spark.sql.Dataset<org.biojava.nbio.structure.Atom> atomDataset)
DatasetatomDataset - the input Datasetpublic org.apache.spark.api.java.JavaRDD<org.biojava.nbio.structure.Atom> getRdd()
JavaRDD for this AtomDataRDD.JavaRDD for this AtomDataRDDpublic void cacheData()
public Map<String,Long> countByElement()
public Map<String,Long> countByAtomName()
Copyright © 2016 Biojava. All Rights Reserved.