AVATAR Page
The USF's Computer Science and
Computer Engineering Department
in collaboration with Sandia National
Laboratories
is working to develop AVATAR - the
Adaptive Visualization Aid
for Touring and Recovery.
AVATAR Page at Sandia - includes overview of the AVATAR project, a personnel list, and a reports archive. (Only for AVATAR project members.)
Our work will involve the ASCI Red Machine at Sandia National Laboratories . Developed in collaboration with Intel , it is the world's first teraflop supercomputer. Information on the Accelerated Strategic Computing Initiative (ASCI) Project can be found at the ASCI Home Page .
| Dr. Kevin Bowyer | Research PI | Email: kwb@cse.nd.edu |
| Dr. Larry Hall | Research PI | Email: hall@cse.usf.edu |
| Larry Shoemaker | Email: lwshoema@cse.usf.edu |
Graduated Members of the Research Team
Dr. Robert Banfield.
Graduated in May 2007. Currently with WeoGeo.
Email: rbanfiel@cse.usf.edu
Remy Losaria. Graduated in May 2006.
Email: lrosaria@cse.usf.edu
Divya Bhadoria. Graduated in August 2004.
Email: dbhadori@cse.usf.edu
Aneesh Karve. Graduated in May 2003.
Email: karve@cse.usf.edu
Dr. Steven Eschrich.
Graduated in May 2003. Currently, Research Scientist with H. Lee Moffitt Cancer
Center & Research Institute.
Email: eschrich@cse.usf.edu
Dr. Nitesh V. Chawla. Graduated in
December 2002. Currently a research professor with Notre Dame.
Email: nchawla@cse.nd.edu
Thomas Moore. Graduated in 2002.
Email: tmoore4@cse.usf.edu
Cathy Smith. Graduated in 1998.
Email: csmith19@tampabay.rr.com
"Canister Tear Simulations" (QuickTime Movie format) presented at the Symposium on High-Performance Computing at the University of South Florida, January 20, 2009.
The contributing authors include the documents contained in these directories as a means to ensure timely dissemination of scholarly and technical work on a non-commercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.
John N. Korecki, Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer and W. Philip Kegelmeyer, "Semi-supervised learning on large complex simulations", in Proceedings of the Nineteenth Conference of the International Association for Pattern Recognition, Tampa, Florida, USA, December 2008.
Larry Shoemaker, Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer and W. Philip Kegelmeyer, "Detecting and ordering salient regions for efficient browsing", in Proceedings of the Nineteenth Conference of the International Association for Pattern Recognition, Tampa, Florida, USA, December 2008.
Larry Shoemaker, Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer and W. Philip Kegelmeyer, "Using classifier ensembles to label spatially disjoint data", Information Fusion Journal, Special Issue on Applications of Ensemble Methods, 9:1, pp. 120-133, January 2008.
Larry Shoemaker, Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer, "Learning to Predict Salient Regions from Disjoint and Skewed Training Sets", in Proceedings of the 18th IEEE Conference on Tools with Artificial Intelligence (ICTAI 2006), Arlington, Virginia, USA, pp. 116-123, 2006.
Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer, "A Comparison of Decision Tree Ensemble Creation Techniques", IEEE Transactions on Pattern Analysis and Machine Intelligence, v. 29, no. 1, pp 173-180, January 2007.Appendix. Note: The OpenDT software designed by Robert E. Banfield and used in this paper is available at http://sourceforge.net/projects/opendt. A copy of the datasets in OpenDT format is included in OpenDT, and is also available here.
"Ensembles of Classifiers from Spatially Disjoint Data", Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer, The Sixth International Conference on Multiple Classifier Systems, Monterey, CA, pp. 196-205, June 2005.
Lawrence O. Hall, Divya Bhadoria, and Kevin W. Bowyer. "Learning a model from spatially disjoint
data," to appear in 2004 IEEE International Conference on Systems, Man and
Cybernetics.
Xiaomei Liu, Kevin W. Bowyer, and Lawrence O. Hall. "Decision Trees Work Better Than
Feed-Forward Back-Propagation Neural Nets for A Specific Class of Problems,"
to appear in 2004 IEEE International Conference on Systems, Man and Cybernetics.
"Ensemble
Diversity Measures and their Application to Thinning", Robert
E. Banfield, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer, Information
Fusion Journal, 6:1, pp49-62, March 2005. Xiaomei Liu, Lawrence O. Hall, and Kevin W. Bowyer. "Comments on A Parallel Mixture of SVMs for Very Large
Scale Problems," Neural Computation 16 (7), July 2004, 1345-1351.
"A
Comparison of Ensemble Creation Techniques", Robert E. Banfield,
Lawrence O. Hall, Kevin W. Bowyer, Divya Bhadoria, W. Philip Kegelmeyer and
Steven Eschrich, The Fifth International Conference on Multiple Classifier Systems,
Cagliari, Italy, June, 2004. [Slides] "Learning
ensembles from bites: A scalable and accurate approach", Nitesh V.
Chawla, "SMOTEBoost: Improving Prediction of the Minority Class in
Boosting" N.V. Chawla, A. Lazarevic, L.O. Hall, and K.W. Bowyer, 7th European Conference on Principles and Practice of
Knowledge Discovery in Databases (PKDD), pp. 107 to 119, Dubrovnik, Croatia,2003. "Comparing
Pure Parallel Ensemble Creation Techniques Against Bagging" "Why are
Neural Networks Sometimes Much More
Accurate than Decision Trees: An Analysis on a
Bio-Informatics Problem" Lawrence
O. Hall, Xiaomei Liu, Kevin W. Bowyer, Robert Banfield,
IEEE International Conference on Systems, Man & Cybernetics, Washington,
D.C., pp. 2851-2856, October 5–8, 2003 "A New Ensemble Diversity Measure Applied to Thinning
Ensembles" Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer, W. Philip
Kegelmeyer, International Workshop on Multiple Classifier Systems, pp. 306 - 316, Surrey, UK,
June, 2003. "Is Error-Based Pruning Redeemable?" Lawrence
O. Hall, Kevin W. Bowyer, Robert E. Banfield, Steven Eschrich, Richard Collins, International Journal
on Artificial Intelligence Tools: Architectures, Languages, Algorithms, V. 12, No. 3, pp. 249-264, 2003. "An Analysis of Neural Network Versus Decision Tree
Performance on a Bio-Informatics Problem" Lawrence O. Hall, Xiaomei Liu,
Kevin W. Bowyer, Robert Banfield, Workshop on Information Technology, Rabat,
Morocco - March 17 - 19, 2003 "Error Based
Pruning of Decision Trees grown on very datasets can work!" Lawrence O.
Hall, Kevin W. Bowyer, Richard Collins, Robert Banfield, International
Conference on Tools for Artificial Intelligence, pp. 233-238, November 2002.
"Learning on
extremes -- size and imbalance -- of data." Nitesh V. Chawla,
Dissertation Defense Slides, August 2, 2002, Department of Computer Science and
Engineering, University of South Florida. "Generalization Methods in Bioinformatics", Steven Eschrich, Nitesh V.
Chawla, Lawrence O. Hall,
BIOKDD 2002 (Workshop on Bioinformatics, 8th ACM SIGKDD Conference). "SMOTE: Synthetic Minority Over-sampling technique", Nitesh Chawla,
Kevin Bowyer, Lawrence Hall, Philip Kegelmeyer,
Journal of Artificial Intelligence Research , Volume 16, 321 -- 357,
2002. (This paper is available online from the JAIR site.) "Distributed Pasting of Small Votes" Nitesh V. Chawla, Lawrence O. Hall,
Kevin W. Bowyer, Thomas E. Moore, W. Philip Kegelmeyer.
International Workshop on Multiple
Classifier Systems, 2002 . "Distributed learning with
bagging-like performance", Nitesh
V. Chawla, Thomas E. Moore, Lawrence O. Hall, Kevin W. Bowyer, W. Philip
Kegelmeyer, Clayton Springer. Published in Pattern Recognition Letters, Vol. 24
(1-3) (2003) pp. 455-471 "Creating Ensembles of Classifiers" Nitesh Chawla, Steven Eschrich,
Lawrence Hall, IEEE
International Conference on Data Mining (ICDM), November 2001. For the extended version of the ICDM paper "Creating Ensembles of
Classifiers", Go to
Technical Report: Creating Ensembles of Classifiers. "Bagging is a Small-Data-Set phenomenon" Nitesh Chawla, Thomas Moore,
Kevin Bowyer, Lawrence Hall, Clayton Springer, Philip Kegelmeyer.
International Conference on
Computer Vision and
Pattern Recognition (CVPR), 2001 "Investigation of bagging-like effects and decision trees versus neural nets in
protein secondary structure prediction", Nitesh Chawla, Thomas Moore,
Kevin Bowyer, Lawrence Hall, Clayton Springer, Philip Kegelmeyer. Workshop on
Data Mining in Bioinformatics, Knowledge
Discovery and Data Mining (KDD), 2001 "SMOTE: Synthetic Minority Over-sampling
technique", Nitesh Chawla, Kevin Bowyer, Lawrence
Hall, Philip Kegelmeyer, International Conference on
Knowledge Based Computer
Systems , 2000. Distributed Learning on Very Large Data Sets, L.O. Hall, K.W. Bowyer,
W.P. Kegelmeyer, T.E. Moore and C. Chao, Workshop on Distributed and Parallel
Knowledge Discovery, International Conference
Knowledge Discovery and Data Mining
(KDD), 2000. "A Parallel
Decision Tree Builder for Mining Very Large Visualization DataSets",
Bowyer, Hall, Chawla, Moore, Kegelmeyer. IEEE International Conference on
Systems, Man, and
Cybernetics, 2000. "AVATAR -- Adaptive Visualization Aid for Touring and Recovery,"
Lawrence O. Hall, Kevein W. Bowyer, Nitesh V. Chawla, Thomas E. Moore, W. Philip
Kegelmeyer, Sandia - Report, SAND2000-8203, Sandia National Labs , 2000.
"RiDE : Rule
Learning in Distributed Environment ", Nitesh Chawla, Master's
Thesis, 1999. "Learning rules from
distributed data," Lawrence O. Hall, Nitesh Chawla, Kevin W. Bowyer, W.
Philip Kegelmeyer, Large-Scale Parallel KDD Systems, International Conference
of Knowledge Discovery and Data Mining
. Lecture Notes in Computer Science,Springer Verlag, 1999. "Modifying MUSTAFA
to capture salient data" Nitesh Chawla, Lawrence O, Hall. Internal Tech
Report, ISL-99-01, Dept. Of Computer Science
and Engineerin g, University of South Florida, 1999. "Combining Decision
Trees Learned in Parallel ", Lawrence O. Hall, Nitesh Chawla and Kevin W.
Bowyer. Distributed Data Mining Workshop at International Conference of
Knowledge Discovery and Data Mining
, 1998. "Decision Tree Learning
on Very Large Data Sets," Lawrence O. Hall, Nitesh Chawla and Kevin W.
Bowyer. IEEE International Conference on
Systems, Man and
Cybernetics , 1998. For CVS checkouts, click here. For documentation on the checkout, click here.
1.
All Robert's experiments with probabilistic majority vote Intel Technology Journal, 1st quarter, 1998
- Contains several articles on the Intel TFLOPS supercomputer (the ASCI Red) The Intel
ASCI Red Supercomputer - a nice overview of the ASCI Red Machine. ASCI PSE Platforms -
Provides links to all three of the ASCI projects. ASCI Red Users' Guide
- The title speaks for itself. ASCI Red at Sandia - By
Pat Fay of Intel. WebLUIS - USF Libraries Send comments or suggestions to
lwshoema@cse.usf.edu Last modified January 29, 2009
"Extreme Data
Mining: What to do with a flood of training data? " Nitesh Chawla, Lawrence
Hall, Kevin Bowyer, Thomas Moore, Clayton Springer, Philip Kegelmeyer,
Invited Talk, Conference of
Institute of Operations Research and Management Sciences, (INFORMS) , 2001 .
(html format)
2.
Summary of Robert's experiments
3.
Summary of Larry's experiments
Information on ASCI Red
Research Sources
Elsevier Science: Site
Search
IBM Technical
Journals
STSC Home Page
Metacrawler Search Engine
USF's Intelligent Systems Laboratory
Associtaion for Uncertainity in Artificial
Intelligence
KDNuggets
UCI Machine
Learning Databases
Quest Data Mining Project
JAIR
Leo Breiman
UCI Machine Learning Group
CLIPS
Online
Machine Learning Resources
UMBC Agent Web
Machine Learning Bibliographies
SPRINT
This page maintained by Larry Shoemaker