AVATAR Page

 


The USF's Computer Science and
Computer Engineering Department
in collaboration with Sandia National Laboratories
is working to develop AVATAR - the
Adaptive Visualization Aid
for Touring and Recovery.


 

[Contacts]  [Publications]  [Info on ASCI Red]  [Research Sources]  [Codes]


 AVATAR Page at Sandia   - includes overview of the AVATAR project, a personnel list, and a reports archive. (Only for AVATAR project members.)

Our work will involve the ASCI Red Machine at Sandia National Laboratories .  Developed in collaboration with Intel , it is the world's first teraflop supercomputer. Information on the Accelerated Strategic Computing Initiative (ASCI) Project can be found at the ASCI Home Page .


Contacts - USF Personnel Involved in the AVATAR Project

Dr. Kevin Bowyer Research PI Email: kwb@cse.nd.edu
Dr. Larry Hall Research PI
Email: hall@cse.usf.edu
Larry Shoemaker
Email: lwshoema@cse.usf.edu

Graduated Members of the Research Team

Dr. Robert Banfield. Graduated in May 2007. Currently with WeoGeo.
Email:  rbanfiel@cse.usf.edu

Remy Losaria. Graduated in May 2006.
Email: lrosaria@cse.usf.edu

Divya Bhadoria. Graduated in August 2004.
Email: dbhadori@cse.usf.edu

Aneesh Karve. Graduated in May 2003.
Email: karve@cse.usf.edu

Dr. Steven Eschrich. Graduated in May 2003. Currently, Research Scientist with H. Lee Moffitt Cancer Center & Research Institute.
Email: eschrich@cse.usf.edu

Dr. Nitesh V. Chawla. Graduated in December 2002. Currently a research professor with Notre Dame.
Email: nchawla@cse.nd.edu

Thomas Moore. Graduated in 2002.
Email: tmoore4@cse.usf.edu

Cathy Smith. Graduated in 1998.
Email: csmith19@tampabay.rr.com


Videos

"Canister Tear Simulations"  (QuickTime Movie format) presented at the Symposium on High-Performance Computing at the University of South Florida, January 20, 2009.


Publications - Papers resulting from this research (in a reverse chronological order)

The contributing authors include the documents contained in these directories as a means to ensure timely dissemination of scholarly and technical work on a non-commercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

John N. Korecki, Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer and W. Philip Kegelmeyer, "Semi-supervised learning on large complex simulations", in Proceedings of the Nineteenth Conference of the International Association for Pattern Recognition, Tampa, Florida, USA, December 2008.

Larry Shoemaker, Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer and W. Philip Kegelmeyer, "Detecting and ordering salient regions for efficient browsing", in Proceedings of the Nineteenth Conference of the International Association for Pattern Recognition, Tampa, Florida, USA, December 2008.

Larry Shoemaker, Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer and W. Philip Kegelmeyer, "Using classifier ensembles to label spatially disjoint data", Information Fusion Journal, Special Issue on Applications of Ensemble Methods, 9:1, pp. 120-133, January 2008.

Larry Shoemaker, Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer, "Learning to Predict Salient Regions from Disjoint and Skewed Training Sets", in Proceedings of the 18th IEEE Conference on Tools with Artificial Intelligence (ICTAI 2006), Arlington, Virginia, USA, pp. 116-123, 2006.

Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer, "A Comparison of Decision Tree Ensemble Creation Techniques", IEEE Transactions on Pattern Analysis and Machine Intelligence, v. 29, no. 1, pp 173-180, January 2007.Appendix.  Note: The OpenDT software designed by Robert E. Banfield and used in this paper is available at http://sourceforge.net/projects/opendt. A copy of the datasets in OpenDT format is included in OpenDT, and  is also available here.

"Ensembles of Classifiers from Spatially Disjoint Data", Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer, The Sixth International Conference on Multiple Classifier Systems, Monterey, CA, pp. 196-205, June 2005.

Lawrence O. Hall, Divya Bhadoria, and Kevin W. Bowyer. "Learning a model from spatially disjoint data," to appear in 2004 IEEE International Conference on Systems, Man and Cybernetics.

Xiaomei Liu, Kevin W. Bowyer, and Lawrence O. Hall. "Decision Trees Work Better Than Feed-Forward Back-Propagation Neural Nets for A Specific Class of Problems," to appear in 2004 IEEE International Conference on Systems, Man and Cybernetics.

"Ensemble Diversity Measures and their Application to Thinning", Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer, Information Fusion Journal, 6:1, pp49-62, March 2005.

Xiaomei Liu, Lawrence O. Hall, and Kevin W. Bowyer. "Comments on A Parallel Mixture of SVMs for Very Large Scale Problems," Neural Computation 16 (7), July 2004, 1345-1351.

"A Comparison of Ensemble Creation Techniques", Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer, Divya Bhadoria, W. Philip Kegelmeyer and Steven Eschrich, The Fifth International Conference on Multiple Classifier Systems, Cagliari, Italy, June, 2004. [Slides]

"Learning ensembles from bites: A scalable and accurate approach", Nitesh V. Chawla, Lawrence O. Hall, KevinW. Bowyer, W. Philip Kegelmeyer, Journal of Machine Learning Research, 2003

"SMOTEBoost: Improving Prediction of the Minority Class in Boosting" N.V. Chawla, A. Lazarevic, L.O. Hall, and K.W. Bowyer, 7th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD), pp. 107 to 119, Dubrovnik, Croatia,2003.

"Comparing Pure Parallel Ensemble Creation Techniques Against Bagging" Lawrence O. Hall, KevinW. Bowyer, Robert E. Banfield, Divya Bhadoria, W. Philip Kegelmeyer and Steven Eschrich, The Third IEEE International Conference on Data Mining, Melbourne, Florida, pp. 533-536, November, 2003.

"Why are Neural Networks Sometimes Much More Accurate than Decision Trees: An Analysis on a Bio-Informatics Problem" Lawrence O. Hall, Xiaomei Liu, Kevin W. Bowyer, Robert Banfield,  IEEE International Conference on Systems, Man & Cybernetics, Washington, D.C., pp. 2851-2856, October 5–8, 2003

"A New Ensemble Diversity Measure Applied to Thinning Ensembles" Robert E. Banfield, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer, International Workshop on Multiple Classifier Systems, pp. 306 - 316, Surrey, UK, June, 2003.

"Is Error-Based Pruning Redeemable?" Lawrence O. Hall, Kevin W. Bowyer, Robert E. Banfield, Steven Eschrich, Richard Collins, International Journal on Artificial Intelligence Tools: Architectures, Languages, Algorithms, V. 12, No. 3, pp. 249-264, 2003.

"An Analysis of Neural Network Versus Decision Tree Performance on a Bio-Informatics Problem" Lawrence O. Hall, Xiaomei Liu, Kevin W. Bowyer, Robert Banfield, Workshop on Information Technology, Rabat, Morocco - March 17 - 19, 2003

"Error Based Pruning of Decision Trees grown on very datasets can work!" Lawrence O. Hall, Kevin W. Bowyer, Richard Collins, Robert Banfield, International Conference on Tools for Artificial Intelligence, pp. 233-238, November 2002.

"Learning on extremes -- size and imbalance -- of data." Nitesh V. Chawla, Dissertation Defense Slides, August 2, 2002, Department of Computer Science and Engineering, University of South Florida.

"Generalization Methods in Bioinformatics", Steven Eschrich, Nitesh V. Chawla, Lawrence O. Hall, BIOKDD 2002 (Workshop on Bioinformatics, 8th ACM SIGKDD Conference).

"SMOTE: Synthetic Minority Over-sampling technique", Nitesh Chawla, Kevin Bowyer, Lawrence Hall, Philip Kegelmeyer, Journal of Artificial Intelligence Research , Volume 16, 321 -- 357, 2002. (This paper is available online from the JAIR site.)

"Distributed Pasting of Small Votes" Nitesh V. Chawla, Lawrence O. Hall, Kevin W. Bowyer, Thomas E. Moore, W. Philip Kegelmeyer. International Workshop on Multiple Classifier Systems, 2002 .

"Distributed learning with bagging-like performance", Nitesh V. Chawla, Thomas E. Moore, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer, Clayton Springer. Published in Pattern Recognition Letters, Vol. 24 (1-3) (2003) pp. 455-471

"Extreme Data Mining: What to do with a flood of training data? " Nitesh  Chawla, Lawrence Hall, Kevin Bowyer, Thomas Moore, Clayton Springer, Philip Kegelmeyer, Invited Talk, Conference of Institute of Operations Research and Management Sciences, (INFORMS) , 2001 .

"Creating Ensembles of Classifiers" Nitesh Chawla, Steven Eschrich, Lawrence Hall, IEEE International Conference on Data Mining (ICDM), November 2001.

For the extended version of the ICDM paper "Creating Ensembles of Classifiers", Go to Technical Report: Creating Ensembles of Classifiers.

"Bagging is a Small-Data-Set phenomenon" Nitesh Chawla, Thomas  Moore, Kevin Bowyer, Lawrence Hall, Clayton Springer, Philip Kegelmeyer. International Conference on Computer Vision and Pattern Recognition (CVPR), 2001

"Investigation of bagging-like effects and decision trees versus neural nets in protein secondary structure prediction", Nitesh  Chawla, Thomas Moore, Kevin Bowyer, Lawrence Hall, Clayton Springer, Philip Kegelmeyer. Workshop on Data Mining in Bioinformatics, Knowledge Discovery and Data Mining (KDD), 2001

"SMOTE: Synthetic Minority Over-sampling technique", Nitesh Chawla, Kevin Bowyer, Lawrence Hall, Philip Kegelmeyer, International Conference on Knowledge Based Computer Systems , 2000. 

Distributed Learning on Very Large Data Sets, L.O. Hall, K.W. Bowyer, W.P. Kegelmeyer, T.E. Moore and C. Chao, Workshop on Distributed and Parallel Knowledge Discovery, International Conference Knowledge Discovery and Data Mining (KDD), 2000.

"A Parallel Decision Tree Builder for Mining Very Large Visualization DataSets", Bowyer, Hall, Chawla, Moore, Kegelmeyer. IEEE International Conference on Systems, Man, and Cybernetics, 2000.

"AVATAR -- Adaptive Visualization Aid for Touring and Recovery," Lawrence O. Hall, Kevein W. Bowyer, Nitesh V. Chawla, Thomas E. Moore, W. Philip Kegelmeyer, Sandia - Report, SAND2000-8203, Sandia National Labs , 2000.

"RiDE : Rule Learning in Distributed Environment ", Nitesh Chawla, Master's Thesis, 1999.

"Learning rules from distributed data," Lawrence O. Hall, Nitesh Chawla, Kevin W. Bowyer, W. Philip Kegelmeyer, Large-Scale Parallel KDD Systems, International Conference of Knowledge Discovery and Data Mining . Lecture Notes in Computer Science,Springer Verlag, 1999.
 

 "Modifying MUSTAFA to capture salient data"   Nitesh Chawla, Lawrence O, Hall. Internal Tech Report, ISL-99-01, Dept. Of Computer Science and Engineerin g, University of South Florida, 1999.
(html format)

"Combining Decision Trees Learned in Parallel ", Lawrence O. Hall, Nitesh Chawla and Kevin W. Bowyer. Distributed Data Mining Workshop at International Conference of Knowledge Discovery and Data Mining , 1998.

"Decision Tree Learning on Very Large Data Sets," Lawrence O. Hall, Nitesh Chawla and Kevin W. Bowyer. IEEE International Conference on Systems, Man and Cybernetics , 1998.
 


Codes

For CVS checkouts, click here.

For documentation on the checkout, click here.


Spatially Disjoint Data

1. All Robert's experiments with probabilistic majority vote
2. Summary of Robert's experiments
3. Summary of Larry's experiments


Information on ASCI Red

 Intel Technology Journal, 1st quarter, 1998 - Contains several articles on the Intel TFLOPS supercomputer (the ASCI Red)

 The Intel ASCI Red Supercomputer    -  a nice overview of the ASCI Red Machine.

ASCI PSE Platforms - Provides links to all three of the ASCI projects.

ASCI Red Users' Guide - The title speaks for itself.

ASCI Red at Sandia - By Pat Fay of Intel.
 


Research Sources

WebLUIS - USF Libraries
Elsevier Science: Site Search
IBM Technical Journals
STSC Home Page
Metacrawler Search Engine
USF's Intelligent Systems Laboratory
Associtaion for Uncertainity in Artificial Intelligence
KDNuggets
UCI Machine Learning Databases
Quest Data Mining Project
JAIR
Leo Breiman
UCI Machine Learning Group
CLIPS
Online Machine Learning Resources
UMBC Agent Web
Machine Learning Bibliographies
SPRINT

 


This page maintained by Larry Shoemaker

Send comments or suggestions to lwshoema@cse.usf.edu

Last modified January 29, 2009