A Visual Analytics Approach Applying for Discovering Knowledge from Multivariate Datasets of Stakeholders Feedback in the University
Abstract
In accrediting training quality, assessing the educational quality of a university is an extremely important task to affirm the prestige of the university to the society. Areas of assessment include training program, teaching and learning activities, capacity of graduate student, and the reputation of the university in society. This research studies the characteristics of the survey dataset structures to form data variables and analyzes the relations between data variables to build non-spatial and temporal multidimensional model (nSTM), applies the establishment of visual graphs according to the data variables representing the datasets with vertical bars indicating the values of the data variables being visually represented. This research conducts experiments on the dataset structures surveying the areas of a university. With this scientific approach, it will help university administrators to build policies to improve in all aspects and especially improve the quality of university education.
1. Introduction
Educational management at universities is increasingly oriented towards being both modern and adaptive to the practical needs of society. Many universities have been conducting verification at many levels with different scales, domestically and internationally. Especially, the quality verification activities help the university leaders to be aware of the real values that the university owns and is more aware of the university’s position in the face of modern requirements of international integration and practical requirements of society as a basis for forming necessary policies to raise the university’s level in competitive environments.
Collecting opinions from the society about the university’s training products is one of the activities of the quality verification process. With the opinion of employers, they evaluate the university’s created products as generations of university graduates who have contributed to the society. With the opinion of the university graduates, they will evaluate the adaptability of the life baggage provided by the university’s faculty during their time at the university. With the instructor’s opinions, they will assess the opinions of their satisfying ability towards teaching job and the appropriateness of university’s policies. In reality, there are many cases that university leaders have not fully analyzed and applied the meaning of data to create appropriate policies.
Approaching data visualization and visual analytics can make it easy for university administrators to understand the mean of the data which they have with their experiences, management knowledges, and own requirements. Approaching data visualization is a mapping that transforms data into information and knowledge through visual perception by the human vision organs. In other words, data visualization transforms data from different forms and structures into images so that people can perceive information and knowledge by the human vision organs and by the viewing–thinking method. Regarding the visual analytics approach, this method uses an interactive visual interface to assist users in extracting information and discovering knowledge from multivariate datasets.1 It can be said that data analysis cannot be performed without the question being asked. The question is does the initial step and the basis for the impulsion of data analysis process need to be conducted?2,3 Therefore, data analysis is the process of perception by the human vision organs that serves as the key to unlocking the deep meaning of things and phenomena by asking people questions and from there through the model of visual viewing–thinking will upgrade people’s knowledge so that they can answer the questions posed by themselves. With this scientific approach, it will help humans, especially the leadership levels of a university, have an insight into the survey datasets of stakeholders, from which to make the right decisions policy making and decision making.
This objective of this paper is to study the characteristics of the survey datasets structures to form data variables and analyze the relations between data variables to build non-spatial and temporal multidimensional model (nSTM), apply the set of visual graphs according to the data variables representing the datasets with vertical bars indicating the values of the data variables. Visualization techniques are applied to represent survey data about the teaching and learning activities of the university into visual graphs so that university administrators can perceive the meaning hidden within multivariate datasets. This paper focuses on building a process for visual analysis of the survey dataset and transforming the survey dataset into an evaluative dataset and the dataset structure as a set of data variables and relations of them. This paper classifies the types of questions (the elementary analysis questions, overall analysis questions, and relation analysis questions see details in Sec. 4.2.2) to set up data visual analysis questionnaires and designing visual graphs help users can view - think and answer analytical questions for university development management policy making.
The rest of this paper is organized as follows. Section 2 carries out overview of the related research work from which comments and suggestions are made. Section 3 focuses on researching the characteristics of the survey dataset structures to find out the data variables to build nSTM representing the visual graphs of multivariate data. Section 4 illustrates the experimental scenario on the areas and the evaluation criteria of a university’s survey dataset to help university leaderships in formulating policies to improve and enhance the quality of university training. Section 5 briefly presents conclusions and proposes directions for development in the future.
2. Related Works
Educational data mining, data analysis, exploratory data analysis, knowledge discovery in databases, data mining, data visualization, and educational data visual analytics are promising areas in the future. These areas have been of great interest to researchers, especially as humans make decisions increasingly based on feelings and influences on the mind to make policies to improve and enhance the training quality of universities and social organizations as a premise for the future. This paper makes an overview of some scientific achievements in data analysis by visual methods.
With one’s own cognitive stream, human realizes the perception of the objects, that cognitive stream is made into a hierarchy system from viewing — perceiving — cognizing — analyzing — understanding — remembering. This hierarchy system helps human perceive real-world objects that happen automatically and continuously over time and create a closely linked chain. The data visualization process is a process of information retrieval and knowledge discovery from data, Chen and associates propose that this visualization process consists of two main components, the computational spatial component and the perceiving spatial component and visual cognizing. In addition, this process has additional components including, the visualization process complementary component, the knowledge-based system component, and the component that is extended into knowledge support infrastructure engineering.4 Knowledge-based visualization workflows with visualization process that simulate visualization transform data through the stream of visualizations over time and transform that data into knowledge using ability of argument based on specific components already present. This approach alleviates the difficulty of recording human knowledge.
The model of knowledge generation in the area of visual analytics proposed by Sacha and associates takes the role of humans involved in the process as the center and the rest of the process is automatically performed by the computer. Humans perform activities through discovery, inspection, and knowledge generation processes, which express human–computer interactions.5 Daniel Keim and associates propose a combination of modeling and visualization methods with the main purpose of helping humans participate in the process of knowledge discovering from data to build a visual analytics system.6 With the modeling method, the data is automatically analyzed by the user requesting it, while with the visualization method, it is human-centered in the cognitive process. In other words, the visualization approach is based on the experience and available knowledge of each human in many areas of expertise to be able to extract information and discover knowledge from a multivariate dataset. The data visual analytics process consists of three main components proposed by Keim and associates7 based on the visualization model of Wijk.8 The component receives raw data sources which are analyzed using data analysis methods to create data structures from datasets of surveying. The component applies the visualization method to convert the classified data structure into visual images.
Regular evaluation of students about lecturer teaching is one of the important criteria in higher education to determine the effectiveness of lectures, courses, and majors. This assessment is still unsatisfactory because of low-response rate and quality. This research9 shows how to increase the response rate as well as the quality of student feedback. This research10 also addressed student evaluation of lecture teaching, with the results of this assessment being used for both courses improvement and quality assurance purposes. This research11 interprets the correctness of student rating data and uses this data appropriately to evaluate lecturers and courses. Now that, more and more universities are using student ranking data to guide staffing decisions, it is important that senior administrators and experts must know information about lecturer scientific research to use and interpret in assessing lecture teaching quality. In the research,12 the assessment of student about teaching is being widely used to measure the teaching quality of lecturers in higher education and applied it to courses, lecturers, departments, and organizations. However, the response to these surveys is low which makes this usage unreliable.
Modeling of student competency is one of the most challenging and popular research topics in educational data mining. The strength of the education dataset is its large availability, which further drives research, especially in online education.13 Educational data mining is to discover student-generated learning behaviors in courses and to predict student learning competency. Student learning models that have been studied and proposed by researchers14 include active interaction groups, stable learning groups, active teaching document groups, and negative learning groups. These groups also answered questionnaires on student-learning consciousness. With the results of this research, lecturers can use the mid-term forecasting system to identify groups that may be at high risks in the term exam and to overcome their academic tasks in the future.14 Over time, educational organizations have generated and stored huge amounts of educational data leading to major challenges in educational data mining.15,16
Through the analysis of the above researches, the data mining techniques, models and processes have been appreciably contributing to discover the hidden values in the educational data. Some authors have applied for the approaches of visualization processes, knowledge generation models, modeling of student competency, and regular evaluation of students to analyze the correlation between learning and teaching activities. Therefore, this research studies educational data variables to form nSTM of multivariable data that visually represents data about teaching and learning activities to extract information and discover knowledge to serve university leaders to develop policies to improve and enhance the quality of university education. In the visual analytics and mathematical modeling, the values of dependent variables depend on the values of independent variables. The dependent variables represent the output or outcome for the change being studied. The independent variables represent the input or cause, that is, the reason for the change. In addition, a symbol representing an arbitrary input is also called an independent variable while a symbol representing an arbitrary output is also called a dependent variable. Through the above analysis, this paper has transformed all survey datasets of a university through the academic years 2016–2017, 2017–2018, 2018–2019, 2019–2020, 2020–2021 ≡ 16–17, 17–18, 18–19, 19–20, 20–21 into visual graphs based on nSTM to apply data visual analytics, which will be discussed in detail in this paper in the following sections. In the process of this survey, this paper discovered that the dataset variables including the human variable, the criterion variable, the time variable, the survey variable, the area variable, and the evaluative variable are presented in following sections.
3. Survey Datasets of the University
3.1. Datasets
The survey dataset is the evaluation sheets of a university on the occasion of the 2021 quality accreditation. These evaluation sheets are prepared according to each criterion related to the training quality of the university. The evaluation criteria are developed from the areas to be surveyed, including: Training programs, teaching and learning activities, and the capacity of graduates (including newly graduated engineers, engineers with three years of experience), and the prestige of the university to society. The evaluation sheets are scored on a point ladder of 1, 2, 3, 4, and 5. In which, point 5 indicates the best rating for each criterion, and point 1 indicates the worst rating for each criterion. These survey sheets were conducted in a survey with the humans who are working in the university related to the quality of training and those who are working outside the university related to the quality of training, including faculty leaders, departmental leaders, faculty members and lecturers, graduate students, and businesses (specifically, employers from businesses, by partners related to the training program of the university).
3.2. Survey contents
One of the important tasks in the process of accrediting the educational quality of a university is to survey the evaluation of those who are related to the training of the university according to each criterion. The survey is carried out according to the areas, according to the parent criteria and according to the child criteria. These criteria will be grouped into the respective areas.
3.2.1. Training program
The area of training program d1 surveyed is the program of a university. Therefore, the proposed evaluation criteria are designed in accordance with the university training levels.
Training objectives and expected learning outcomesp1 :
– The training objectives and expected learning outcomes of the training program are consistent with the reality of the university c1.1: This criterion is applied to survey and evaluate the appropriateness of the training program with the actual conditions of the training university.
– The training objectives and output standards of the training program are consistent with the needs of society and businesses c1.2: This criterion is applied to survey and evaluate the appropriateness of training programs with social and business needs.
– The training objectives and output standards of the training program are consistent with the needs of the learners themselves c1.3: This criterion is applied to survey and evaluate the appropriateness of the training program for learners.
Features of the training program p2 :
– Updating of the training program c2.1: Updating of training program was surveyed and assessed the compatibility of training programs with social needs and corporate recruiters.
– The appropriateness of the training program c2.2: The rationality of the training program was surveyed and assessed the reasonableness of the time distribution between the number of theoretical and practical periods of the subjects.
– The flexibility of the training program c2.3: The flexibility of the training program was surveyed and assessed the flexibility of the training program structure, in which the modules of the program can be swapped in time reasonably.
– The responsiveness of the training program c2.4: The responsiveness of the training program was surveyed and assessed the responsiveness of the training program’s content to the announced training objectives and output standards.
3.2.2. Teaching activities
The area of teaching activities d2 surveyed is one of the core activities of a university. Therefore, the proposed evaluation criteria are designed in accordance with university teaching lecturer. The criteria surveyed in the content of teaching activities include the following:
Evaluation of teaching activities p3 :
– Lecturer qualifications c3.1: The criteria for the qualifications of the lecturers are used to survey and evaluate the lecturers’ deep understanding of the modules and/or courses in charge of teaching.
– Combination of theory and practice c3.2: The criteria for the combination of theory and practice are used to survey and evaluate the ability of teachers to combine theory and practice when teaching.
– The impartation ability of lecturers c3.3: The criteria of lecturer ability to communicate is used to survey and evaluate the ability of lecturers to convey the content of the lecture to students. The more knowledge students receive from the lecturer, the better the lecturer’s ability to communicate is assessed.
– Lecturer’s method of communication c3.4: The criteria for the lecturer’s method of communication is used to survey and evaluate the function of the lecturer, in which the teaching method of the lecturer is required by the content of the lecture and the existing knowledge of the students. The more appropriate the teacher’s method of communication is with the lecture and the existing knowledge of the students, the better the criteria for the method of communication will be evaluated.
– Study adviser c3.5: The criteria for academic counseling are used to survey the lecturer’s sense of responsibility for providing academic counseling to students, helping students achieve more effective, and improved learning results. Lecturers implement good counseling and regular counseling for students in learning, leading to more effective and improved student learning outcomes, so the criteria for academic counseling are highly appreciated.
3.2.3. Competence of new graduates
Competence areas of new graduates d3 are the results of the university training. The quality training of the university is assessed by the knowledge, skills, and attitudes of students when starting to integrate into the community of professionals and the community of people in the same agency.
Knowledge criteria p4 :
– The new graduates’ knowledge c4.1: The criteria for new graduates’ knowledge are used to survey the knowledge of professional theory and practical skills in the professional area of new graduates.
– The knowledge of students after three years graduation c4.2: The criteria for knowledge of students after three years of graduation are used to investigate the creative application of professional theory and practical skills to practical work at enterprises. This criterion is evaluated by comparing the knowledge and skills of students after three years of graduation, who must meet the requirements of proficient making and guiding others, this is a requirement for people working in specialized area.
The criteria of skills p5 :
– Foreign language skill c5.1: The criteria for foreign language skills are used to survey the ability of new graduates to use foreign languages in professional work and communication.
– Information technology skills c5.2: The criteria for Information Technology (IT) skill are used to survey the ability of applying IT in their work of new graduates. The more flexible students use IT in their work, the higher the criteria for IT skills are appreciated.
– Communication skills c5.3: The criteria for communication skills are the skills and distinct attitudes of each person in communicating with others. The criteria of communication skills are used to survey the ability and attitude of fresh graduates in communicating with colleagues and administrators in a practical working environment.
– Job search skills c5.4: The criteria for job search skills are used to survey the ability of new graduates to find jobs. Job search skills are related to the flexibility and acumen of new graduates in job search activities in order to find a job that is suitable for their expertise, good working environment, and good salary.
– Critical thinking skills c5.5: The criteria of critical thinking skills are used to survey the thinking ability of newly graduated students in problem analysis for commenting, remarking and evaluation leading to increase the operational efficiency of enterprises.
– Creative thinking skills c5.6: The criteria for creative thinking skills are used to survey the thinking ability of newly graduated students to discover new problems of students in actual activities of enterprises. Creative thinking involves individual skills and is enhanced on the basis of knowledge and skills learned in school.
– Economic thinking skills c5.7: The criteria of economic thinking skills are used to survey the interest of new graduates about the economic efficiency of enterprises in the jobs performed at enterprises.
– Job handling skills c5.8: The criteria for job handling skills are used to survey the capacity of new graduates in solving arising problems in order to bring benefits to the enterprises.
– Teamwork skills c5.9: The criteria for teamwork skills are used to survey the attitudes of new graduates in cooperation and sharing experiences with colleagues to complete the tasks of the enterprise with the highest possible efficiency.
Motivation, confidence and responsibility criteria p6 :
– Motivation c6.1: The motivation of each student plays a very important role throughout the learning process at the university. The motivation criteria are used to survey the spirit of hard work, eagerness to learn, and each student must have a sense of responsibility for the future career they have chosen to study.
– Confidence c6.2: The confidence is a personality of each person. The confidence criteria are used to survey the attitude of new graduates who fully meet relevant activities in the enterprise.
– The sense of responsibility c6.3: The sense of responsibility is also a personality of each person. The responsibility criteria are used to survey students’ responsibilities in learning, researching, self-study, participating in school, community, business, and social activities.
3.2.4. Prestige of the university to society
The area of prestige of the d4 university is related to the role and position of the university in society and is influenced by the objectives, mission, and vision of the university as announced. The higher the prestige of the university, the greater the competitive advantage of the degree awarded by the university. The higher the university’s prestige, the easier it is for employers to accept a university’s degree, meaning that its graduates are more likely to get a job.
The advantage of competition, employment, and cooperation with partners p7 :
– The advantage of competition of the degree c7.1: The university’s competitive advantage is related to the value on which it has built its teaching and scientific research activities as well as its contributions to society. The criteria for competitive advantage are used to survey the position of the university in society, for employers, for affiliated schools, etc.
– Employment waiting period c7.2: Employment waiting period is related to the time which is necessary for a new graduate to find a job, from graduation to full employment. This time depends on the prestige of the school and the flexibility and responsiveness of new graduates. The criteria for employment waiting period are used to survey the difficulty level of new graduates in applying for jobs. The shorter the student’s employment waiting period, the higher the university’s prestige and position with employers, partners, and society, contributing to the value of the school’s educational quality.
– Coordination between universities and partners c7.3: The criteria for coordination between universities and partners are used to survey the degree of relationship between the university and partners from many related areas, the level of this relationship has an impact on the job prospects of graduate students.
3.3. Survey dataset structures
3.3.1. Human variable
Human variable is individuals who work within the university and those outside the university who jointly influence the training program and the overall value of the university invited by the university for an assessment of the university’s performance. The humans of university are classified into human classes according to the nature of the relations between the human and university. The survey dataset is classified into five human classes, including lecturer object class, school administrative management object class, school educational administration object class, student and alumni object class of the university, and the university business partner object class. The mathematically encoded human classes are Hm|m=1,2,3,4,5.
H1: The lecturer object class of the training programme, including N1≡Nm members.
H2: The administrative management object class of university, including N2≡Nm members.
H3: The educational administration object class of university, including N3≡Nm members.
H4: The student and alumni object class of university, including N4≡Nm members.
H5: The business partner object class of university, including N5≡Nm members.
3.3.2. Criteria variable
The criteria variable is one of the various activity aspects of the university that is related to the training quality of a training program. In the above criteria, for evaluating, the teaching and learning activities of the university were presented. This number of criteria can be added if more detailed assessment is required. These criteria have been grouped into groups of criteria according to each area to evaluate the quality of the training program, and group of criteria according to a parent criterion that has many child criteria. Criteria variables are mathematically coded as a set of criteria or a set of areas C≡{ck.j|k=1,2,3,…;j=1,2,3,…}≡{Dk|k=1,2,3,…}, in which, each area Dk is a set of many criteria ck.j. In which, each parent criterion is a set of several child criteria cj.
3.3.3. Temporal variable
Temporal variable is a set of time units, notations T≡{ti|i=1,2,3,4,5,…}, in which ti is each time unit of an academic year. This survey is carried out 5-time units T={t1,t2,t3,t4,t5}={16−17,17−18,18−19,19−20,20−21}.
3.3.4. Survey variable
Survey variable is a set of values rnmm.x.k.j, each value given by a survey human hnm|nm=1,2,3,…,Nm;m=1,2,3,4,5 for parent criteria pk|k=1,2,3,… there are child criteria cj|j=1,2,3,… according to each area dx|x=1,2,3,… on the evaluation sheet is shown in the form ck.j|k=1,2,3,….; j=1,2,3,… Each value rnmm.x.k.j of the survey variable is assigned a value of the set of scores S≡{sδ|δ=1,2,3,4,5}≡{1,2,3,4,5} by human hnm. Here, score sδ=s5≡5 means best rating and score sδ=s1≡1 means worst rating.
3.3.5. Evaluative variable
Evaluative variable is the number of humans of human class Hm|m =1, 2, 3, 4, 5 for criteria ck.j|k=1,2,3,…; j=1,2,3,… same score sδ|δ =1, 2, 3, 4, 5 as qδm.x.k.j(ti). The below data in Table 1 illustrates that lecturer objects of human class H1 for criterion c1.3 have the same score as q51.1.1.3=25 of school year 19–20. The evaluative variable R is defined as a set of values evaluated by a feature class against a parent criterion and certain child criteria or a certain area. The Rm.x.k.j(ti) value of the R evaluator is calculated from the value rnmm.x.k.j(ti) or from the value qδm.x.k.j(ti) together with corresponding score sδ.
H1 | H2 | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
δ | 1 | 2 | 3 | 4 | 5 | R1.x.k.j | 1 | 2 | 3 | 4 | 5 | R2.x.k.j | |||||
x | dx | k | pk | j | cj | qδ1.x.k.j | M | qδ2.x.k.j | M | ||||||||
1 | d1 | 1 | p1 | 1 | c1.1 | 1 | 5 | 12 | 7 | 21 | 3.91 | 19 | 55 | 2 | 11 | 5 | 2.22 |
2 | c1.2 | 1 | 7 | 7 | 10 | 21 | 3.93 | 14 | 64 | 2 | 11 | 1 | 2.14 | ||||
3 | c1.3 | 1 | 3 | 10 | 7 | 25 | 4.13 | 26 | 55 | 2 | 6 | 3 | 1.97 | ||||
2 | p2 | 1 | c2.1 | 2 | 3 | 10 | 23 | 8 | 3.7 | 2 | 8 | 22 | 55 | 5 | 3.58 | ||
2 | c2.2 | 2 | 5 | 8 | 23 | 8 | 3.65 | 2 | 12 | 14 | 60 | 4 | 3.57 | ||||
3 | c2.3 | 2 | 13 | 8 | 11 | 12 | 3.39 | 2 | 6 | 25 | 56 | 3 | 3.57 | ||||
4 | c2.4 | 2 | 3 | 10 | 23 | 8 | 3.7 | 3 | 2 | 36 | 46 | 5 | 3.52 | ||||
2 | d2 | 3 | p3 | 1 | c3.1 | 3 | 5 | 8 | 20 | 10 | 3.63 | 2 | 5 | 10 | 55 | 20 | 3.93 |
2 | c3.2 | 2 | 11 | 10 | 15 | 8 | 3.35 | 2 | 4 | 11 | 63 | 12 | 3.86 | ||||
3 | c3.3 | 2 | 3 | 11 | 18 | 12 | 3.76 | 3 | 3 | 5 | 55 | 26 | 4.07 | ||||
4 | c3.4 | 2 | 3 | 10 | 23 | 8 | 3.7 | 3 | 5 | 2 | 46 | 36 | 4.16 | ||||
5 | c3.5 | 2 | 8 | 8 | 18 | 10 | 3.57 | 3 | 12 | 5 | 52 | 20 | 3.8 | ||||
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
H3 | H4 | ||||||||||||||||
δ | 1 | 2 | 3 | 4 | 5 | R3.x.k.j | 1 | 2 | 3 | 4 | 5 | R4.x.k.j | |||||
x | dx | k | pk | j | cj | qδ3.x.k.j | M | qδ4.x.k.j | M | ||||||||
1 | d1 | 1 | p1 | 1 | c1.1 | 4 | 12 | 3 | 10 | 4 | 2.94 | 2 | 19 | 64 | 20 | 5 | 3.06 |
2 | c1.2 | 3 | 15 | 3 | 7 | 5 | 2.88 | 2 | 18 | 63 | 22 | 5 | 3.09 | ||||
3 | c1.3 | 2 | 8 | 4 | 15 | 4 | 3.33 | 2 | 29 | 55 | 18 | 6 | 2.97 | ||||
2 | p2 | 1 | c2.1 | 4 | 3 | 12 | 3 | 11 | 3.42 | 2 | 24 | 56 | 23 | 5 | 3.05 | ||
2 | c2.2 | 2 | 5 | 12 | 3 | 11 | 3.48 | 2 | 14 | 60 | 30 | 4 | 3.18 | ||||
3 | c2.3 | 4 | 7 | 10 | 3 | 9 | 3.18 | 2 | 20 | 56 | 29 | 3 | 3.1 | ||||
4 | c2.4 | 4 | 5 | 11 | 3 | 10 | 3.3 | 3 | 31 | 46 | 25 | 5 | 2.98 | ||||
2 | d2 | 3 | p3 | 1 | c3.1 | 1 | 5 | 13 | 8 | 6 | 3.39 | 2 | 5 | 20 | 55 | 28 | 3.93 |
2 | c3.2 | 2 | 6 | 9 | 7 | 9 | 3.45 | 2 | 4 | 12 | 63 | 29 | 4.03 | ||||
3 | c3.3 | 2 | 10 | 5 | 8 | 8 | 3.3 | 3 | 3 | 21 | 55 | 28 | 3.93 | ||||
4 | c3.4 | 1 | 5 | 5 | 12 | 10 | 3.76 | 3 | 5 | 36 | 46 | 20 | 3.68 | ||||
5 | c3.5 | 2 | 7 | 7 | 8 | 9 | 3.45 | 3 | 12 | 20 | 49 | 26 | 3.75 | ||||
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
3.4. The relations between data variables
Mathematically, exploring the relations between variables helps us to know that the structural variables of datasets have independent and dependent relations.2,3,17,18 The independent variable varies independently itself without having to depend on the changes of other variables, the independent variable also is referenced by other variables. A dependent variable is a variable whose value depends on the value of one or more other variables. The dependent variable can also be referenced by one or more other variables. From here, the paper defines the evaluation variable R that depends on the human variable H, the score variable S, the criterion variable C, and the time variable T.
• |
H×S×C×T→R,(1) | ||||||||||||||||||||||
• |
{hm}×{sδ}×{ck.j}×{ti}→{Rm.k.j(ti)}|m=1,2,3,4,5;k=1,2,…;δ=1,2,3,4,5;j=1,2,…;i=1,2,…,(1.1a) | ||||||||||||||||||||||
• |
{h1m,p2m,…,hNmm}×{sδ}×{ck.j}×{ti}→{Rm.k.j(ti)}|m=1,2,3,4,5;k=1,2,…;δ=1,2,3,4,5;j=1,2,…;i=1,2,…,(1.1b) | ||||||||||||||||||||||
• |
{hnmm}×{sδ}×{ck.j}×{ti}→{rnmm.k.j(ti)|m=1,2,3,4,5;k=1,2,…;δ=1,2,3,4,5;j=1,2,…;i=1,2,…;nm=1,2,…,Nm}.(1.1c)
|
Arithmetic average values of the set of scores
For each training program-related area, the evaluation variable R depends on the human variable H, the score variable S, the area variable D, and the time variable T. In which, the area of evaluation dk for the program includes the training program, teaching and learning activities, the capacity of the new graduates, and the prestige of the university.
• |
H×S×D×T→R,(2) | ||||
• |
{hm×{sδ}×{dx}×{ti}→Rm.x(ti)}|m=1,2,3,4,5;δ=1,2,3,4,5;x=1,2,3,4,…;i=1,2,3,4,5,⋯.(2.1a) |
The evaluation value of each human of the human class Hm for the area dx at the time unit ti is specifically calculated as follows:
• |
(hnmm,sδ,dx,ti)→Rnmm.x(ti),(2.1b) | ||||||||||||||||
• |
Rm.x(ti)=1JxJx∑j∑5δ=1δ.qδm.k.j(ti)∑5δ=1qδm.k.j(ti)=1Jx.NmJx∑jNm∑nm=1rδ.nmm.x.j(ti).(2.1c) | ||||||||||||||||
• | Here
|
4. Analyzing Visually the Survey Datasets
4.1. Non-spatial and temporal multidimensional model
This paper focuses on analyzing survey datasets through the survey dataset analysis process to represent and to analyze survey dataset by visual methods. The nSTM was chosen to represent the survey datasets, and the value bar chart was chosen to represent the values of the data variables because of its visual characteristics, which has high efficiency.3,18,19,20 The method of viewing–thinking graphs representing visually the survey datasets is applied in seminars on accrediting the university’s education quality. There, participating members view visual graphs that visually represent the survey datasets to jointly answer pre-compiled data visual analysis questions or questions that arise while members viewing–thinking visual graphs.
The survey datasets are the set of point values {1,2,3,4,5} given by each human according to each criterion and each school year, each point value of the survey data variable refers to the human for the score, the assessment criteria, and the year of the university being assessed. The survey data variable is converted into an evaluative data variable for visual representation using arithmetic average operation. The evaluative variable depends on the human variable, the criterion variable or the area variable, and the time variable. Therefore, a visual graph is used to illustrate the viewing–thinking method to visually analyze the educational quality of the training program, applied in conferences on educational accreditation of the university is very important and urgent today. From the above analysis, this paper proposes nSTM that visually represents the survey datasets on the evaluative variable according to the following criteria (see Fig. 1).

Fig. 1. Models (nSTM) of value representation Rm.k.j(ti): Model (a) represents the value Rm.x.k.j(ti) each area of the parent criteria with the child criteria; Model (b) represents the value Rm.j(ti) by a human class; Model (c) represents the value Rm.k.j(ti) for a school year; Model (d) represents the value Rm.k.j(ti) for a criterion; Model (e) represents the value Rm.k.j(ti) for a parent criterion with child criteria.
Based on Table 1 above, the value of Rm.x.k.j(ti) is related to the human class evaluating the criteria, the score of the survey variable, the criterion, and the time unit, in which, the time unit is called the school years. (hm,sδ,ck.j,ti)→Rm.k.j(ti) (1), where the values Rm.k.j(ti) are represented on the model visually as shown in Fig. 1. Figure 1(a) shows the value of Rm.x.k.j(ti) rated each area by each human class for each school year; The Q plane parallel to the (R,T) plane moving along the H axis represents the change of views of the human classes when evaluating each area for the criteria for many years of study. Figure 1(b) shows the Rm.j(ti) value of a pre-specified school year evaluated by each feature human class Hm|m=1,2,3,4,5 for each criterion cj|j=1,2,3,…; the Q plane parallel to the (R,H) plane moving along the C axis shows different views of different human classes when evaluating criteria cj|j=1,2,3,…. Figure 1(c) shows the value of Rm.j(ti) evaluated by a human class for each criterion cj|j=1,2,3,… of a school year; the Q plane parallel to the (R,T) plane moving along the C axis represents the change of perspective of a human class when evaluating the cj|j=1,2,3,… for many years of study or the change in education quality for each criterion over many years of study. Figure 1(d) shows the value Rm.k.j(ti) that evaluates a parent criterion with child criteria ck.j|k=1,2,3,…;j=1,2,3,… by each human class for each school year; the Q plane parallel to the (R,T) plane moving along the H axis represents the change of opinion of a human class when evaluating a parent criterion with child criteria ck.j|k=1,2,3,…;j=1,2,3,… for many years of study. Figure 1(e) shows the value of Rm.k.j(ti) that evaluates a parent criterion along with the child criteria ck.j|k=1,2,3,…;j=1,2,3,… by each human class for each school year; the Q plane parallel to the (R,T) plane moving along the H axis represents the change of opinion of the human classes when evaluating the parent criteria together with the child criteria for many university years or the change in educational quality for the criteria over many years of schooling.
4.2. Experiments
4.2.1. Process of visual representation of survey datasets
The education quality survey dataset of a university is collected by surveying tables consisting of many questions, in which one question corresponds to one criterion, in which each survey object scores according to each criterion on a scale of {1,2,3,4,5}, a score of 5 is the best, and a score of 1 is the worst. The points in the questionnaire are stored in the computer for visual representation and data analysis according to the following steps.
Step 1: Determine the survey variable. Group the survey tables according to each human class and design the survey dataset to restructure as a data table and count the number of humans of each human class given the same score for each criterion qsδm.k.j.
Step 2: Establish the evaluation variable. Transform the survey variable into an evaluative variable by choosing an arithmetic average transformation algorithm to calculate the value Rm.k.j(ti).
Step 3: Design 3D models. Design 3D models for representing visual graph with the value Rm.x.k.j(ti) referring to hm, dx, pk, cj, and ti.
Step 4: Prepare analytical questionnaires. Set up an analytical questionnaire consisting of elementary questions, overall questions, and relational questions with clear assumptions and conclusions in each question.
Step 5: Analyze data. Look at the visual graphs on the models of 3D to understand the meaning of the educational accreditation of the training program and to identify the correlation between the Rm.x.k.j(ti) values by answering the questions of data visual analysis.
4.2.2. Question for visual analytics of survey datasets
Visually analyzing data means answering data analysis questions. These questions can be prepared before the user begins viewing–thinking visual graph or arise during viewing visual graph and thinking about data. Each question consists of two parts, a hypothesis and a conclusion, in which the hypothesis is related to the data. Due to the different nature of the hypothetical part of the questions, the questions are classified into three categories as follows: Elementary analysis questions, overall analysis questions, and relation analysis questions.2,3,17,18 In which, the elementary level question has assumptions related to each value of each data variable, the overall question has assumptions related to many values of a variable, and the relational question has related assumptions to the correlation between the data variables. Following are some questions related to the teaching and learning survey dataset.
Elementary level analysis questions:
Question 1: How did the business objects, which are the school’s partners, evaluate the school year 16–17, 17–18, 18–19, 19–20, or 20–21 training program? (see Fig. 2).
Question 2: How did the business object {h5} evaluates the capacity of new graduates through the school years 16–17, 17–18, 18–19, 19–20, and 20–21? (see Fig. 3).
Global level analysis questions:
Question 3: How did each object evaluates teaching activities according to each school year 16–17, 17–18, 18–19, 19–20, and 20–21? (see Fig. 4).
Question 4: Parent criterion p1 has child criteria c1.1,c1.2,c1.3 how is evaluated by objects h1, h2, h3, h4, h5 according to each school year 16–17, 17–18, 18–19, 19–20, and 20–21? (see Fig. 5).
Questions to analyze the relationship between variables:
Question 5: Based on Fig. 2, how is the training program of the school years evaluated by the objects {h1,h2,h3,h4,h5}? What’s the difference? Why different if yes, why not different?
– Answer: The analysts who view–think the visual graph (as shown in Fig. 2) can answer that the objects rated for the training program are increasing.
Question 6: Based on Fig. 3, compare the evaluation of the humans {h1,h2,h3,h4,h5} on the student’s ability over each academic year? What’s the difference? What does that difference mean?
– Answer: The analysts who view–think the visual graph (as shown in Fig. 3) can answer that the humans in the school {h1,h2,h3} evaluate increase points, and the business objects {h5} evaluate reduced points according to each university year. This shows that the training program, teaching quality of the university need to be adjusted and changed to be more suitable for students and the actual situation and needs of society.

Fig. 2. Humans h1,h2,h3,h4,h5 evaluate for the area of training programme for the school year 16–17, 17–18, 18–19, 19–20, and 20–21.

Fig. 3. Humans h1,h2,h3,h5 evaluate the competence areas of new graduates for each school year 16–17, 17–18, 18–19, 19–20, and 20–21.

Fig. 4. Humans h1,h2,h3,h4,h5 evaluate the area of teaching activities according to each school year 16–17, 17–18, 18–19, 19–20, and 20–21.

Fig. 5. Humans h1,h2,h3,h4,h5 evaluate the parent criterion p1 with child criteria c1.1,c1.2,c1.3 according to each school year 16–17, 17–18, 18–19, 19–20, and 20–21.
4.2.3. The case of surveying a multivariate dataset
In actual deploy, the process of visual representation and analysis is applied to the survey dataset of a university in the school years 16–17, 17–18, 18–19, 19–20, and 20–21. This paper presents models representing survey datasets in many different cases in knowledge discovery from multivariate datasets. In this section, Fig. 5 is presented as an illustration for the educational quality assessment survey that the humans {h1,h2,h3,h4,h5} evaluate the parent criterion p1 with the child criterion c1.2 according to school years 16–17, 17–18, 18–19, 19–20, and 20–21 are illustrated in general form as shown in Fig. 6.

Fig. 6. An instance of the visual graph representing Rm.1.2(ti) using the arithmetic average for the case m=1,2,3,4,5 and k.j=1.2 for a portion of the survey dataset observed in the school years ti={16−17,17−18,18−19,19−20,and 20−21}.
The analysts who observe Fig. 6 representing a value Rm.k.j(ti) of the evaluation variable on the model (R,H,T) representing the response criteria of the training programme with c1.2 criterion. The values Rm.k.j(ti) display as value bars at positions with coordinates in the model (hm,ti)|m=1,2,3,4;i=1,2,3,4,5 with elevation proportional to the values Rm.1.2(ti)|m=1,2,3,4;k=1,2,3,…;i=1,2,3,4,5. At the university’s education quality meetings, all models representing the survey dataset are presented so that all members of the panel can be seen understood and jointly evaluated to find out strengths, the weakness of the training program to form policies to improve or enhance the university’s educational quality.
In addition, the criterion c1.2 is represented by a visual graph (see Fig. 6) which has attracted the attention of the university’s direct administrators. Here, the responsiveness of the training program is reduced from school year 16–17 to 18–19 in the order of assessment of each human class {h1,h2,h3,h4,h5} while from school year 19–20 to 20–21 is again incremented by these {h1,h2,h3,h4,h5} human classes. Thus, the question posed to the senior administrators of the university at these discussions is why these target groups have assessed the responsiveness of the training program to be lower than in previous years and how the policy to improve should be.
5. Conclusions and Future Work
This paper reviewed the researches related to data visualization, data visual analytics, and made new comments and suggestions. This paper focused on visual analytics of datasets of the quality survey of a university’s training program for use in conferences on accrediting the university’s educational quality. We studied the characteristics of the survey dataset structure and built a process to analyze the survey dataset about the educational quality of a curriculum at a university to establish the set of evaluation data variables to analyze the relations of the evaluative data variable with the evaluative object variable, the evaluation criterion variable, and the temporal variable, to establish visual graphs representing the dataset. This paper builds nSTM to represent a visual graph with bars indicating the value of data variables. This research was deployed to transform all survey datasets for school years 16–17, 17–18, 18–19, 19–20, and 20–21 into visual graphs. In addition, this research can also be extended towards forming a visual analytics model of data related to university education quality in order to support leaders at all levels to make policies to improve and enhance the quality of university education in the future.