Projects:2020s2-7511 SQL Database for Experimental Metadata
Abstract here
Contents
Introduction
In earlier times, all the metadata about each experiment would be kept in laboratory notebooks. This method is reaching the end of its useful lifetime. There is too much data, and physical notebooks are not searchable by machine. If a later researcher studies the data, how are they going to know what it means? What sort of experiment was carried out? Who did the work? Where was it done? What were the objectives of the experiment? What code should be executed in order to process the data? The purpose of this project is to develop a relational database, which can be interrogated using Structures Query Language (SQL). The data will need to be harvested, re-formatted and checked using a scripting language. We are proposing the use of the Python3 programming language. Statistical post-processing of the data can be carried out in several languages, including MATLAB, Python3, or the “R” programming language. There are number of Open-Source SQL data base packages for the Linux operating system, including Bee Keeper, Libre Office BASE, and Keri. Python3 will run under Linux. The early parts of the project would involve programming in a Linux environment. In order to make the work realistic, students will measure a series of RC ladder circuits, using a Picoscope. This will generate large amounts of accurate sampled data, which will then need to be classified and processed. The experimental part of the project is safe and could be carried out off campus, in Adelaide. Remote students, outside of Adelaide, would have to concentrate on the software aspects of the project
Project team
Project students
- Ruoyun Zhou
- Zeyu Fu
- Junwen Zheng
Supervisors
- Dr Andrew Allison
Advisors
Objectives
Set of objectives
Background
Electrical principles used
There are three laws are used in this project, Ohm’s Law, Kirchhoff’s Current Law and Kirchhoff’s Voltage Law respectively. Ohm’s Law is named for physicist Georg Ohm(1789-1854) from Germany. It aims to calculate the relationship between current, resistance and voltage in an electrical circuit. Voltage is the pressure that triggers flow of electrons, its unit is volt which is always abbreviated as (V). While current is the rate of electron flow, its unit is ampere or amp (A). The last variable is resistance which is the flow inhibitor, note that resistance is a constant as it is measured from the resistor, it does not change according to the voltage and current, resistor unit is ohm(). The formula of Ohm’s Law is voltage = current * resistance, which can be formulated as V=I*R in mathematics. From the formula if two variables values are known then the third one can be calculated by applying the mathematical formula. Ohm’s Law is undoubtably the easiest tool for circuit analysis, however, voltages and currents will not be easily obtained when it comes to analyse complex circuits such as T or bridge networks, therefore, Kirchhoff’s Circuit Law comes in place for the calculation. Kirchhoff’s Circuit Law is developed by a German physicist Gustav Kirchhoff back in 1845 where it consists of two law, KCL and KVL. Kirchhoff’s Current Law is used to cope with the current flows around a closed circuit while Kirchhoff’s Voltage Law is used to cope with the voltage sources present in a closed circuit. Kirchhoff’s Current Law is also referred as Kirchhoff’s First Law and it claims that “total current charge entering a junction or node is exactly equal to the charge leaving the node as it has no other place to go except to leave, as no charge is lost within the node”. The previous sentence can be summarised in a mathematical form as: I_exiting+I_entering=0. This Kirchhoff’s Current Law is known as Conservation of Charge. Note that the electrical terminology node refers to a junction of two or more current carrying paths or elements, e.g. components and cables. Kirchhoff’s Current Law will be applicable for parallel circuits analysis only if a closed circuit exist. Kirchhoff’s Voltage Law is Kirchhoff’s Second Law, it claims “in any closed loop network, the total voltage around the loop is equal to the sum of all the voltage drops within the same loop” which is also equivalent to zero. It can be seen in mathematical as the sum of all voltages in the loop must equivalent to zero. This is known as Conservation of Energy. All voltage will be looping in the same direction, it will be either positive or negative and eventually returning to the original starting point. The reason why the direction is important is if it is not the summation of all voltage will not be equivalent to zero. Whenever there is a phase angle there will be power component called reactive power, it is also referred as imaginary power, it is described in a unit called “volt-amperes reactive”, (VAr). The reactive power, Var, represents the products of amperes and volts which are out-of-phase with each other instead of representing the actual power. Reactive power is the fraction of electricity which alternate current equipment in order to help to sustain and establish the electric and magnetic fields required. Phase angle or phase shift between the current and voltage will affect the amount of reactive power present in an AC circuit. Reactive power is negative when it is “consumed” and it is positive when it is “supplied”. Nowadays, reactive power is adopted by majority types of electrical equipment which uses a magnetic field. For example, transformers, motors and generators. Reactive power is indispensable when overhead power transmission lines experienced reactive losses.
Error Estimation
assume that there is a simple linear regression model with the equation as y=β_0+β_1 X+e. Where X is an independent variable and y is the dependent variable. β_0 and β_1 are the intercept term and slope parameter in the equation, they are also deemed as regression coefficients. The failure of data to lie on the straight line and represents the difference between the observed realizations and true of y is described as e which is the unobservable error component in the above equation. However, e is assumed to be observed as identically distributed and independent random variable with the constant variance σ^2 and mean zero characteristics for the reason of statistical inferences. The independent variable is considered as non-stochastic where y is deemed as a random variable with E(y)=β_0+β_1 X and var(y)=σ^2 because it is considered to be controlled by the experimenter. X can be a random variable sometimes, and in that case, the conditional mean of y given X=x will be E(y│x)=β_0+β_1 X and the conditional variance of y given X=x will be var(y|x)=σ^2 instead of simple variance of y and simple mean. The term e is unobserved and β_0,β_1,σ^2 are usually unknown. However, to determine the unknown parameters T pairs of observations (x_t,y_t )(t=1,…,T) on (X,y) are obtained. There are numerous methods of estimation can be used to confirm the estimates of the parameters, and maximum likelihood principles and least squares are the most popular methods of estimation. The Least Square Estimation can be illustrated as a sample of T sets of observations (x_t,y_t) (t=1,…,T) and based on the previous paragraph it can be written as y_t=β_0+β_1 x_t+e_t (t=1,…,T) The principle of least square is to ensure that the sum of squares of difference between the line in the scatter diagram and the observations is minimum by aiming to estimate β_0,β_1. This idea is deemed from various perspectives. The direct regression method is to obtain the estimates of β_0,β_1 at the time being where the vertical difference between the line in the scatter diagram and the observations and its sum of squares is deemed as minimized. The terms β_0,β_1 are obtained through deeming the sum of the absolute deviation of the observations from the line in the vertical direction in the scatter diagram by using the least absolute regression method.
Results
Conclusion
References
[1] a, b, c, "Simple page", In Proceedings of the Conference of Simpleness, 2010.
[2] ...