kuekawa – Page 27 – My Statistical tools

November 18, 2018November 18, 2018

MySQL's Localhost connection

When I try to open my php file located in my PC (e.g., localhost/kaz/indexpage.php, I get the following warning message. The message says that my connection "is not private."

November 18, 2018November 18, 2018

MySQL database character set and collation

The default character set is latin1_swedish_ci . I will use utf8_general_ci instead since it seems to be the most up-to-date character set and the article I found (see reference) suggests we do. However, I have never encountered an issue just using latin1_swedish_ci for Japanese materials. My questions are:

Can I change it back to other options later?
What does "ci" mean?
There are many types of utf8 in the option. Is utf8_general_ci really the good option?

Reference:

https://mediatemple.net/community/products/dv/204403914/default-mysql-character-set-and-collation

November 16, 2018November 20, 2018

SQL basics

I need quotes when specifying a date value.

SELECT *
FROM purchases
WHERE purchased_at <= "2018-11-01";

Pick up rows whose values contain "pudding":

SELECT *
FROM purchases
WHERE name like "%pudding%";

You can use NOT:

SELECT *
FROM purchases
WHERE NOT character_name="Ken";

Another example of NOT:

SELECT *
FROM purchases
WHERE NOT name like "%pudding%";

SELECT *
FROM purchases
WHERE price IS NOT NULL;

Ordering observations

SELECT *
FROM purchases
WHERE character_name = "Ken"
ORDER BY price DESC;

NOT

SELECT *
FROM purchases
WHERE NOT character_name = "Ken";

SELECT *
FROM purchases
WHERE price IS NULL;

SELECT *
FROM purchases
ORDER BY price DESC
LIMIT 5;

Counting the row of observations

This counts non-missing values.

SELECT COUNT(name)
FROM purchases;

This counts the number of rows.

SELECT COUNT(*)
FROM purchases;

Using the where statement:

SELECT COUNT(*)
FROM purchases
WHERE character_name="Ken"
;

Picking up the observation whose value is a maximum value

SELECT name, max(price)
FROM purchases
WHERE character_name="Ken"
;

GET THE SUM PER GROUP

SELECT SUM(price), purchased_at
FROM purchases
GROUP BY purchased_at
;

SELECT COUNT(*), purchased_at
FROM purchases
GROUP BY purchased_at;

SELECT SUM(price), purchased_at
FROM purchases
WHERE character_name="Ken"
GROUP BY purchased_at

;

SELECT SUM(price), purchased_at
FROM purchases
GROUP BY purchased_at
HAVING sum(price) > 20;

This picks up cases with a condition (in this case, whoever had a higher score than the guy Will).

SELECT name
FROM players
WHERE goals > (
-- Write an SQL statement below to get Will's score
SELECT goals
FROM players
WHERE name = "Will"
)
;

SELECT name,goals
FROM players
WHERE goals > (
SELECT AVG(goals)
FROM players
)
;

Using AS:

SELECT name AS "180 cm or taller"
FROM players
WHERE height >= 180
;

SELECT SUM(goals) AS "total team score"
FROM players
;

SELECT *
FROM countries
WHERE rank < (
SELECT rank
FROM countries
WHERE name="Japan"
)
;

Merging two tables:

SELECT *
FROM players
-- Add a name to the combined table
JOIN countries
-- Add a join condition
ON players.country_id = countries.id
;

SELECT players.name, countries.name
FROM players
JOIN countries
ON players.country_id = countries.id
;

SELECT countries.name, SUM(goals)
FROM players
JOIN countries
ON players.country_id = countries.id
GROUP BY countries.name
;

SELECT *
FROM players
JOIN teams
ON players.previous_team_id = teams.id
;

SELECT players.name AS "player name", teams.name AS "team (last year)"
FROM players
JOIN teams
ON players.previous_team_id = teams.id
;

SELECT *
FROM players
LEFT JOIN teams
ON players.previous_team_id =teams.id
;

SELECT players.name AS "player name", teams.name AS "team (last year)"
FROM players
LEFT JOIN teams
ON players.previous_team_id = teams.id
;

SELECT *
FROM players
JOIN countries
ON players.country_id = countries.id
LEFT JOIN teams
ON players.previous_team_id = teams.id
;

SELECT players.name AS "Player name", players.height AS "height"
FROM players
WHERE height > (
SELECT AVG(height)
FROM players
)

select name, price
from items
order by price desc
;

-- get all rows that contain the string "shirt"
SELECT *
FROM items
WHERE name like "%shirt%"
;

SELECT name, price, MAX(price - cost)
FROM items;

SELECT name, price
FROM items
WHERE price > (
SELECT price
FROM items
WHERE name = "grey hoodie"
);

November 16, 2018

Is Multicollinearity the Bogeyman?

Is Multicollinearity the Bogeyman?

November 16, 2018November 16, 2018

The words that are confusing to me

Miscellaneous (I cannot spell)
ventriloquist (I can't remember)
drawer (DROR; this finally I remembered)
multicollinear (I can't spell)

November 16, 2018November 16, 2018

Ascending and decending

In programming, sorting can occur in ascending way or ascending way. I often get confused by this distinction when I use SAS PROC SORT. To summarize:

Sorting by ascending order means:
1
2
3
4
5

Sorting by descending order means:
5
4
3
2
1

PROC SORT:

The following is an example of how descending can be specified. The first SORT procedure sorts the data by first DateModified by natural sequence and TimeModified by the descending order. This means that older data (defined by TimeModified) in the presence of duplicate rows (the same date) will appear first. The second SORT procedure has the nodupkey option, which means that only the first and thus oldest data will be kept and the rest are deleted if the data came from the same date.

proc sort;by Table1_ID child_number DateModified descending TimeModified;
run;
proc sort nodupkey;by Table1_ID child_number;
run;

PROC LOGISTIC

It is common to code the binary outcome as 0 (failure) and 1 (success); however, PROC LOGISTIC models the occurrence of 1 not 0.

November 16, 2018November 16, 2018

MS-ACCESS: How to find a table in the relationship window

The following did not work well. When I find a solution, I will update this post.

***

When I removed a column from a table, ACCESS gave me a message saying I first needed to remove the link(s) from the relationship window.

This means that a column I wanted to delete had an existing relationship (or relationshipS) with another table(s). The following is the summary of what I did.

I have to break these link(s) first. The problem is when there are a lot of tables in the relationship window, relevant tables and links between them are hard to locate.

I suspect in some cases some tables are hidden for an unknown reason.

Pull the table away from other tables, so you can clearly see the links themselves. Click each one of them to see with which table the table is linked. You will eventually find the table you are looking for. Remove the links to break the relationships. Then you can delete the columns you want to delete.

Up to this point, I thought the problem was solved, but I keep getting the message, "Enter Parameter Value." "Birth month" as it appears in the message graphic is one of the two variables/columns I deleted. I got stuck.